Chapter 1 - Exploring Data Flashcards
individuals
who or what the data is being gathered from or about
variable
what information is gathered from the individual
categorical variable
variables are placed in categories or groups
quantitative variable
variables are assigned a numeric value and can be operated on
discrete data
a series of natural numbers (ex. 1, 2, 3…)
continuous data
a set of infinite values (ex. 1 < x < 3)
relative frequency
frequency is expressed as parts of the whole (percentage)
joint relative frequency
a relative frequency that is the combination of multiple variables (calculated using the inner columns/rows of a two way table)
marginal relative frequency
a relative frequency of a single variable (calculated using the total column/row of a two way table)
explanatory variable
indicates the probability of another variable
response variable
partially controlled by an explanatory variable
mean
the sum of the values in a data set divided by the total number of values
median
the middle value in a data set, or the mean of the two middle values
mode
the most occurring value in a data set
roughly symmetric
there is a single peak in the middle of the graph
skewed left
the data peaks to the right
skewed right
the data peaks to the left
bimodal
there are two peaks
uniform
there are no peaks
gap clusters
the data is sporadic
standard deviation
measures spread, =√(Σ(x-mean)^2)/n-1
standard deviation interpreation
“the [context] typically wavers by [standard deviation] from the mean of [mean]”