Numerical Descriptive Stats Flashcards
measure of center
value at center or middle of data set
arithmetic mean is a moc obtained by adding the data values and dividing the total by the # of values
mean advantages/disadvantages
advantage = relatively reliable: means of samples drawn from the same population don't vary much; takes every data value into account
disadvantage = sensitive to every data value, so one extreme value can affect it dramatically; not resistant
median
middle value when original data values are arranged in order of magnitude (mean of the two middle values if the # of values is even)
IS a resistant moc bc not affected much by an extreme value
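A minimal sketch of the resistance idea, using Python's standard `statistics` module. The data set is made up for illustration; adding one extreme value moves the mean a lot and the median only slightly.

```python
from statistics import mean, median

# Hypothetical data set, made up for illustration.
scores = [2, 3, 4, 5, 6]
with_outlier = scores + [100]  # same data plus one extreme value

mean_before, mean_after = mean(scores), mean(with_outlier)
median_before, median_after = median(scores), median(with_outlier)

print("mean:  ", mean_before, "->", mean_after)      # mean changes a lot
print("median:", median_before, "->", median_after)  # median barely changes
```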
mode
value that occurs with greatest frequency
can have one mode, more than one mode, or no mode
-bimodal (2 modes), multimodal (3+ modes)
only moc that can be used with nominal data
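A small sketch of finding modes, including for nominal data. The helper name `textbook_modes` and the sample lists are assumptions made for illustration; it follows the common textbook convention that a data set has no mode when no value repeats.

```python
from collections import Counter

def textbook_modes(values):
    """Return all values tied for the highest frequency.

    Returns an empty list when no value repeats (the "no mode" case).
    """
    counts = Counter(values)
    if not counts:
        return []
    top = max(counts.values())
    if top == 1:
        return []  # every value occurs once, so there is no mode
    return [value for value, count in counts.items() if count == top]

# Hypothetical nominal data: categories, so mean and median do not apply.
colors = ["red", "blue", "red", "green", "blue"]

print(textbook_modes(colors))     # two values tie, so the data is bimodal
print(textbook_modes([1, 2, 3]))  # no value repeats
print(textbook_modes([1, 1, 2]))  # a single mode
```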
skewed to left
aka negatively skewed
longer left tail
mean and median are generally to the left of the mode
skewed right
aka positively skewed
longer right tail
mean and median are generally to the right of the mode
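A quick check of the skewness pattern on made-up data. The ordering of mean, median, and mode shown here is a rule of thumb that holds for these hypothetical lists, not a guarantee for every data set.

```python
from statistics import mean, median, mode

# Hypothetical right-skewed data: most values are small, a few are large.
right_skewed = [1, 2, 2, 2, 3, 3, 4, 5, 9, 15]

# Mirror the data to get a left-skewed version with a long left tail.
left_skewed = [16 - x for x in right_skewed]

for name, data in [("right", right_skewed), ("left", left_skewed)]:
    print(name, "mode:", mode(data), "median:", median(data), "mean:", mean(data))
```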
measures of variability
range, standard deviation, variance
also called spread or dispersion (how scores differ from one another)
range
difference b/t maximum and minimum data values
very sensitive to extreme values so not as useful as other mov
measure of variation
standard deviation
mov about the mean; roughly the average amount by which values deviate from the mean
denoted s or SD
s can inc dramatically w/ inclusion of 1+ outliers
variance
mov = square of the SD
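A short sketch of the three measures of variation, using the sample formulas from Python's `statistics` module (`stdev` and `variance` divide by n - 1). The data values are made up for illustration.

```python
from statistics import stdev, variance

# Hypothetical sample data.
data = [2, 4, 4, 4, 5, 5, 7, 9]

data_range = max(data) - min(data)  # range: maximum minus minimum
s = stdev(data)                     # sample standard deviation
var = variance(data)                # sample variance, equal to s squared

# Adding a single outlier inflates the standard deviation.
s_out = stdev(data + [40])

print("range:", data_range)
print("s:", round(s, 3), "variance:", round(var, 3))
print("s with outlier:", round(s_out, 3))
```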
percentiles
the 99 percentiles (P1 to P99) divide ranked data into 100 groups with about 1% of values in each group
quartiles
divide ranked values into 4 equal parts w/ 25% in each group
Q1 sep the bottom 25% from top 75%, lower quartile
Q2 same as median so sep bottom 50% from top 50%
Q3 sep bottom 75% from top 25%, upper quartile
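A sketch of computing quartiles and percentiles with `statistics.quantiles`. The data is made up, and the function's default method is only one of several conventions; textbooks and software can give slightly different quartile values for the same data.

```python
from statistics import median, quantiles

# Hypothetical data: the integers 1 through 11.
data = list(range(1, 12))

# n=4 returns the three cut points Q1, Q2, Q3.
q1, q2, q3 = quantiles(data, n=4)

# n=100 returns the 99 percentile cut points.
percentiles = quantiles(data, n=100)

print("Q1, Q2, Q3:", q1, q2, q3)
print("Q2 equals the median:", q2 == median(data))
print("number of percentiles:", len(percentiles))
```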
motivations
measure of relative standing
show location of data values relative to others w/i data set
compare values from different data sets
z score
number of SDs that a given value is above/below the mean
when all values in a data set are converted to z-scores, the mean of the z-scores is always 0 and their SD is always 1
no units
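A minimal sketch of z-scores, computed as z = (x - mean) / SD. The data is made up and treated as a whole population, so the population SD (`pstdev`) is used; that choice is an assumption of this example.

```python
from statistics import mean, pstdev

# Hypothetical population data.
data = [2, 4, 4, 4, 5, 5, 7, 9]

mu = mean(data)       # population mean
sigma = pstdev(data)  # population standard deviation

# Each z-score says how many SDs a value lies above (+) or below (-) the mean.
z_scores = [(x - mu) / sigma for x in data]

print("z-scores:", z_scores)
print("mean of z-scores:", mean(z_scores))
print("SD of z-scores:", pstdev(z_scores))
```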
independent
probability
two events are independent if the occurrence of one does not affect the probability of the other
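A small sketch of checking independence by comparing P(B | A) with P(B), using two fair dice. The events and helper names (`prob`, `cond_prob`) are made up for illustration.

```python
from fractions import Fraction
from itertools import product

# All 36 equally likely outcomes of rolling two fair dice.
outcomes = list(product(range(1, 7), repeat=2))

def prob(event):
    """Probability of an event, given as a function outcome -> bool."""
    favorable = [o for o in outcomes if event(o)]
    return Fraction(len(favorable), len(outcomes))

def cond_prob(event, given):
    """P(event | given) = P(event and given) / P(given)."""
    return prob(lambda o: event(o) and given(o)) / prob(given)

def first_even(o):
    return o[0] % 2 == 0

def sum_is_7(o):
    return sum(o) == 7

def sum_is_8(o):
    return sum(o) == 8

# Independent: knowing the first die is even does not change P(sum is 7).
print(prob(sum_is_7), cond_prob(sum_is_7, first_even))

# Not independent: knowing the first die is even changes P(sum is 8).
print(prob(sum_is_8), cond_prob(sum_is_8, first_even))
```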