Lesson 2: Data Description Flashcards
0
Q
Measures of center
A
- mean: not resistant to outliers
- median: resistant
- mode
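
A quick numerical check of the resistance claim, as a minimal sketch: the data values and the added outlier (100) are made up for illustration, and the functions come from Python's standard `statistics` module.

```python
from statistics import mean, median, mode

# Hypothetical sample, then the same sample with one extreme value added.
data = [2, 3, 3, 4, 5]
with_outlier = data + [100]

mean_clean, mean_out = mean(data), mean(with_outlier)
median_clean, median_out = median(data), median(with_outlier)
mode_clean, mode_out = mode(data), mode(with_outlier)

print("mean:  ", mean_clean, "->", mean_out)      # moves a lot
print("median:", median_clean, "->", median_out)  # barely moves
print("mode:  ", mode_clean, "->", mode_out)      # unchanged here
```

One extreme observation pulls the mean from 3.4 to 19.5, while the median only shifts from 3 to 3.5.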
1
Q
Quantitative description of data
A
- center: mean, median, mode
- spread (variability): range, quartiles (IQR), variance, standard deviation
2
Q
Measures of spread
A
- range (max - min)
- quartiles (Q1, M, Q3)
- IQR = Q3 - Q1: the spread of the middle 50% of the data
- outlier criterion: an observation is a suspected outlier if it falls more than 1.5 × IQR below Q1 or above Q3
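
The measures on this card can be computed as below. This is a sketch under one assumption: quartiles are taken as the medians of the lower and upper halves of the sorted data (other conventions give slightly different values). The function names and the data are hypothetical.

```python
from statistics import median

def quartiles(values):
    """Return (Q1, M, Q3) using the median-of-halves convention."""
    xs = sorted(values)
    n = len(xs)
    half = n // 2
    lower = xs[:half]
    upper = xs[half + n % 2:]  # skip the middle value when n is odd
    return median(lower), median(xs), median(upper)

def outliers(values):
    """Observations more than 1.5 * IQR below Q1 or above Q3."""
    q1, _, q3 = quartiles(values)
    iqr = q3 - q1
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    return [x for x in values if x < low_fence or x > high_fence]

data = [1, 3, 4, 5, 6, 7, 9, 30]
q1, m, q3 = quartiles(data)
print("range:", max(data) - min(data))
print("Q1, M, Q3:", q1, m, q3)
print("IQR:", q3 - q1)
print("suspected outliers:", outliers(data))
```

For this sample Q1 = 3.5 and Q3 = 8.0, so the IQR is 4.5 and the fences sit at -3.25 and 14.75; only 30 is flagged.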
3
Q
Boxplots
A
- show right or left-skew
- 5 number summary (min, Q1, M, Q3, max)
- central box spans quartiles
- line in box is median
- conceals outliers (the whiskers run all the way to min and max)
4
Q
Modified boxplot
A
- outliers plotted individually
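
To compare what a plain boxplot and a modified boxplot would draw, here is a small sketch that computes the pieces without plotting. It assumes the median-of-halves quartile convention and the 1.5 × IQR criterion; the function names and data are hypothetical.

```python
from statistics import median

def five_number_summary(values):
    """Return (min, Q1, M, Q3, max): what a plain boxplot draws."""
    xs = sorted(values)
    n = len(xs)
    half = n // 2
    q1 = median(xs[:half])
    q3 = median(xs[half + n % 2:])  # skip the middle value when n is odd
    return xs[0], q1, median(xs), q3, xs[-1]

def modified_boxplot_parts(values):
    """Return (low whisker end, high whisker end, points plotted individually)."""
    _, q1, _, q3, _ = five_number_summary(values)
    iqr = q3 - q1
    low_fence, high_fence = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    inside = [x for x in values if low_fence <= x <= high_fence]
    flagged = [x for x in values if x < low_fence or x > high_fence]
    return min(inside), max(inside), flagged

data = [10, 12, 13, 15, 16, 18, 19, 21, 45]
print(five_number_summary(data))
print(modified_boxplot_parts(data))
```

In the plain boxplot the upper whisker would reach 45; in the modified version it stops at 21 and 45 is drawn as a separate point.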
5
Q
Standard deviation
A
s = sqrt[ sum_i (x_i - x_mean)^2 / (n - 1) ]
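
The formula translates directly into code. The sketch below uses made-up data and a hypothetical helper name `sample_std`, and checks the result against `statistics.stdev`, which also divides by n - 1.

```python
from math import sqrt
from statistics import stdev

def sample_std(xs):
    """Sample standard deviation: sqrt of summed squared deviations over n - 1."""
    n = len(xs)
    x_mean = sum(xs) / n
    squared_deviations = [(x - x_mean) ** 2 for x in xs]
    return sqrt(sum(squared_deviations) / (n - 1))

data = [2, 4, 4, 4, 5, 5, 7, 9]  # mean is 5, squared deviations sum to 32
s = sample_std(data)
print(s)            # by hand: sqrt(32 / 7)
print(stdev(data))  # library value for comparison
```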
6
Q
Measures of spread 2
A
- standard deviation
- degrees of freedom: the deviations x_i - x_mean always sum to 0, so the last deviation is determined once the other n-1 are known; only n-1 deviations can vary freely, which is why we divide by n-1
- variance = s^2
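
The degrees-of-freedom argument can be seen on a tiny made-up sample: the deviations sum to zero, so the last one can be recovered from the other n - 1. The variable names here are illustrative only.

```python
from statistics import variance

data = [3, 7, 8, 10]
n = len(data)
x_mean = sum(data) / n                      # 7.0

deviations = [x - x_mean for x in data]     # [-4.0, 0.0, 1.0, 3.0]
total = sum(deviations)                     # always 0

# The last deviation is fixed by the first n - 1 of them.
last_from_others = -sum(deviations[:-1])

# Variance divides by the n - 1 freely varying deviations.
s2 = sum(d ** 2 for d in deviations) / (n - 1)

print(deviations, total)
print(last_from_others, deviations[-1])
print(s2, variance(data))
```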
7
Q
Effect of changing units of measure
A
- linear transformation, x -> a + bx
- effect of ‘a’: adds a to measures of center and to the quartiles, but has no effect on spread
- effect of ‘b’: multiplies measures of center by b and measures of spread (StD, IQR) by |b|
- skew and the relation between median and mean
- right-skew: mean is greater than median. CFP goes up quickly
- left-skew: mean is less than median. CFP goes up slowly
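
The effect of the linear transformation x -> a + bx on center and spread can be checked numerically. This is a sketch with made-up data and arbitrary constants a = 5 and b = 2 (then b = -2 to show that spread scales by the absolute value of b); `transform` is a hypothetical helper.

```python
from math import isclose
from statistics import mean, median, stdev

def transform(xs, a, b):
    """Apply x -> a + b*x to every observation."""
    return [a + b * x for x in xs]

data = [1, 2, 3, 4, 10]          # right-skewed: mean 4 > median 3
a, b = 5, 2

shifted = transform(data, a, 1)  # only 'a' applied
scaled = transform(data, a, b)   # both 'a' and 'b' applied
flipped = transform(data, a, -b) # negative multiplier

print(mean(data), median(data), stdev(data))
print(mean(shifted), median(shifted), stdev(shifted))  # center + a, same spread
print(mean(scaled), median(scaled), stdev(scaled))     # center a + b*x, spread * b
print(mean(flipped), median(flipped), stdev(flipped))  # spread * |b|, still positive
```

The sample itself is right-skewed (one large value), so its mean of 4 exceeds its median of 3, consistent with the skew bullets above.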