Unit 6: Measures of Variability Flashcards
- Variability, also referred to as dispersion
indicates how much
individual cases tend to deviate from what is typical in the data
If scores are very similar, little or low dispersion
homogeneous
Measures of Variability
- Range
- Interquartile Range
- (Semi-interquartile Range)
- Standard Deviation and Variance
- If scores are very different, high degree of dispersion.
heterogeneous
Range
The distance from the largest to the smallest number.
* Example: test scores: 70 83 83 83 90 90 94 95 95
* The range is calculated by subtracting the smallest value (70) from the largest
value (95): R = 25
Quartiles = Q
The 1st quartile (Q1) is the point below which 25% of the scores occur (P25).
- The 2nd quartile (Q2) is the point below which 50% of the scores occur
(P50). It is also the median. - The third quartile (Q3) is the point below which 75% of the scores occur (P75)
- These points in the distribution are arbitrary.
- Sometimes people are interested in deciles (every 10% of the distribution)
boxplot
The Violin
Plot
- Alternative to box plot
- a smoothed version of the histogram
- The violin plot places a
simple version of a
boxplot (median, box) in
the center
Measure of spread: standard deviation
The standard deviation is used to describe the variation around the mean.
Like the mean, it is not resistant to outliers
- First calculate the variance s
- Then take the square root to get
the standard deviation s.
How to Calculate the Quartiles and the Interquartile Range
types of graphs
variance s^2
how far a set of (random) numbers are
spread out from their average value
standard deviation s
describes the variation
around the mean (how spread out the
data are from the mean
Calculating the Variance
Difference between individual score and the mean
Square the differences
Sum the squared differences
Divide by number of observations – 1
Samples usually over or under estimate the amount of variation in a population
underestimate
- Dividing by n-1 instead of n is a correction for this