Lesson 4: Measures and Visualization of Variability Flashcards
Tail is on the left side
Negatively skewed
detailed version of range
quartiles
How to get the median?
sort the data from highest to lowest or vice versa
=MEDIAN (dataset)
Steps to create a boxplot
- highlight the range (sorted from LH or HL
- insert - box and whisker
- Add label
Tells us whether there is an outlier or none
Box plot
Relatively symmetric (description, boxplot)
- median coincides with the mean
- the boxplot is partitioned equally
- skew = certain value
Positively skewed (description, boxplot)
- skew to the right
- mean is greater than the median
- the box plot is partioned with the upper area bigger than the lower area
- many outliers are extending towards bigger value
Negatively skewed (description, boxplot)
- skewed to the left
- median is greater than the mean
- the boxplot is partitioned with the upper area lower than the lower area.
- many outliers are extending towards smaller value
To determine if it is outlier or not, we use _____________.
Interquartile range
Formula of IQR
IQR = Q3 - Q1
Note: 1.5* IQR
Get the new maximum value (use IQR)
Q3 + (1.5*IQR)
= New Max value (all data beyond the new max value is considered as an outlier)
Get the new minimum value (IQR)
Q1 - (1.5* IQR)
= new minimum value ( all data below new min value is considered as an outliers)
Divides the data points in four equal parts
Provides information on the spread of the data
Quartiles and Percentiles
Causes of outliers
- Data entry and measurement errors
- Sampling problems
- Natural Variation
A term used in statistics to express how a score compares to other scores in the same set.
Percentile