Lesson 4: Measures and Visualization of Variability Flashcards
Tail is on the left side
Negatively skewed
detailed version of range
quartiles
How to get the median?
Sort the data from lowest to highest or vice versa, then take the middle value (average the two middle values if the count is even)
In a spreadsheet: =MEDIAN(dataset)
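A minimal Python sketch of the same idea, using the standard library's `statistics.median`. The lists `scores` and `even_scores` are made-up data for illustration only.

```python
from statistics import median

scores = [7, 3, 9, 5, 1]        # hypothetical data, odd count
print(sorted(scores))           # [1, 3, 5, 7, 9]
print(median(scores))           # 5 (the middle value)

even_scores = [7, 3, 9, 5]      # hypothetical data, even count
print(median(even_scores))      # 6.0 (average of the two middle values, 5 and 7)
```

`median` sorts internally, so the input does not need to be sorted first; the explicit `sorted` call is only there to make the middle value visible.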
Steps to create a boxplot
- Highlight the range (sorted from lowest to highest or highest to lowest)
- Insert > Box and Whisker
- Add labels
Tells us whether or not there are outliers
Box plot
Relatively symmetric (description, boxplot)
- median coincides with the mean
- the boxplot is partitioned equally
- skewness is approximately 0
Positively skewed (description, boxplot)
- skew to the right
- mean is greater than the median
- the boxplot is partitioned with the upper area bigger than the lower area
- outliers tend to extend towards larger values
Negatively skewed (description, boxplot)
- skewed to the left
- median is greater than the mean
- the boxplot is partitioned with the upper area smaller than the lower area
- outliers tend to extend towards smaller values
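The mean-versus-median comparison on the skewness cards can be sketched in Python as below. This is only a rule of thumb, not a formal skewness statistic, and the helper name `skew_direction` and the sample lists are hypothetical.

```python
from statistics import mean, median

def skew_direction(values):
    """Rule of thumb: compare the mean with the median."""
    m, md = mean(values), median(values)
    if m > md:
        return "positive"    # tail on the right pulls the mean up
    if m < md:
        return "negative"    # tail on the left pulls the mean down
    return "symmetric"       # mean coincides with the median

right_tail = [1, 2, 2, 3, 3, 4, 20]        # mean 5, median 3
left_tail = [1, 17, 18, 18, 19, 19, 20]    # mean 16, median 18
balanced = [1, 2, 3, 4, 5]                 # mean 3, median 3

print(skew_direction(right_tail))   # positive
print(skew_direction(left_tail))    # negative
print(skew_direction(balanced))     # symmetric
```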
To determine whether a value is an outlier, we use _____________.
Interquartile range
Formula of IQR
IQR = Q3 - Q1
Note: the outlier cutoffs are set 1.5 * IQR below Q1 and 1.5 * IQR above Q3
Get the new maximum value (use IQR)
Q3 + (1.5 * IQR)
= new maximum value (any data point above the new maximum is considered an outlier)
Get the new minimum value (use IQR)
Q1 - (1.5 * IQR)
= new minimum value (any data point below the new minimum is considered an outlier)
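The IQR cards above can be worked through in a short Python sketch. The sample `data` is hypothetical, and `method="inclusive"` is an assumption: tools use different quartile conventions, so Q1 and Q3 may differ slightly elsewhere.

```python
from statistics import quantiles

data = [2, 4, 5, 6, 7, 8, 9, 10, 30]   # hypothetical sample

# quantiles(..., n=4) returns the three cut points Q1, Q2 (median), Q3.
# The "inclusive" method is an assumed convention, not the only valid one.
q1, _, q3 = quantiles(data, n=4, method="inclusive")

iqr = q3 - q1                  # IQR = Q3 - Q1
new_min = q1 - 1.5 * iqr       # Q1 - (1.5 * IQR)
new_max = q3 + 1.5 * iqr       # Q3 + (1.5 * IQR)

# Anything below the new minimum or above the new maximum is an outlier.
outliers = [x for x in data if x < new_min or x > new_max]

print(q1, q3, iqr)         # 5.0 9.0 4.0
print(new_min, new_max)    # -1.0 15.0
print(outliers)            # [30]
```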
Divides the data points into four equal parts
Provides information on the spread of the data
Quartiles and Percentiles
Causes of outliers
- Data entry and measurement errors
- Sampling problems
- Natural Variation
A term used in statistics to express how a score compares to other scores in the same set.
Percentile
50th percentile is also known as …
Median
A type of quantile which divides the number of data points into four parts, or quarters of more-or-less equal size.
Quartile
The value at which 25% of the answers lie below that value, and 75% of the answers lie above that value.
25th Percentile
The difference between the third and first quartiles
Interquartile range
Extreme values that stand out greatly from the overall pattern of values in a dataset or graph.
Outliers
A chart that shows data from a five-number summary (minimum, Q1, median, Q3, maximum), including one of the measures of central tendency (the median)
Box plot
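As a rough illustration of what a box plot draws, the five-number summary can be computed in Python as below. The helper `five_number_summary` and the list `values` are hypothetical, and the `"inclusive"` quartile method is an assumed convention.

```python
from statistics import quantiles

def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum) for a list of numbers."""
    q1, q2, q3 = quantiles(values, n=4, method="inclusive")  # assumed convention
    return min(values), q1, q2, q3, max(values)

values = [1, 3, 5, 7, 9, 11, 13, 15, 17]   # hypothetical sample
summary = five_number_summary(values)
print(summary)   # (1, 5.0, 9.0, 13.0, 17)
```

The box spans Q1 to Q3, the line inside the box is the median, and the whiskers reach toward the minimum and maximum (or stop at the 1.5 * IQR cutoffs when outliers are drawn separately).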
The value at which 50% of the answers lie above that value and 50% of the answers lie below that value
50th Percentile
The value at which 25% of the answers lie above that value and 75% of the answers lie below that value.
75th Percentile
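A small Python sketch of the 25th, 50th, and 75th percentile cards, under the assumption that the `"inclusive"` method of `statistics.quantiles` is an acceptable convention. The list `answers` is made-up data.

```python
from statistics import median, quantiles

answers = [55, 60, 65, 70, 75, 80, 85, 90, 95]   # hypothetical scores

# n=100 yields 99 cut points; index k-1 holds the k-th percentile.
cuts = quantiles(answers, n=100, method="inclusive")   # assumed convention
p25, p50, p75 = cuts[24], cuts[49], cuts[74]

print(p25)   # 65.0 (first quartile, Q1)
print(p50)   # 75.0 (same as the median)
print(p75)   # 85.0 (third quartile, Q3)
print(p50 == median(answers))   # True
```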
Decisions And Outcomes:
Data-Analysis-Insights-Decision-Outcomes