Exploring and Describing Data Flashcards
Mean
The mean is the average of the scores: the sum of all the scores divided by the number of scores.
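A minimal sketch of the calculation, using a made-up list of scores (the data and variable names are illustrative only):

```python
from statistics import mean

# Hypothetical scores, chosen only for illustration.
scores = [2, 4, 4, 5, 10]

total = sum(scores)             # 2 + 4 + 4 + 5 + 10 = 25
average = total / len(scores)   # 25 / 5

print(average)                  # 5.0
# The standard library gives the same value.
print(mean(scores) == average)  # True
```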
Median
The median is the middle score. To find the median, list all the scores in increasing order and select the middle one. If there is an even number of scores, the median is the average of the two middle scores.
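A short sketch of finding the median for an odd and an even number of scores. The two lists are made-up examples:

```python
from statistics import median

# Hypothetical scores, chosen only for illustration.
odd_scores = [7, 3, 9, 5, 1]
even_scores = [7, 3, 9, 5]

# Step 1: list the scores in increasing order.
print(sorted(odd_scores))    # [1, 3, 5, 7, 9]

# Step 2: select the middle score.
print(median(odd_scores))    # 5

# With an even number of scores, average the two middle scores (5 and 7).
print(median(even_scores))   # 6.0
```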
Mode
The mode is the score that occurs most often. It is the score with the highest frequency. A dataset can have more than one mode.
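One way to find the mode is to count how often each score occurs. The sketch below uses made-up scores and Python's standard library:

```python
from collections import Counter
from statistics import multimode

# Hypothetical scores, chosen only for illustration.
scores = [4, 7, 4, 9, 7, 4, 2]

# Count the frequency of each score and take the most frequent one.
frequencies = Counter(scores)
print(frequencies.most_common(1))    # [(4, 3)] : score 4 occurs 3 times

# multimode returns every score that shares the highest frequency.
print(multimode(scores))             # [4]
print(multimode([1, 1, 2, 2, 3]))    # [1, 2]
```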
Range
Range = Highest score – Lowest score
Interquartile range
IQR = Third quartile – First quartile = Q3 − Q1
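A sketch of the range and interquartile range for a made-up dataset. Quartile conventions differ between textbooks and calculators; this example assumes Q1 and Q3 are the medians of the lower and upper halves, with the middle score left out of both halves when the number of scores is odd. The helper name `quartiles` is hypothetical.

```python
from statistics import median

def quartiles(scores):
    """Return (Q1, Q3) as the medians of the lower and upper halves.

    Assumption: for an odd number of scores the middle score is
    excluded from both halves. Needs at least two scores.
    """
    ordered = sorted(scores)
    half = len(ordered) // 2
    lower = ordered[:half]     # scores below the median position
    upper = ordered[-half:]    # scores above the median position
    return median(lower), median(upper)

# Hypothetical scores, chosen only for illustration.
scores = [1, 3, 4, 6, 7, 8, 10, 12, 15]

data_range = max(scores) - min(scores)   # 15 - 1
q1, q3 = quartiles(scores)               # medians of [1, 3, 4, 6] and [8, 10, 12, 15]
iqr = q3 - q1

print(data_range)   # 14
print(q1, q3)       # 3.5 11.0
print(iqr)          # 7.5
```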
Standard deviation
Standard deviation is a measure of the spread of data about the mean. Population standard deviation – σn or σx.
Sample standard deviation – σn−1 or Sx.
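The difference between the two standard deviations is the divisor: n for a population and n − 1 for a sample. A minimal sketch with made-up scores, checked against Python's standard library:

```python
from math import sqrt
from statistics import pstdev, stdev

# Hypothetical scores, chosen only for illustration.
scores = [2, 4, 4, 4, 5, 5, 7, 9]

n = len(scores)
mean = sum(scores) / n                                     # 5.0
sum_of_squares = sum((x - mean) ** 2 for x in scores)      # 32.0

population_sd = sqrt(sum_of_squares / n)        # divide by n
sample_sd = sqrt(sum_of_squares / (n - 1))      # divide by n - 1

print(population_sd)          # 2.0
print(round(sample_sd, 3))    # 2.138

# statistics.pstdev and statistics.stdev use the same two divisors.
print(abs(population_sd - pstdev(scores)) < 1e-9)   # True
print(abs(sample_sd - stdev(scores)) < 1e-9)        # True
```

The sample standard deviation is always slightly larger than the population standard deviation for the same scores.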
Outlier
An outlier is a score that is separated from the majority of the data.
Use Q1 − 1.5 × IQR and Q3 + 1.5 × IQR as criteria to determine an outlier. A score below Q1 − 1.5 × IQR or above Q3 + 1.5 × IQR is an outlier.
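A sketch of the outlier test on a made-up dataset. It assumes Q1 and Q3 are the medians of the lower and upper halves (other quartile conventions can give slightly different limits); the function names are hypothetical.

```python
from statistics import median

def quartiles(scores):
    """Return (Q1, Q3) as medians of the lower and upper halves.

    Assumption: for an odd number of scores the middle score is
    excluded from both halves. Needs at least two scores.
    """
    ordered = sorted(scores)
    half = len(ordered) // 2
    return median(ordered[:half]), median(ordered[-half:])

def find_outliers(scores):
    """Return the scores outside Q1 - 1.5 x IQR and Q3 + 1.5 x IQR."""
    q1, q3 = quartiles(scores)
    iqr = q3 - q1
    lower_limit = q1 - 1.5 * iqr
    upper_limit = q3 + 1.5 * iqr
    return [x for x in scores if x < lower_limit or x > upper_limit]

# Hypothetical scores, chosen only for illustration.
scores = [1, 3, 4, 6, 7, 8, 10, 12, 40]

# Q1 = 3.5, Q3 = 11.0, IQR = 7.5
# Lower limit = 3.5 - 11.25 = -7.75, upper limit = 11.0 + 11.25 = 22.25
print(find_outliers(scores))   # [40]
```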
Describing datasets
- Shape of the graph is described in terms of smoothness, symmetry and the number of modes.
- Positively skewed data has a long tail on the right-hand side.
- Negatively skewed data has a long tail on the left-hand side.
Comparing datasets
The selection and use of the appropriate measure of location (mean or median) and measure of spread (range, interquartile range or standard deviation) depend on the nature of the data and the relative merits of each measure.
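One of the relative merits can be seen by adding a single extreme score to a dataset. The sketch below, with made-up scores, suggests that the mean and range change a lot while the median changes very little:

```python
from statistics import mean, median

# Hypothetical scores, chosen only for illustration.
scores = [4, 5, 5, 6, 7]
with_extreme = scores + [50]   # the same data plus one extreme score

mean_change = mean(with_extreme) - mean(scores)
median_change = median(with_extreme) - median(scores)
range_before = max(scores) - min(scores)
range_after = max(with_extreme) - min(with_extreme)

print(mean(scores), median(scores))   # 5.4 5
print(median(with_extreme))           # 5.5
print(range_before, range_after)      # 3 46
print(mean_change > median_change)    # True
```

For data with extreme scores or a strong skew, the median and interquartile range are often the safer choice; for roughly symmetric data, the mean and standard deviation use every score.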
Parallel box-and-whisker plots
Two box-and-whisker plots on the same scale. They are used to compare two sets of data.
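Each box-and-whisker plot is drawn from a five-number summary, so two datasets can be compared by listing their summaries side by side. The sketch below uses made-up class results and assumes quartiles are the medians of the lower and upper halves; `five_number_summary`, `class_a` and `class_b` are hypothetical names.

```python
from statistics import median

def five_number_summary(scores):
    """Return (lowest, Q1, median, Q3, highest).

    Assumption: Q1 and Q3 are the medians of the lower and upper
    halves, excluding the middle score when the count is odd.
    Needs at least two scores.
    """
    ordered = sorted(scores)
    half = len(ordered) // 2
    q1 = median(ordered[:half])
    q3 = median(ordered[-half:])
    return ordered[0], q1, median(ordered), q3, ordered[-1]

# Hypothetical results for two classes, chosen only for illustration.
class_a = [55, 60, 62, 65, 70, 72, 78, 80]
class_b = [40, 52, 58, 66, 74, 81, 88, 95]

summary_a = five_number_summary(class_a)
summary_b = five_number_summary(class_b)

print(summary_a)   # (55, 61.0, 67.5, 75.0, 80)
print(summary_b)   # (40, 55.0, 70.0, 84.5, 95)

# Compare the spread of the two classes using the IQR (Q3 - Q1).
print(summary_a[3] - summary_a[1])   # 14.0
print(summary_b[3] - summary_b[1])   # 29.5
```

In this example the medians are close, but the second class has a much wider box and longer whiskers, so its results are more spread out.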