central tendency and measures of spread Flashcards
3 measures of central tendency
- mean, median, mode
what is the mean
The mean of a dataset is the SUM of the dataset divided by the number of individuals (n)
what is the median
The median value is the middle value of a dataset when the dataset is listed in numerical order (ranked)
what is the mode
The mode of a dataset is the value that occurs most often
why are central tendency data often so different?
Mean vs Median
Mean is very sensitive to extreme values
Median is much less sensitive (robust)
why is the mode rarely used for quantitative data?
In such a small dataset comprising INTERVAL data the chances of a value occurring more than once are low
The mode is rarely used for quantitative data except in specific cases such as technical analysis e.g. interpretation of particle-size analyses
When to use which Central Tendency?
Nominal data (categories) → use the mode
Quantitative data → use the mean or the median
Use the median when extreme values are present and you donʼt want a distorted average
Use the mean when extreme scores are absent
Why use Measures of Spread?
Measures of central tendency give us ʻtypicalʼ values of dataset
…but this is not enough
Very different datasets may have very similar MEAN/MEDIAN
…so we need to also describe the spread of the data
Ways to quantify Measures of Spread?
Four main techniques are used in Geography
- Range
- Inter-quartile range
- Standard deviation
- Variance
Problems with using the range
The range is very sensitive to sample size and extreme values.
Variance
- The variance is very similar to the standard deviation and uses almost the same equation
-The only difference is that you donʼt calculate the square root