Lecture 4 Flashcards
Distribution and the Central Limit Theorem
Types of Distribution
Normal and Non-normal
Normal Distribution
AKA: Gaussian or Parametric distribution. Curve centred around the mean. Most of the data clustered around the centre. A perfect normal distribution: the mean, median and mode would all be the same value and there would be an equal number of data points either side of the mean.
Non- normal Distribution
Less common than normal distribution. Majority of data found either to the left (positive skew) or right (negative skew) of the centre of the data
Left Skewed Example
Age related condition - increases with age.
Kurtosis
A measure of how wide or narrow the tails of a distribution are. We can think of this tailed-ness as a representation of how often outliers occur. We always measure the kurtosis of a distribution relative to normal distribution.
Mesokurtic
Medium-Tailed
Platykurtic
Thin-Tailed
Leptokurtic
Fat-Tailed
Non-normal Distribution Examples
Continuous Uniform Distribution, Negative Exponential Distribution
Z-Scores
The number of SD’s an observation is away from the population mean. The formula for z-scores is simply the deviation from the mean divided by the standard deviation
+1SD gives a z-score of 1
-1SD gives a z-score of -1
Positive Z-Score
An observation has a value above the population mean
Negative Z-Score
A value below the mean
Why are Z-Scores Useful?
Z-scores allow us to standardise differences across different distributions. E.g. infant birth weight/growth