Practical W2: Basics of Statistics Flashcards
One of the first things that’s super important after collecting your data is to graphically look at your data by making a
histogram
There are two main ways in which a distribution can deviate from normal - (2)
- skewness
- Kurotsis
Diagram of positive and negative skew
If the skewness value between -1 and 1 in SPSS then
it’s fine
If the skewness value in SPSS is less than -1 then
it is a negative skew = non-normal distribution
If the skewness value in SPSS is greater than 1 then
positive skew = non-normal distribution
Diagram of skewness value shown in SPSS
Kurotsis is basically looking at how
‘pointy’ your histogram is
Kurtosis tells us how much our data lies around the
ends/tails of our histogram which helps us to identify when outliers may be present in the data.
A distribution with positive kurtosis, so much of the data is in the tails, will be very
pointy or leptokurtic
A distribution with negative kurtosis, so the data lies more in the middle, will be more
sloped or platykurtic
Normal distribution will have kurotsis value of
0 (mesokurtic)
Characteristic of a negative skew
tail it is pointing towards the lower values and the data is clustered at the higher values
Characteristic of a positive skew
– the tail is pointing towards the higher values and the data is clustered at the lower values
Diagram of mesokurtic (normal) , leptokurtic and platykurtic distribution curve
Kurotsis value in SPSS between -2 and 2 is
all good, normal kurotsis
If kurotsis value in SPSS is less than -2 then shows
platykurtic (non-normal, issue with kurotsis)
If kurotsis value in SPSS is greater than 2
leptokurtic (non-normal, shows issues with kurotsis)
Diagram of kurotsis value in SPSS
Is kurotsis and skewness value here fine?
Good because both the skewness is between -1 and 1 and kurtosis values are between -2 and 2.
Is kurotsis and skewness values fine here?
Bad because although the skewness is between 1 and -1, we have a problem with kurtosis with a value of 2.68 which is larger than 2 and -2
3 ways to transformations your data to make it closer to normal distribution - (3)
- exponential
- power
- log
There is a tertium quid which prompts the saying that
correlation not causation