probability and distributions Flashcards
1
Q
what is a normal distribution
A
- when data is symmetrical around central scores
- i.e., mean, median and mode are equal
- data should fit along a a Gaussian curve
2
Q
how do you calculate skew
A
3(mean-median)/SD
- if skew is more than 0 it’s negative
- if skew is less than 0 it’s positive
3
Q
what are the tests for distribution
A
normality tests:
- shapiro-wilk
kolmogorov-smirnov
4
Q
what is a parametric test
A
tests you run on normally distributed data
5
Q
what does transforming data into Z scores do
A
- helps us standardise data and reduce the impact of skewness
- tells us exactly how far someone is from the mean
6
Q
how do you calculate a Z score
A
individual point-group mean/standard deviation
7
Q
what are the pros of z scores
A
- can transform data to a standardised scale
- scale adheres to normal distribution
- can compare things relative to their own population
- use the entire data set
8
Q
what is a sampling error
A
- we can estimate values and statistics about a population based on a sample but the value is likely to differ from the true mean of the global population
- we can use standard error to show how sure we are
9
Q
how is the standard error calculated
A
standard deviation/square root of number of datapoint
10
Q
what does a standard error tell us
A
- how likely it is our sample will vary from one sampling to another
- the smaller the better
11
Q
what are confidence intervals
A
- the range of values that, in a certain proportion of samples contain the true value of a statistic
- can be used for visualisation - error bars