probability and distributions Flashcards

1
Q

what is a normal distribution

A
  • when data is symmetrical around central scores
  • i.e., mean, median and mode are equal
  • data should fit along a a Gaussian curve
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

how do you calculate skew

A

3(mean-median)/SD
- if skew is more than 0 it’s negative
- if skew is less than 0 it’s positive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what are the tests for distribution

A

normality tests:
- shapiro-wilk
kolmogorov-smirnov

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is a parametric test

A

tests you run on normally distributed data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what does transforming data into Z scores do

A
  • helps us standardise data and reduce the impact of skewness
  • tells us exactly how far someone is from the mean
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

how do you calculate a Z score

A

individual point-group mean/standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what are the pros of z scores

A
  • can transform data to a standardised scale
  • scale adheres to normal distribution
  • can compare things relative to their own population
  • use the entire data set
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is a sampling error

A
  • we can estimate values and statistics about a population based on a sample but the value is likely to differ from the true mean of the global population
  • we can use standard error to show how sure we are
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

how is the standard error calculated

A

standard deviation/square root of number of datapoint

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what does a standard error tell us

A
  • how likely it is our sample will vary from one sampling to another
  • the smaller the better
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what are confidence intervals

A
  • the range of values that, in a certain proportion of samples contain the true value of a statistic
  • can be used for visualisation - error bars
How well did you know this?
1
Not at all
2
3
4
5
Perfectly