Distributions and Probability Flashcards

1
Q

What is a normal distribution/symmetrical skew?

A

-When data is symmetrical around central scores e.g. mean, median and mode are equal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a positive skew?

A

-Tails out to the right and leans to the left
-Mode - Median - Mean (left to right)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a negative skew?

A

-Tails out to the left and leans to the right
-Mean - Median - Mode (left to right)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How to calculate the skew?

A

Skew = 3(mean-median)
————————
Standard deviation

-If the skew is <0, the data is negatively skewed
-If the skew is >0, the data is positively skewed
-CHECK ON GOOGLE DOCS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a parametric test?

A

-Assumes that values such as the mean and standard deviation accurately reflect the population distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why do we transform data into Z scores?

A

-This can help to standardise the data and reduce the impact of skewness
-This tells us exactly how many standard deviations someone was from the mean:
-68% of the population are within a z score of ±1
-95% of the population are within a z score of ±1.96 (round up to 2)
-99.7% of the population are within a z score of ±2.96 (round up to 3)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the equation of calculating the Z score?

A

Z = Individual point - Group mean
—————————————–
Standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the pros and cons of using distributions?

A

-(+) Can transform data to a standardised scale
-(+) Scale adheres to normal distribution
-(+) Can compare things relative to their own population
-(+) Use the entire dataset
-(-) Sampling errors can occur (how representative they are of the population)
-(-) Standard errors can occur (SEM = standard deviation ÷ square root of num of datapoint)
-(-) Confidence intervals (range of values that contain the true value of statistic e.g. mean)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly