Normal distribution and standardising data Flashcards

1
Q

What is the standard deviation?

A

A measure of spread: roughly the typical amount by which the values deviate from the mean
Conventionally reported in brackets after the mean, i.e. mean (SD)
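As a rough illustration of the definition above, the sketch below computes a standard deviation by hand and prints it in the "mean (SD)" style. The data are made up, and the card does not say whether the course divides by n or n - 1, so both variants are shown as assumptions.

```python
import math
import statistics

values = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up data, for illustration only

mean = sum(values) / len(values)

# Population SD: square root of the average squared deviation from the mean.
squared_devs = [(x - mean) ** 2 for x in values]
sd_population = math.sqrt(sum(squared_devs) / len(values))

# Sample SD divides by n - 1 instead of n (an alternative convention).
sd_sample = statistics.stdev(values)

# Report in the "mean (SD)" style.
print(f"{mean:.1f} ({sd_population:.1f})")
```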

2
Q

What does a lower standard deviation mean?

A

The values are clustered more tightly around the mean, so the mean is a more representative summary of the data

3
Q

When should we change the data?

A

If there is an error in the results as entered, we can go back to the original data collected and correct it
Or we may need to transform the data so that it can be compared accurately

4
Q

What is scaling data?

A

Multiplying all of the values in a data set by a constant

5
Q

What is standardising data?

A

Transforming data into a common, consistent form

6
Q

What happens when we add a constant to, or subtract a constant from, each value in a data set?

A

Changes the mean by the same amount added or subtracted
The SD remains the same

7
Q

What happens when we multiply or divide to scale?

A

The mean is multiplied or divided by the same constant
The SD is also multiplied or divided by the same constant
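The effect of adding a constant versus multiplying by one can be checked numerically. This is a minimal sketch on made-up values; the constants 5 and 2 are arbitrary choices for illustration.

```python
import statistics

data = [10.0, 12.0, 14.0, 16.0, 18.0]  # made-up values

shifted = [x + 5 for x in data]  # add a constant to every value
scaled = [x * 2 for x in data]   # multiply every value by a constant

mean, sd = statistics.mean(data), statistics.pstdev(data)

# Adding 5: the mean moves by 5, the SD is unchanged.
shifted_mean, shifted_sd = statistics.mean(shifted), statistics.pstdev(shifted)

# Multiplying by 2: both the mean and the SD are multiplied by 2.
scaled_mean, scaled_sd = statistics.mean(scaled), statistics.pstdev(scaled)

print(mean, sd)
print(shifted_mean, shifted_sd)
print(scaled_mean, scaled_sd)
```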

8
Q

What are Z scores?

A

They measure the number of SDs an observation is from the mean
A positive Z-score means the observation is above the mean
A negative Z-score means the observation is below the mean
A Z-score of 0 means the observation equals the mean

9
Q

How do we calculate a Z-score?

A

Z-score = (observation - mean) / SD

10
Q

What are the rules of Z-scores?

A

The mean of all the Z-scores is always 0
The SD of all the Z-scores is always 1
These rules only hold for the whole data set from which the mean and SD were calculated
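These rules can be verified with a small sketch. The weights below are made up, the helper name `z_scores` is hypothetical, and the population SD is assumed; the same result holds if the sample SD is used consistently throughout.

```python
import statistics

def z_scores(values):
    """Standardise values using the mean and SD of this same data set."""
    mean = statistics.mean(values)
    sd = statistics.pstdev(values)
    return [(x - mean) / sd for x in values]

weights = [9.8, 10.4, 11.1, 11.9, 12.3, 13.0]  # made-up weights in kg
z = z_scores(weights)

# Up to floating-point rounding, the Z-scores have mean 0 and SD 1.
print(statistics.mean(z), statistics.pstdev(z))
```

If the mean and SD came from a different data set, the resulting Z-scores would not generally have mean 0 and SD 1.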

11
Q

What are the properties of the normal distribution (Gaussian distribution)?

A

The curve is symmetrical about the mean
The mean is equal to the median
Most observations lie close to the mean
Few observations lie far from the mean on either side of it
The tails of the curve approach the X-axis but never touch it

12
Q

What does the entire area under the normal curve equal?

A

1

13
Q

What are the percentages under each section of a normal curve?

A

Between -1SD and the mean 34.1%
Between +1SD and the mean 34.1%
So between -1SD and +1SD altogether is 68.2%
Between -1SD and -2SD is 13.6%
Between +1SD and +2SD is 13.6%
Beyond -2SD is 2.3%
Beyond +2SD is 2.3%
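The percentages above can be reproduced from the standard normal curve. This sketch uses Python's `statistics.NormalDist` (available from Python 3.8); the helper name `area_between` is hypothetical.

```python
from statistics import NormalDist

std_normal = NormalDist()  # standard normal: mean 0, SD 1

def area_between(z_low, z_high):
    """Proportion of the area under the curve between two Z-scores."""
    return std_normal.cdf(z_high) - std_normal.cdf(z_low)

mean_to_1sd = area_between(0, 1)     # about 0.341
one_to_2sd = area_between(1, 2)      # about 0.136
beyond_2sd = 1 - std_normal.cdf(2)   # about 0.023

# By symmetry the same areas apply on the negative side.
total = 2 * (mean_to_1sd + one_to_2sd + beyond_2sd)
print(mean_to_1sd, one_to_2sd, beyond_2sd, total)
```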

14
Q

What if we don’t have a whole number Z score?

A

We use normal tables
Each cell shows the proportion of the area under the entire curve that lies between the mean and a positive Z-score
The 1st column gives the Z-score to its first decimal place
The top row gives the 2nd decimal place
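For readers without a printed table, a cell can be approximated in code. This is a sketch that assumes the table layout described above (area between the mean and a positive Z-score); the function name `table_cell` is hypothetical.

```python
from statistics import NormalDist

def table_cell(z):
    """Area between the mean and a positive Z-score, as a normal table prints it."""
    return NormalDist().cdf(z) - 0.5

# Row label 1.2 (first decimal place) and column header 0.05 (second decimal place)
# together identify Z = 1.25.
row, column = 1.2, 0.05
cell = table_cell(row + column)  # about 0.394
print(round(cell, 4))
```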

15
Q

How do we answer a question such as: "What proportion of 26-month-old girls in our sample have weight-for-age Z-scores greater than 0.39?"

A

We draw a normal curve
Use the Normal table to get the proportion associated with the Z score
Then subtract this from 0.5, since 0.5 is the total area under each half of the curve
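The steps above can be sketched in code for the Z-score of 0.39 from the question. This assumes the Z-scores are approximately normally distributed and computes the table value rather than reading it from a printed table.

```python
from statistics import NormalDist

z = 0.39

# Value a normal table would give: area between the mean and Z.
table_value = NormalDist().cdf(z) - 0.5

# Area in the upper tail: half the curve minus the table value.
proportion_above = 0.5 - table_value  # about 0.35

print(round(table_value, 4), round(proportion_above, 4))
```

So roughly 35% of the girls would be expected to have a Z-score above 0.39.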

16
Q

How do we answer a question such as: "Health experts say that the top 15% of girls are likely to be overweight. What does this equate to in terms of weight?"

A

We draw a normal curve
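The top 15% leaves 50% - 15% = 35% (0.35) between the mean and the cut-off, so use the normal table in reverse: find 0.35 in the body of the table and read off the Z-score (about 1.04)
We then rearrange the formula to calculate the observation: observation = mean + (Z-score x SD)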
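The reverse lookup and rearranged formula can be sketched as follows. The mean and SD of weight are hypothetical placeholders, since the card does not give the sample's values; substitute the real ones.

```python
from statistics import NormalDist

top_fraction = 0.15
area_mean_to_cutoff = 0.5 - top_fraction  # 0.35, the value looked up in the table

# Reverse lookup: the Z-score with 0.5 + 0.35 = 0.85 of the area below it.
z_cutoff = NormalDist().inv_cdf(0.5 + area_mean_to_cutoff)  # about 1.04

# Hypothetical sample statistics, for illustration only.
mean_weight_kg = 12.0
sd_weight_kg = 1.5

# Rearranged Z-score formula: observation = mean + Z * SD.
cutoff_weight_kg = mean_weight_kg + z_cutoff * sd_weight_kg
print(round(z_cutoff, 2), round(cutoff_weight_kg, 2))
```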