Data Distributions & The Normal Distribution Flashcards

1
Q

What are the different types of data distributions?

A
  • Normal (Gaussian)
  • Non-normal
  • Bimodal
  • Positive/Negative skew
  • Uniform
2
Q

Define data distribution

A

How the data for a particular variable are spread over its range

3
Q

How can you tell if the distribution is normal on a histogram?

A

Usually, the histogram has a single peak in the middle with the bars falling away roughly symmetrically on both sides (the shape looks like a hill), which suggests the data is normally distributed

4
Q

Does sample size affect distribution?

A

Yes, it affects how clearly a normal distribution shows up

Small sample = We don’t have much data, so a normal distribution can be difficult to see in a histogram

Large sample = Normality will emerge (if the underlying variable is normally distributed)
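
The effect of sample size can be sketched with a small simulation. This is only an illustration: the seed, the sample sizes (20 and 20000), the bin range and the helper name `text_histogram` are assumptions chosen for the example, not part of the original card.

```python
import random

def text_histogram(sample, low, high, bins=10):
    """Count how many values fall in each of `bins` equal-width intervals."""
    width = (high - low) / bins
    counts = [0] * bins
    for x in sample:
        # Values outside the range are clamped into the first or last bin.
        index = min(max(int((x - low) / width), 0), bins - 1)
        counts[index] += 1
    return counts

rng = random.Random(42)  # fixed seed so the run is repeatable
for n in (20, 20000):
    # Draw n values from a normal distribution with mean 0 and SD 1.
    sample = [rng.gauss(0.0, 1.0) for _ in range(n)]
    counts = text_histogram(sample, -3.0, 3.0)
    print(f"n = {n}")
    for c in counts:
        # Scale the bars so the tallest bin is 40 characters wide.
        print("#" * round(40 * c / max(counts)))
```

With the small sample the bars tend to look ragged, while the large sample should trace out a much smoother hill.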

5
Q

What are the 2 types of non-normal distribution of data?

A

1) Bimodal
2) Skewed

6
Q

What does skewed non-normal data look like on a histogram?

A

Positively skewed = the histogram starts off with a peak on the left, followed by a long tail stretching to the right

Negatively skewed = the histogram starts off with a long tail on the left, followed by a peak on the right

7
Q

What does bimodal non-normal data look like on a histogram?

A

When data on a histogram has two peaks with a dip between them: it rises to a peak, dips, and then rises to a second peak (unlike the single central peak of a normal distribution; the dip can give the middle a ‘U’ shape)

8
Q

What is the danger of skewed non-normal data?

A

The mean is distorted by the tail (the mean is not in the middle of the data because it gets dragged towards the tail by extreme values)

9
Q

How can we overcome the mean that is distorted by tails in skewed non-normal data?

A

Use the median as a measure of central tendency
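
A quick way to see why the median helps is to compare it with the mean on a skewed sample. The numbers below are made up for illustration (a hypothetical positively skewed sample with one extreme value).

```python
from statistics import mean, median

# Hypothetical positively skewed sample: most values are small, one is extreme.
values = [20, 22, 23, 25, 26, 28, 30, 200]

print(mean(values))    # 46.75, dragged upwards by the extreme value
print(median(values))  # 25.5, stays with the bulk of the data
```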

10
Q

What is the danger of bimodal non-normal data?

A

Mean is not representative (because there are 2 peaks/modes)
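
As a rough illustration, the mean of a bimodal sample can land in the dip between the two peaks, where there is little or no data. The sample below is invented for the example.

```python
from statistics import mean

# Hypothetical bimodal sample: one cluster near 10, another near 50.
scores = [9, 10, 10, 11, 49, 50, 50, 51]

centre = mean(scores)
# Count how many observations lie within 5 units of the mean.
near_mean = sum(1 for s in scores if abs(s - centre) <= 5)

print(centre)     # 30, which sits between the two clusters
print(near_mean)  # 0, no observation is close to the mean
```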

11
Q

What shape is a normal distribution curve?

A

Bell-shaped

12
Q

Assume that the ratio of 2nd digit (index finger) length to 4th digit (ring finger) length (i.e. 2D/4D) in the population follows a normal distribution with mean 0.965 and standard deviation 0.025, i.e. follows N(0.965, 0.025), where the second value is the standard deviation.

What is the 2D/4D ratio of someone who is 3 standard deviations above the mean?

a.	0.975

b.	1.0

c.	1.025

d.	1.04
A

mu - 3 x sigma = 3 s.d.’s below the mean
mu + 3 x sigma = 3 s.d.’s above the mean

(mean) + 3 x (SD) = ratio
0.965 + 3 x 0.025 = 0.965 + 0.075 = 1.04

Answer = D
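
The same arithmetic can be checked in a few lines of Python. The helper name `value_at` is hypothetical and only used for this sketch; `round` is applied because floating-point addition may not give exactly 1.04.

```python
def value_at(k, mu, sigma):
    """Return the value that lies k standard deviations from the mean."""
    return mu + k * sigma

mu = 0.965     # population mean of the 2D/4D ratio (from the question)
sigma = 0.025  # population standard deviation (from the question)

above = value_at(3, mu, sigma)   # 3 s.d.'s above the mean
below = value_at(-3, mu, sigma)  # 3 s.d.'s below the mean

print(round(above, 3))  # 1.04
print(round(below, 3))  # 0.89
```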
