Topic 4: Normal Model Flashcards

1
Q

What is the normal curve?

A

It is a probability distribution symmetric about the mean, showing that data around the mean are more frequent in occurrence than data far from the mean

It is also recognised as a ‘bell curve’

Discovered by De Moivre

It is the probability density function (pdf) - f(x). The PDF is a special function describing the chance associated with a continuous variable X, over all values of X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the two types of normal curves?

A

The standard normal curve (z)
The general normal curve (x)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the standard normal curve?

A

It has a mean of 0 and SD of 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the general normal curve?

A

It can have any mean and SD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Which scenario allows us to use the normal curve as an approximation to the area under the histogram?

A

When the normal curves seems to fit the histogram

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the normal curve notation?

A

X ~ N(μ, σ^2)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the 2 special properties of a normal curve?

A

1) All normal curves satisfy the ‘68%-95%-99.7%’ rule

2) Any general normal can be rescaled into the standard normal through the use of standard units (z score)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does the z score measure

A

How many SDs a point is above or below the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the 68%-95%-99.7% rule?

A

Area under 1sd out from the mean in both directions is 68% of total values

Area under 2sd out from the mean in both directions is 95% of total values

Area under 3sd out from the mean in both directions is 99.7% of total values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How is standard unit (z score) calculated?

A

(data point - mean) / SD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the limitations of using statistical models?

A

All models are approximations, and thus they are all wrong. However, only some models are useful, so we have to ask ‘which model is best for this particular application’?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do we know when to use the normal curve?

A

Does the histogram look normal?

Do the proportions look right?

Does the quantile - quantile (QQ) plot look like a straight line?

Shapiro Test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does it mean by ‘does the histogram look normal?’

A

Check whether the histogram looks bell shaped, and if it has any outliers or long tails –> if yes we can use the normal approximation/curve / distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does it mean by ‘do the proportions look right?’

A

Involves checking does 1sd above and below the mean give us ~68% of results? and continuing on for other sds –> if yes we can use the normal approximation/curve / distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does it mean by ‘does the quantile-quantile (QQ) plot look like a straight line?’

A

A Q–Q plot is a probability plot, a graphical method for comparing two probability distributions by plotting their quantiles against each other. A point on the plot corresponds to one of the quantiles of the second distribution plotted against the same quantile of the first distribution.

If it looks like a straight diagonal line, then we can use the normal curve/approximation / distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does it mean by ‘Shapiro test?’ What value on the shapiro test is ideal?

A

The Shapiro test essentially just tests for normality / does a sample fit a normal distribution?

A p value of below 0.05 means that the normal doesn’t fit, if it is above 0.05, we can have assume a normal distribution / approximation / curve

17
Q

What is measurement error

A

Measurement Error (also called Observational Error) is the difference between a measured quantity and its true value.

18
Q

What is the equation to account for measurement error?

A

Individual measurement = exact value + chance error + bias

An individual measurement often differs from the exact value

19
Q

What is chance error?

A

No matter how carefully any measurement is made, it could have turned out differently just slightly. This is due to chance error

“Chance error” typically refers to the variability or random fluctuations that occur in the measurement of a quantity due to random factors or chance events. It is also known as random error. Chance errors are inherent in any measurement process and can result from a variety of unpredictable factors, such as instrument limitations, environmental conditions, or human error.

Unlike systematic errors, which are consistent and repeatable, chance errors are unpredictable and can affect measurements differently each time they are taken.

20
Q

What is the best way to estimate chance error?

A

Replicate the measurement under the same conditions, and calculate the standard deviations

21
Q

What is an outlier?

A

It is an observation that lies at an abnormal distance from other values in a random sample from a population

It is statistically defined as a value more than 3ssd away from the mean, assuming normality

22
Q

What is a measurement bias? WHat are the features?

A

It is a confounding variable which typically leads to a systematic error. A measurement is considered biased if it systematically overstates or understates the true value of a measurement

Bias can be deliberate or accidental

Results in a constant amount being added or subtracted from each measurement

23
Q

What is an example of measurement bias?

A

If a scale isn’t properly calibrated. In this case the scale is producing the bias

24
Q

Can bias be estimated by replicating the measurement?

A

No, it can’t because a systematic error will still remain

25
Q

Does mean or sd change with an increase or decrease in price?

A

Mean changes according to the decrease or increase, meanwhile there is no sd change