Statistical Inference Flashcards

1
Q

Define Normal distribution

A

A continuous, bell-shaped probability distribution that is symmetric around the mean and fully described by its mean and standard deviation

2
Q

What is the significance of 2 standard deviations in Normally distributed data?

A

If data is Normally distributed, then approximately 95% of the data points lie within 2 (more precisely, 1.96) standard deviations of the mean
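The 2 versus 1.96 distinction can be checked numerically. The following is a minimal sketch using Python's standard-library `statistics.NormalDist`; the helper name `coverage_within` is made up for illustration.

```python
from statistics import NormalDist

# Standard Normal distribution: mean 0, standard deviation 1.
std_normal = NormalDist(mu=0.0, sigma=1.0)


def coverage_within(k: float) -> float:
    """Probability that a Normal value lies within k SDs of the mean."""
    return std_normal.cdf(k) - std_normal.cdf(-k)


within_2_sd = coverage_within(2.0)      # a little over 95%
within_1_96_sd = coverage_within(1.96)  # very close to 95%

# Going the other way: the multiplier that leaves 2.5% in each tail.
z_for_95 = std_normal.inv_cdf(0.975)

print(f"within 2 SD:    {within_2_sd:.4f}")
print(f"within 1.96 SD: {within_1_96_sd:.4f}")
print(f"z for 95%:      {z_for_95:.4f}")
```

Rounding 1.96 up to 2 is a convenient rule of thumb; the exact multiplier only matters when the interval has to be precise.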

3
Q

What are the main assumptions when conducting statistical inference?

A

That the sample is representative of the population

That the sample follows a Normal distribution

4
Q

How can the assumption of Normal distribution be assessed?

A

The mean and the median should be approximately equal
The boxplot should be symmetric
About 95% of the data should lie within 2 SDs of the mean
The Normal probability plot should be approximately linear
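Two of the checks above (mean versus median, and the share of data within 2 SDs) are easy to compute directly. The sketch below uses simulated data and only the standard library; the function name `normality_checks`, the seed and the sample sizes are illustrative assumptions. The boxplot and Normal probability plot are graphical checks and are not covered here.

```python
import random
import statistics


def normality_checks(data):
    """Return simple numeric summaries that hint at (non-)Normality."""
    mean = statistics.fmean(data)
    median = statistics.median(data)
    sd = statistics.stdev(data)
    # Share of observations within 2 sample SDs of the sample mean.
    share_within_2sd = sum(abs(x - mean) <= 2 * sd for x in data) / len(data)
    return {
        "mean": mean,
        "median": median,
        # Gap between mean and median in SD units; near 0 for symmetric data.
        "mean_median_gap_in_sd": (mean - median) / sd,
        "share_within_2sd": share_within_2sd,
    }


rng = random.Random(0)  # fixed seed so the run is repeatable
normal_sample = [rng.gauss(50.0, 10.0) for _ in range(5000)]
skewed_sample = [rng.expovariate(1.0) for _ in range(5000)]  # right-skewed

normal_checks = normality_checks(normal_sample)
skewed_checks = normality_checks(skewed_sample)

print("Normal sample:", normal_checks)
print("Skewed sample:", skewed_checks)
```

For the skewed sample the mean sits clearly above the median, while the share within 2 SDs can still look close to 95%, so the checks are best read together rather than one at a time.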

5
Q

What is the standard error of the mean?

A

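The standard error of the mean is the standard deviation of the sample means that would be obtained from repeated samples of the same size. It is estimated from a single sample as the sample standard deviation divided by the square root of the sample size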
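The idea of "the standard deviation of the sample means" can be illustrated by simulation. The sketch below draws many samples from a Normal population and compares the spread of their means with the population SD divided by the square root of n; the population values, seed and sample counts are arbitrary assumptions chosen for illustration.

```python
import math
import random
import statistics

rng = random.Random(1)  # fixed seed so the run is repeatable

POP_MEAN, POP_SD = 100.0, 15.0  # assumed population values
n = 25                          # size of each sample
n_samples = 4000                # number of repeated samples

# Mean of each repeated sample.
sample_means = [
    statistics.fmean([rng.gauss(POP_MEAN, POP_SD) for _ in range(n)])
    for _ in range(n_samples)
]

# Empirical standard error: the SD of the sample means.
empirical_se = statistics.stdev(sample_means)

# Formula-based standard error: SD / sqrt(n).
theoretical_se = POP_SD / math.sqrt(n)

print(f"empirical SE:   {empirical_se:.3f}")
print(f"theoretical SE: {theoretical_se:.3f}")
```

In practice only one sample is available, so the population SD in the formula is replaced by the sample SD.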

6
Q

What is meant by a 95% confidence interval?

A

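A CI is a range of plausible values for a population parameter (such as a mean), showing how precisely the sample has estimated it
For Normally distributed data, we expect 95% of sample means to lie within 1.96 standard errors of the true population mean, so about 95% of intervals calculated as sample mean ± 1.96 standard errors will contain the true mean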
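A short simulation can make this interpretation concrete. The sketch below builds an interval of sample mean ± 1.96 standard errors for many repeated samples and counts how often the interval contains the true mean. The helper `ci95`, the population values and the seed are illustrative assumptions; for small samples a t-distribution multiplier is commonly used in place of 1.96.

```python
import math
import random
import statistics

Z_95 = 1.96  # Normal multiplier for a 95% interval


def ci95(sample):
    """Return (low, high) for mean +/- 1.96 standard errors."""
    n = len(sample)
    mean = statistics.fmean(sample)
    se = statistics.stdev(sample) / math.sqrt(n)
    return mean - Z_95 * se, mean + Z_95 * se


rng = random.Random(2)  # fixed seed so the run is repeatable
TRUE_MEAN, TRUE_SD = 70.0, 8.0  # assumed population values
n, n_repeats = 50, 2000

hits = 0
for _ in range(n_repeats):
    sample = [rng.gauss(TRUE_MEAN, TRUE_SD) for _ in range(n)]
    low, high = ci95(sample)
    if low <= TRUE_MEAN <= high:
        hits += 1

# Fraction of intervals that captured the true mean; expected to be near 0.95.
coverage = hits / n_repeats
print(f"coverage: {coverage:.3f}")
```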

7
Q

What is the relevance of the width of a 95% CI?

A

The width of a CI is a measure of how precisely we have measured a variable - the narrower the interval, the more precise the estimate.

8
Q

How can we minimise standard error?

A

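By increasing the sample size, because the standard error shrinks in proportion to 1 divided by the square root of the sample size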
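The effect of sample size can be seen with a few lines of arithmetic. This is a minimal sketch assuming a fixed SD of 12 (an arbitrary illustrative value); the function name `standard_error` is made up for the example.

```python
import math


def standard_error(sd: float, n: int) -> float:
    """Standard error of the mean: SD divided by the square root of n."""
    if n <= 0:
        raise ValueError("sample size must be positive")
    return sd / math.sqrt(n)


sd = 12.0  # assumed standard deviation, for illustration only

# Each step quadruples the sample size, which halves the standard error.
se_by_n = {n: standard_error(sd, n) for n in (25, 100, 400)}

for n, se in se_by_n.items():
    ci_width = 2 * 1.96 * se  # full width of a 95% CI built from this SE
    print(f"n={n:4d}  SE={se:.2f}  95% CI width={ci_width:.2f}")
```

Because of the square root, halving the standard error takes four times as many observations, so gains in precision become progressively more expensive.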
