2 Normal Distribution & Inferences Flashcards

1
Q

What are the properties of a normal distribution?

A
  • bell shaped
  • symmetrical
  • mean, median and mode equal
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does the central limit theorem state?

A

Anything you can average over a large enough number will give you a normal distribution curve

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What 2 parameters determine the shape of a normal curve?

A

Mean- line of symmetry
SD- how spread out the data is

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the empirical rule?

A

In a normal distribution curve:
68% of data within 1SD
95% of data within 2SD
99.7% of data within 3SD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

If we measure student heights and we know height is normally distributed do we still have to check normality of the data?

A

Yes
Sample and population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a population and what is a sample?

A

Population: group of all items of interest
Sample: set of data drawn from population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a parameter and what is a statistic?

A

Parameter is a descriptive measure of a population
Statistic is a descriptive measure of a sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is inferential statistics?

A

Draw conclusions or inferences about characteristics of populations based on data from a sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a statistical inference?

A

An estimate, prediction or decision about a population based on a sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is sampling variation

A

How sample statistics can vary from sample to sample due to random chance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is standard error?

A

Measure of sampling variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What factors reduce standard error/sampling variability?

A

Increased sample size
Lower variability of outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How do we calculate Standard error?

A

Standard deviation of sample statistic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Sampling variability

A

Sampling variability = standard error = SD of sample statistic

Decrease w inc sample size
Increase w inc variability if outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do we calculate confidence intervals?

A

(Sample statistic)+- (confidence level)xSE

If we want to be 95% confident = 1.96

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

If the 95% confidence interval for the mean is (29.6, 46.9) what does is mean??

A

There is a 95% probability that the true population mean lies within 29.6 and 46.9

17
Q

How can we check if our data is normally distributed?

A

Histogram- is it bell shaped?
Descriptive summary- are mean, median and mode similar?
Does 68% of data like within 1 SD of mean? 95% within 2 SDs?
Q-Q plot - linear
Normality test - sig>0.05

18
Q

What are the normality test we can run?
Why do we have to be cautious

A

Kolmogorov-Smirnov - normal probability plot, if normal it will be linear
Highly influenced by sample size

19
Q

Why do we have confidence intervals and significance levels?

A

Conclusions and estimates based on sample statistics need measures of reliability

20
Q

What does confidence level measure?

A

Proportion of times that an estimating procedure will be correct

21
Q

What does the significance level measure?

A

How frequently the conclusion will be wrong in the long run. 5% sig level= conclusion will be wrong 5% of the time in the long run

22
Q

How do we illustrate significance and confidence level

A

Alpha = significance
Confidence = 1-alpha

Therefore significance + confidence level = 1

23
Q

If the sample increases, what would you expect would happen to the confidence interval?

A

Sample size increases so standard error reduces so confidence interval becomes narrower

24
Q

How do we calculate the sample proportion for categorical data like success/fail?

A

no. successes in sample/total count in sample = p hat = “sample proportion” = mean of sample proportion

25
Q

As the sample size increases, the sample proportion distribution becomes ?

A

Normal (approximately)

26
Q

If we repeat samples again and again we will eventually get a normal distribution of the sample mean. What are the conditions for this to take place?

A

Random sampling - no selection bias
Sample is large enough (>10) & n(p) and n(1-p) >5 (p is mean)

27
Q

What are the 2 methods of assessing normality?

A

Q-Q plot - straight line
Kolmogorov Smironov test (sig>0.05)

28
Q

What is the central limit theorem?

A

For quantitative data, the sample size is large enough (n>30), distribution of the sample mean (if survey repeated 10000 times), would be Normal distribution.

For categorical data, the sample proportion distribution would be normal (if survey repeated 10000s times) but only following certain conditions.

29
Q

What are the conditions necessary for central limit theorem to be valid for categorical data (sample proportion distribution to be normal)

A

Sample size must be large enough
np>=5 and n(1-p)>=5
Randomised selection of sample (no bias)

30
Q

How do we calculate the CI for 95% confidence level?

A

CI = sample statistic +- (1.96 x SE)