Normal Distribution & Inferential statistics Flashcards

1
Q

Why is normal distribution important?

A

basis for statistical inference

good description of distribution for commonly studied quantitate variables

determines choice of statistical method for data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Normal distribution characterisations

A

mathematically defined theoretical distribution

symmetrical bell-shaped curve

approx. describes many quantitative variables eg height, birthweight

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does knowing the data is normally distributed & having the mean & SD give us the ability to do?

A

Calculate the percentage of people that score below/above any value or within a range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Normal distribution common characteristic values

A

68% of scores with 1 SD

95% within 1.96 SD

99.7% within 3 SD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How is 95% range for normally distributed data calculated?

A

lower bound of range = mean - 1.96 x SD

upper bound = mean + 1.96 x SD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Population, Sample & data definitions

A

Population - total number of people with a certain characteristic

Sample - the people from population taking part in study

Data - information from the sample population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Parameters & estimates explained

A

data from sample is used to obtain estimates of the true values of parameters (true mean/values)

eg % of people with type II diabetes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why do we need statistical methods?

A

the answer provided by sample data is rarely the correct answer in the population

uncertainty due to - variability & the sample being a subset of the pop

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are descriptive statistics?

A

describing & summarising data from a sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are inferential statistics?

A

using sample data to make inferences about characteristics & relationships in the wider population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are standard errors working out?

A

How close the estimate from the sample data is to the parameter value (truth) if the study was repeated with different samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How can standard error be decreased?

A

using a larger sample sizes as this gives us more confident estimate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Why are inferential statistics required?

A

we cant draw conclusions about the population using just the mean difference from a sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does a 95% Confidence interval show?

A

95% certain that something in the population is true (giving a range of data from sample study)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Why is a range of data from sample study given when using 95% confidence interval?

A

Because we don’t know the true population parameter value but can be 95% sure the parameter value will be within the given range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

95% confidence interval definition

A

the range of values within which we are 95% certain the true parameter (mean) lies)

17
Q

Is a narrow or wider Confidence interval (CI) better

A

narrower because we are then more certain about what the truth is

18
Q

What does p-value show?

A

values between 0-1, the lower the value the greater the chance of evidence contradicting the null hypothesis

19
Q

What is hypothesis testing

A

Concerned with what the true answer isn’t

assessing the extent to which the sample estimate disproves the null hypothesis

20
Q

What is a null hypothesis

A

The most boring truth - that nothing is different between control & intervention groups in a test

eg Mean systolic blood pressure in diabetics is the same as in the healthy population

21
Q

What is a alternative hypothesis

A

the opposite of the null hypothesis

That there is a significant difference between the two sample groups

22
Q

Why is a 0.05 p-value significant?

A

is used as a threshold to reject the null hypothesis

below = reject
above = not enough data to reject
23
Q

P-value examples

> 0.001 = ?

0.05 = ?

> 0.1 = ?

A

< 0.001 = strong evidence against null hypothesis

0.05 = moderate

> 0.1 = little

24
Q

What does a small p-value not prove?

A

That there is a large difference between the groups just shows that there is some difference

25
Q

Why are CI’s better than P-values (hypothesis testing)

A

because they tell you something about what the true answer in the population is

However CI can not always be calculated

26
Q

95% CI to 0.05 P-value link?

A

If 0 (null hypothesis value) is within 95% CI = P-value > 0.05

if it is excluded = < 0.05 p-value

if it is either the upper or lower boundary = 0.05 p-value