Hypothesis Testing Flashcards

To have a clear understanding of: - Sample and population - Hypothesis testing - P-values and confidence intervals - Testing a hypothesis referring to a single mean and a single proportion - Interpretation of statistical output for each test

1
Q

Who is included in a population?

A

Current members and future members.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why is a sample of a population used?

A

Because it is not feasible to collect data on the entire population, a sample is used to tell us about the theoretical population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is needed before collecting data from a sample?

A

The theoretical population must be defined in order to understand how generalisable the inferences from the sample are. The sample must be representative of the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the key considerations for a random sample?

A
  1. Each individual from the population must have an equal chance of being included.
  2. The inclusion of one individual must not affect the inclusion of another.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is stratified random sampling?

A

A method in which populations are broken down into smaller divisions and samples taken from these in order to ensure the sample is representative of the entire population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why is the normal distribution important in medical research?

A
  1. To understand the central tendency and variability of the data to allow for summarising and interpretation of large datasets.
  2. For use of parametric statistical tests such as t-tests and ANOVA.
  3. For predictive modelling and risk assessment.
  4. For quality control in order to set limits and identify outliers.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the standard error?

A

A measure of the variability between sample means.
It quantifies the difference between the mean measured from a sample and the mean measured from the theoretical population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the standard deviation?

A

A measure of how far the mean is from other points in the dataset.
It is the variability in the population/sample mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does the null hypothesis state?

A

That there is no difference between populations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does the alternate hypothesis state?

A

That there is a difference between populations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What do you have to assume about a sample in order to test a hypothesis?

A
  1. The sample is representative.
  2. The sample is independent.
  3. The sample has homogenous variance.
  4. The sample is normally distributed.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What can you assume if the sample size is > 30?

A

That the distribution of the sample mean is approximately normal (no matter the distribution of the data).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When should you use the normal distribution?

A

If the sample size is > 30 and assumptions are met.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

When should you use the t-distribution?

A

If the sample size < 30 and assumptions are met.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What should you do if assumptions are not met?

A

First, try to transform the data.
If the transformed data still does not meet the assumptions, use a non-parametric test.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the limiting factor for categorical data?

A

The accuracy of the instrument used to measure the value.

17
Q

What is categorical data known as if it can only take 2 distinct values?

A

Dichotomous or binary.

18
Q

What is a type I error?

A

A false positive - the null hypothesis is rejected when it is true.

e.g., you say there is a difference between groups when there isn’t one

19
Q

What is a type II error?

A

A false negative - the null hypothesis is not rejected when it is false.

e.g., you say there isn’t a difference between groups when there is one.

20
Q

What is the power of a test?

A

The probability of not rejecting the null hypothesis when it is false.

e.g., the probability of not making a type II error.

21
Q

What is the p-value?

A

A measure of the strength of the evidence against the null hypothesis.
A small p-value indicates that the null hypothesis is unlikely to be true.

22
Q

What is a confidence interval?

A

The range of values estimated from a sample within which the true population value is likely to be found.

23
Q

What does a 95% confidence interval mean?

A

That if a random sample was drawn 100 times, 95 out of the 100 times the sample would contain the true population parameter.

24
Q

How do you calculate the 95% CI?

A

95% CI for the mean is (x - 1.96 x SE) to (x + 1.96 x SE)

25
Q

What test should you use to compare one group to a known value?

A

Parametric = One sample t-test.

Non-parametric = sign test.

26
Q

What test should you use to compare 2 paired groups?

A

Parametric = paired t-test.

Non-parametric = Wilcoxon signed-rank test.

27
Q

What test should you use to compare 2 independent groups?

A

Parametric = unpaired t-test.

Non-parametric = Wilcoxon rank sum test.

28
Q
A