RESS I: Data Analysis #2 Flashcards

1
Q

What is a sample used for?

A

To make statistical inferences about the population from which it was drawn.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is statistical inference?

A

Statistical inference is the process of using the value of a sample statistic to make an informed guess about the value of a population parameter.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a parameter?

A

A particular characteristic of the population that we are interested in e.g. the population mean or proportion, mean difference, differences in proportions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is an estimation in statistical inference?

A

The process of using summary statistics from collected sample data to represent the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is hypothesis testing?

A

Making a hypothesis about a population and then collect sample data to see whether it gives evidence against the hypothesis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is standard error?

A

In order to estimate the precision of the sample mean, the most obvious option available to us would be to repeatedly measure fresh samples from that population.

We could then calculate the spread of means generated by repeated sets of measurements and calculate the standard deviation of these means as a measure of their spread.

The standard deviation of these different estimates from repeated samples is known as the standard error.

Standard error of the estimate represents the average distance between an estimate and its population parameter.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How do you calculate standard error?

A

SE = standard deviation / square root of n

Where n is the number of samples.
From this we can deduce, the larger the sample size, the smaller the standard error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the difference between precision and accuracy?

A

Measurements that are close to the known value are said to be accurate, whereas measurements that are close to each other are said to be precise.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What can standard error also tell us?

A

The standard error can also be used to estimate the range of estimates (in general) that are most likely to occur. This is because approximately 95% of all sample means will fall between our sample mean value +/– (1.96 x standard error mean).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Where does 99% of the data lie in a normal distribution?

A

Within 3 standard deviations of the mean (and 2 standard errors of the mean).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Where does 95% of the data lie in a normal distribution?

A

Within 2 standard deviation of the mean (and 3 standard errors of the mean).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the 95% Confidence interval?

A

A 95% confidence interval is a range of values that you can be 95% certain contains the true mean of the population. This is not the same as a range that contains 95% of the values. This can be calculated by using the idea that 95% of all sample means will fall between our sample mean value + or – 1.96 x standard error mean (SEM).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the framework for a hypothesis test?

A
  1. State the null and alternative hypothesis
  2. Decide a level of significance (p-value cut-off)
  3. Define and evaluate a test statistic
  4. Calculate the p-value
  5. Interpret the results
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the null hypothesis usually in the form of us?

A

The null hypothesis is usually of the form:

- there is no difference between the two groups
- or there is no association between treatment and outcome
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the null hypothesis usually in the form of us?

A

The alternate hypothesis would be:

- there is a difference between the two groups
- or there is an association between treatment and outcome
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a p-value?

A

The p-value is defined as the probability of obtaining the result, or a more extreme result, if the null hypothesis is true. We need to decide what level of probability we will accept as being
‘unlikely’. Conventionally a 5% probability is considered sufficiently
unlikely. Thus, we tend to choose:

Significance level=0.05

17
Q

What do large p-values signify?

A
  • Quite likely to see these results by chance

- Cannot be sure of a difference in the target population

18
Q

How do you interpret a p-value?

A
  • p-value ≤ 0.05: statistical evidence of a difference or association (reject the null hypothesis)
  • p-value > 0.05: no statistical evidence of a difference or association, fail to reject the null hypothesis
19
Q

How can the 95% confidence level be used in hypothesis testing?

A

If 95% CI of a mean difference does not contain zero, reject the null hypothesis at statistical significance (p-value of 0.05) of 5%.

20
Q

What is the difference between an estimation and a hypothesis test?

A

Estimation is used to generate better understanding of the likely true value of a measurement and the degree of (un)certainty surrounding this estimate – uses 95% Confidence intervals.

Hypothesis testing evaluates whether our data provide confidence that our sample estimate is different from another “population” (or a standard) value – uses p-values & 95% Confidence intervals.