RESS I: Data Analysis #2 Flashcards

Question 1

Q

What is a sample used for?

Answer

A

To make statistical inferences about the population from which it was drawn.

Question 2

Q

What is statistical inference?

Answer

A

Statistical inference is the process of using the value of a sample statistic to make an informed guess about the value of a population parameter.

Question 3

Q

What is a parameter?

Answer

A

A particular characteristic of the population that we are interested in e.g. the population mean or proportion, mean difference, differences in proportions.

Question 4

Q

What is an estimation in statistical inference?

Answer

A

The process of using summary statistics from collected sample data to represent the population.

Question 5

Q

What is hypothesis testing?

Answer

A

Making a hypothesis about a population and then collect sample data to see whether it gives evidence against the hypothesis.

Question 6

Q

What is standard error?

Answer

A

In order to estimate the precision of the sample mean, the most obvious option available to us would be to repeatedly measure fresh samples from that population.

We could then calculate the spread of means generated by repeated sets of measurements and calculate the standard deviation of these means as a measure of their spread.

The standard deviation of these different estimates from repeated samples is known as the standard error.

Standard error of the estimate represents the average distance between an estimate and its population parameter.

Question 7

Q

How do you calculate standard error?

Answer

A

SE = standard deviation / square root of n

Where n is the number of samples.
From this we can deduce, the larger the sample size, the smaller the standard error.

Question 8

Q

What is the difference between precision and accuracy?

Answer

A

Measurements that are close to the known value are said to be accurate, whereas measurements that are close to each other are said to be precise.

Question 9

Q

What can standard error also tell us?

Answer

A

The standard error can also be used to estimate the range of estimates (in general) that are most likely to occur. This is because approximately 95% of all sample means will fall between our sample mean value +/– (1.96 x standard error mean).

Question 10

Q

Where does 99% of the data lie in a normal distribution?

Answer

A

Within 3 standard deviations of the mean (and 2 standard errors of the mean).

Question 11

Q

Where does 95% of the data lie in a normal distribution?

Answer

A

Within 2 standard deviation of the mean (and 3 standard errors of the mean).

Question 12

Q

What is the 95% Confidence interval?

Answer

A

A 95% confidence interval is a range of values that you can be 95% certain contains the true mean of the population. This is not the same as a range that contains 95% of the values. This can be calculated by using the idea that 95% of all sample means will fall between our sample mean value + or – 1.96 x standard error mean (SEM).

Question 13

Q

What is the framework for a hypothesis test?

Answer

A

State the null and alternative hypothesis
Decide a level of significance (p-value cut-off)
Define and evaluate a test statistic
Calculate the p-value
Interpret the results

Question 14

Q

What is the null hypothesis usually in the form of us?

Answer

A

The null hypothesis is usually of the form:

- there is no difference between the two groups
- or there is no association between treatment and outcome

Question 15

Q

What is the null hypothesis usually in the form of us?

Answer

A

The alternate hypothesis would be:

- there is a difference between the two groups
- or there is an association between treatment and outcome

Question 16

Q

What is a p-value?

Answer

Study These Flashcards

A

The p-value is defined as the probability of obtaining the result, or a more extreme result, if the null hypothesis is true. We need to decide what level of probability we will accept as being
‘unlikely’. Conventionally a 5% probability is considered sufficiently
unlikely. Thus, we tend to choose:

Significance level=0.05

Question 17

Q

What do large p-values signify?

Answer

Study These Flashcards

A

Quite likely to see these results by chance

- Cannot be sure of a difference in the target population

Question 18

Q

How do you interpret a p-value?

Answer

Study These Flashcards

A

p-value ≤ 0.05: statistical evidence of a difference or association (reject the null hypothesis)
p-value > 0.05: no statistical evidence of a difference or association, fail to reject the null hypothesis

Question 19

Q

How can the 95% confidence level be used in hypothesis testing?

Answer

Study These Flashcards

A

If 95% CI of a mean difference does not contain zero, reject the null hypothesis at statistical significance (p-value of 0.05) of 5%.

Question 20

Q

What is the difference between an estimation and a hypothesis test?

Answer

Study These Flashcards

A

Estimation is used to generate better understanding of the likely true value of a measurement and the degree of (un)certainty surrounding this estimate – uses 95% Confidence intervals.

Hypothesis testing evaluates whether our data provide confidence that our sample estimate is different from another “population” (or a standard) value – uses p-values & 95% Confidence intervals.

RESS I: Data Analysis #2 Flashcards

(20 cards)