2 - Statistical Inference Flashcards
What is a confidence interval?
A way of conveying uncertainty about a dataset
95% CI = there is a 95% chance that the actual mean of YOUR dataset is in the interval you define
Introduce the concept of hypothesis testing.
x = average of test group, u = true mean
- Define null hypothesis (usually x=u)
- Compute a test statistic: t = (x-u)/SE(x)
- Draw a conclusion
- –(if sample size is greater than ~50, reject null hypothesis if t is less than -2 or greater than 2 (5% chance of this)
Type I error:
Type II error:
Interpret a P-value for an effect.
p = 0.05 means that the results seen would occur by random chance only 5% of the time
Define central limit theorem.
A mathematical result stating that for a sufficiently large sample size, the sampling distribution of the mean will be approximately normal regardless of the underlying distribution of the data
Define effect.
The magnitude of a difference or relationship
Define event.
A clinical outcome of importance
–Ex: onset of a disease (such as cancer or heart disease), onset of a particular symptom (such as bleeding or depression), disease recurrence, or death
Define hypothesis test.
A statistical analysis used to accept or reject a null hypothesis
Define null hypothesis.
The hypothesis being tested about a population
Null = “no difference;” refers to a situation in which there is no difference (e.g., between the means in a treatment group and a control group)
Define parameter.
An unknown summary value for an entire population
The purpose of a statistical analysis is to estimate and make inferences about a parameter
Define power.
The power of a statistical test is the probability that it correctly rejects the null hypothesis when the null hypothesis is false (i.e. the probability of not committing a Type II error)
Define p-value.
The probability of observing a result as extreme as or more extreme than the one actually observed based on chance alone (i.e., if the null hypothesis is true)
Define random sample.
A subset of the population obtained by random selection
Define sampling distribution.
The theoretical distribution of a statistic obtained from a random sample
Define statistical significance level.
The probability of making a type I error in a hypothesis test
Define test statistic.
The specific statistic used to test the null hypothesis (e.g., the t statistic)