Week 3: Hypothesis Testing and Significance Flashcards
What is the main goal of hypothesis testing?
To assess how compatible sample data are with a certain hypothesis about the population parameter.
What are the two types of hypothesis in hypothesis testing?
Null hypothesis (H0): Assumes no effect or no difference (e.g., μ = 0).
Alternative hypothesis (H1): Assumes an effect or difference exists (e.g., μ ≠ 0).
What are the steps in hypothesis testing?
- Define the null hypothesis
- Define the alternative hypothesis
- Choose a significance level (α)
- Select the correct statistical test and calculate the test statistic
- Find the critical value or p-value
- Interpret the results
What is the significance level (α), and what does it represent?
The probability of a Type I error (rejecting H0 when it is true), commonly set at 0.05.
What is the difference between a one-sided and two-sided test?
One-sided: Tests if a parameter is either greater or less than a critical value
Two-sided: Tests if a parameter is different from a certain value, without specifying direction
What are Type I and Type II errors?
Type I error (α): Rejecting H0 when it is true
Type II error (β): Failing to reject H0 when H1 is true
Note: Setting a lower significance level decreases a Type I error risk, but increases a Type II error risk. A Type I error is often considered to be more important to avoid. The hypothesis test procedure is therefore adjusted so that:
α = P(type I error) = significance level = 0.05
How is a test statistic calculated for a hypothesis test?
By comparing the sample estimate to the null hypothesis value, scaled by its standard error. In other words, the number of SEs a sample estimate is away from H0. It shows the size of the estimate relative to its precision.
Example:
z = x̄ - μ / SE
What is a p-value?
The probability of obtaining a result as extreme or more extreme than the observed result, assuming H0 is true.
How do you interpret a p-value?
p < 0.05: Evidence against H0 at the 5% level
p ≥ 0.05: Insufficient evidence against H0
What is the relationship between p-values and CIs?
Both describe the role of chance in the data. If a CI excludes the null value, the corresponding p-value is less than the significance level.
The p-value is the probability that a sample is consistent with H0. Unlike CIs, it gives little indication of the likely size of the population parameter.
The CI is a range of intervals within which the true population parameter is likely to lie.
What is the critical value in hypothesis testing?
A threshold used to compare the test statistic to determine whether to reject H0. It depends on α and the type of test (one-sided or two-sided)
Why is statistical significance not always practical significance?
Large samples can detect trivial effects as statistically significance, while small samples may miss meaningful effects.
What is “p-hacking” and why is it problematic?
Manipulating data or analyses to obtain significant p-values, leading to misleading or non-reproducible results.
How should p-values be used in reporting?
Treat p-values as a continuum and present them alongside CIs and study context (one piece of evidence alongside other factors e.g., prior evidence, study design, data quality, real-world costs and benefits, etc.
P-values represent the probability of the data given H0, not the probability of H0 given the data. Avoid using binary language to interpret p values (reject/don’t reject) in favour of no/weak/strong/some evidence against H0.
How do you decide based on a 95% CI?
If the CI contains the null value, there is no evidence against H0.
If the CI excludes the null value, there is evidence against H0.
Note: We can use 95% CI for one sample tests. However, for two-sample tests, overlapping CIs does not preclude lack of support for H0.