Week 4: T-Tests and ANOVA Flashcards
What is the purpose of a t-test?
To compare the means of two groups and determine if their difference represents a real difference in the population or occurred by chance.
T-tests are used to quantify how far apart the two means area. In other words, how many SEs is the observed ‘difference in the sample means’ away from zero?
When should the t-distribution be used instead of the z-distribution?
When the sample size is small or the population variance (σ) is unknown
What are the three types of t-test?
Paired t-test; Independent t-test (equal or unequal variance); One sample t-test
What are degrees of freedom in a t-test?
The number of values in a calculation that a free to vary, typically N - 1 for a sample
List the steps for hypothesis testing
- Defne H0
- Define H1
- Choose a significance level (α)
- Select and calculate the test statistic
- Compare the test statistic to the critical value or p-value
- Interpret results
What is the H0 and H1 for a t-test?
H0 = μA - μB = 0 (the means of the two groups are the same)
H1 = 𝜇A ≠ 𝜇B (the 2 samples come from different populations)
What assumptions are made for parametric tests like t-test and ANOVA?
Normality of data distribution; Homogeneity of variance (equal variance)
What is ANOVA used for?
To compare the means of three or more groups (e.g., μ1, μ2, μ3) to determine if at least one mean differs significantly
What is the F-statistic in ANOVA?
The ratio of between-group variance to within-group variance
When is a non-parametric test preferred over a parametric?
- When the sample size is small
- When data are non-normal or contain outliers
- For analysing ordinal or ranked data
Define the critical value for a t-test
A threshold that the test statistic must exceed to reject H0 at a given α
What is the pooled variance in an independent t-test?
A weighted average of the variances of the two groups, used when assuming equal variances
What does a post-hoc test in ANOVA do?
Identifies which specific means differ after finding a significant F-test result (e.g., Tukey post-hoc test)
What are the advantages of parametric tests?
- Greater statistical power for detecting differences
- Robust to violations of normality if the sample size is large
What are advantages of non-parametric tests?
- Valid for small sample sizes and non-normal data
- Can handle ordinal data and outliers effectively
- Assess the median which can be better for highly-skewed distributions