Week 4: T-Tests and ANOVA Flashcards
What is the purpose of a t-test?
To compare the means of two groups and determine if their difference represents a real difference in the population or occurred by chance.
T-tests are used to quantify how far apart the two means area. In other words, how many SEs is the observed ‘difference in the sample means’ away from zero?
When should the t-distribution be used instead of the z-distribution?
When the sample size is small or the population variance (σ) is unknown
What are the three types of t-test?
Paired t-test; Independent t-test (equal or unequal variance); One sample t-test
What are degrees of freedom in a t-test?
The number of values in a calculation that a free to vary, typically N - 1 for a sample
List the steps for hypothesis testing
- Defne H0
- Define H1
- Choose a significance level (α)
- Select and calculate the test statistic
- Compare the test statistic to the critical value or p-value
- Interpret results
What is the H0 and H1 for a t-test?
H0 = μA - μB = 0 (the means of the two groups are the same)
H1 = 𝜇A ≠ 𝜇B (the 2 samples come from different populations)
What assumptions are made for parametric tests like t-test and ANOVA?
Normality of data distribution; Homogeneity of variance (equal variance)
What is ANOVA used for?
To compare the means of three or more groups (e.g., μ1, μ2, μ3) to determine if at least one mean differs significantly
What is the F-statistic in ANOVA?
The ratio of between-group variance to within-group variance
When is a non-parametric test preferred over a parametric?
- When the sample size is small
- When data are non-normal or contain outliers
- For analysing ordinal or ranked data
Define the critical value for a t-test
A threshold that the test statistic must exceed to reject H0 at a given α
What is the pooled variance in an independent t-test?
A weighted average of the variances of the two groups, used when assuming equal variances
What does a post-hoc test in ANOVA do?
Identifies which specific means differ after finding a significant F-test result (e.g., Tukey post-hoc test)
What are the advantages of parametric tests?
- Greater statistical power for detecting differences
- Robust to violations of normality if the sample size is large
What are advantages of non-parametric tests?
- Valid for small sample sizes and non-normal data
- Can handle ordinal data and outliers effectively
- Assess the median which can be better for highly-skewed distributions
What happens if multiple t-tests are used instead of ANOVA?
It increases the risk of Type I errors due to the inflation of the overall α
The probability of making one or more Type I errors is 1 - (1-.05)^3 = 0.14
What is the difference between z and t distribution?
t is more spread out than Z (heavier tails)
The sample distribution of t depends on n (and has n-1 df). The larger the sample to estimate σ (df→∞), the closer is the t distribution to the normal distribution (z statistic)
All other factors being equal, is a large ¯d more or less likely to occur by chance than a small mean difference?
Less
Given the same ¯d, are groups with smaller SDs more or less likely to report a significant difference than groups with larger SDs?
More
What’s the relationship between t-tests and sample sizes?
The more subjects, the more confident we can be that the differences we find did not occur by chance
In a t-test, what do we need to assess how likely the difference between means is to be “real”?
- The mean difference, ¯d
- The SD for each group
- The number of subjects in each group
When is there strong evidence against H0 for a t-test?
t_obs ≥ t_ critical at a certain α
What is a paired (or dependent) t-test?
Difference between average scores of a single sample of individuals who is assessed at two different times or on two different measures. It can also compare average scores of individuals who are paired in terms of a particular characteristic.
In the population, the two means measured at different times might either:
- be the same (μt1= μt2 → μt1 – μt2 = δ = 0)
- be different (μt1 ≠ μt2 ; μt1 - μt2 < 0 : μt1-μt2 > 0)
What is an independent t-test?
Compares the means of two samples selected independently of each other (i.e., subjects in the two groups are not the same). There are then two types of independent t-tests: equal/pooled variance and unequal/separate variance.
In the population, the two means measured among two different groups might either:
- be the same (μA = μB → μA – μB = δ = 0)
- be different (μA ≠ μB ; μA - μB < 0 ; μA - μB > 0