8/9/10: ANOVA & ANCOVA Flashcards
ANOVA + assumptions
Analysis of Variance: compares means based on categorical independent variables and assesses the impact on a continuous dependent variable
Assumptions are:
- independent,
- deviations from the group means follow a normal distribution
- group variances to be equal.
if normality is voilated then -> GLM
Example: comparing the average test scores (continuous dependent variable) of students from different teaching methods (categorical independent variable: method A, B, or C).
Factorial ANOVA
method used to examine the effects of two or more independent categorical variables (factors) on a continuous dependent variable (so also two way anova: two independent categorical variables on a continuous dependent variable & multiway anova two or more indepentent catagorical variables)
Example: Diet type (categorical) and exercise intensity (categorical) affect weight loss (continuous), with no need to control for other variables.
ANCOVA
Detects differences in means of two or more independent catagorical groups while controlling for one or more continuous covariates, which could influence the continuous dependent variable
ANCOVA is multiple linear regression (assumtion normality and continous data)
Independent Variable: Teaching methods (2 or more groups)
Covariate: Prior knowledge (continuous)
Dependent Variable: Student performance (final exam scores - continuous)
ANCOVA will adjust for differences in pre-test scores (covariate) to better isolate the effect of the teaching methods on the final test scores. This ensures that any differences in final test scores are due to the teaching methods rather than variations in students’ prior knowledge.
Before analysing ANOVA/ANCOVA
- check assumptions (independent,normality)
1. Omnibus test to see if there is any difference at all between the group means (it tests the H0 that all group means are equal)-> if significant then there is a difference somewhere and to clarify which ones…
2. Post Hoc test to see which groups differ
Protects against false positives as withholds you looking at induvidual p-values when there is no significant overal difference
If Omnibus is negative then means there is no difference found between groups so why look any further?
The post hoc is often applied with multiple testing correction to adjust Type I errors
F-statistic two meanings
- In regression analysis: A large F-statistic value proves that the regression model is effective in explaining the variance; significant result, then whatever coefficients you included in your model improved the model’s fit -> model significant
- in ANOVA: The larger the difference in group means the more they vary and the greater the F-value -> supporting evidence against the H0 meaning there is a difference
Regression: models prediction significantly explain the the variance
Anova: indicates there are significant differences between group means
Interactions
- Two-Way ANOVA handles interactions between two factors.
- Multi-Way & factorial ANOVA can assess interactions between two or more factors
- ANCOVA: Can assess interactions between covariates
Depending on your data and research question you choose the model.
interaction in regression table
if at least one slope differs significantly from another, then not all slopes are equal and there is an interaction. This is the case because slope A differs significantly from b -> look at p value
can anova and ancova always be used?
No, only when the data is continous and normaly distributed as poisson and binomial do not posses these assumtions they need an GLM instead
Name the data sorts of the groups
- one way anova: one catagorical indepentent variable + one continuous dependent variable
- two way/multiway/factorial anova: two or more categorical independent variables + one continous depentent variable
- ANCOVA: two or more categorical independent variables + one continous dependent variabel + one or more continous covariates