Comparisons and contrasts Flashcards
stats
omnibus F statistic (beyond a one-way ANOVA)
If the IV has more than three levels, the omnibus (overall F statistic) is ambiguous
A significant main effect doesn’t tell WHERE the difference are (which groups differ)
Comparison
A statistical comparison of 2 chunks of data
e. g. young vs middle, middle vs old, young vs old
e. g. young and middle vs old, young vs middle and old, you and old vs middle
Type 1 error rate
probability of making a type 1 error (false positive- saying there is an effect when thee isn’t)
Type 2 error rate
false negative (saying there isn’t an effect when there is)
Per comparison (PC) error rate
Probability of making a type 1 error on ANY comparison
aPC=.05
Familywise (FW) error rate
The probability of at least one Type 1 error in a family of comparisons
1-(1-a)to the power of c
c= no. of comparisons
e.g. 5 comparisons aFW= 1- (1-.05) to the power of 5= .27
There is a 27% chance of making a Type 1 error (around 1 in 4 will be a an error)
Relationship between PC and FW
FW error goes up and a and c go up
PC=or < FW
Multiple Comparisons (MC) and corrections
To reduce the chance of a Type 1 error:
Reduce the number of comparisons (c)
Lower the significance threshold (alpha level)
FDR (false discovery rate)- used with large number of comparisons or large number of significant effects
A priori comparisons
Planned before looking at the data
Based on hypotheses/ theory (what are you interested in)
:) more liberal thresholds (more sensitive to identifying a genuine effect- more powerful)
In SPSS will generally have one less comparison than number of IV levels
Post hoc comparisons
Made after looking at the data
Exploratory analyses
:( more conservative alpha (less prone to Type 1 errors but harder to identify a genuine effect (Type 2 error)- less powerful
in SPSS: post hoc test will give ALL the possible comparisons
Myths busted about multiple comparisons
1) just because you used planned comparisons, it doesn’t mean you don’t need to protect against Type 1 error inflation
(BUT Type 1 error inflation will be less than in post hoc (because post hoc is all possible tests, planned is a subset) so a correction will reduce power less)
2) Omnibus F doesn’t need to be significant to do multiple comparisons
(logic behind MCs doesn’t require this- Wilcox (1987) says not much use for F test at all)