ANOVA Flashcards
In a one-way ANOVA, what is the general premise of the null?
That there is no difference between the groups.
All population means are the same.
What are the characteristics of a one-way ANOVA?
One outcome varaible (numeric), and one grouping varaible (categorical)
The grouping variable contains 2+ subgroups.
What four steps are needed to construct a test?
- Diagnostic test statistic, t
- Sampling distribution of t if null is true
- The observed t in data
- A rule that maps every value of t onto a decision
What does having having significantly difference between among groups depend upon?
Varaibility. SSb and SSw.
What are the two kinds of varaiblity an ANOVA is concerned with?
Between groups (how different are the group means from the grand [total] mean - SSb)
Within group (how much to values differ from the mean in each group - SSw)
Both varaibilities depend on Sum of Squares (SS).
what is the equation for SSb? (one way)
weighted_by_n_of each group
(group#mean - grandMean)^2 and summed over all groups
What is SSw calculating, and what is the equation?
Calculating the residuels, or how far away each data point is from the group mean.
(dataPoint - group mean)^2 summed over all items in group, and across all groups.
Why don’t we just add SSb and SSw to get a our variance number?
Because we are only interested low SSw and high SSb. Therefore a fraction is between
degrees of freedom needs to be corrected for.
How do you calculate variability using SSw and SSb?
You cannot cacluate using these exact figures, both need correct for df:
MSw = SSw / (n - G),
MSb = SSb / (G -1)
Varability, F = MSb / MSw
High F = means are different
F-distribution.
Made out of the F-values if the null was true. Positively skewed.
What are the key features of a one-way stat block.
F(df-SSb, df-SSw) = f-stat, p-value
Why don’t we run combinations of t-tests to see which group is sig’ different with ANOVA ?
Possible combinations grow exponentially as group number increases. Also increaes Type 1 error rate. Each t-test has 5% T1 error rate. More tests grows this significantly.
We want to keep Family-wise T1 erorr at 5%
How do post-hoc tests keep family wise T1 error at 5%?
They adjust the p-values for the combinations t-test being run on ANOVA groups.
What are residuels also know as?
Within group variation (SSw)
What are residuels also know as?
Within group variation (SSw)