ANOVA Flashcards

Question 1

Q

what does ANOVA stand for?

Answer

A

analysis of variance

Question 2

Q

what is the purpose of ANOVA?

Answer

A

to compare the means of several groups using the variability within and between groups

Question 3

Q

what are the hypotheses in a one-way ANOVA?

Answer

A

H0 : µ1 = µ2 = … = µg
Ha : at least two population means are unequal

Question 4

Q

what are the assumptions for ANOVA?

Answer

A

normal population distributions
equal standard deviations across groups
randomisation in sampling or assignment

Question 5

Q

how is variability partitioned in ANOVA?

Answer

A

between-groups variability: differences between group means
within-groups variability: differences within each group around its mean

Question 6

Q

what is the test statistic for ANOVA?

Answer

A

= between-groups sigma/within-groups sigma

Question 7

Q

how do you calculate the degrees of freedom in ANOVA?

Answer

A

between-groups: df1 = g - 1 (g = number of groups)
within-groups: df2 = N - g (N = total sample size)

Question 8

Q

what does the F distribution signify in ANOVA?

Answer

A

Mean ~ 1 when H0 is true
larger F value indicates stronger evidence against H0

Question 9

Q

how are the mean squares calculated in ANOVA?

Answer

A

mean square between (MSB): sum of squares between (SSB)/df1

mean square within (MSW): sum of squares within (SSW)/df2

Question 10

Q

how is total variability partitioned in ANOVA?

Answer

A

total SS = between-groups SS + within-groups SS

Question 11

Q

when is the F test robust to violations of assumptions?

Answer

A

if sample sizes are equal or approximately equal
when population distributions are approximately normal or have similar standard deviations

Question 12

Q

what should you check for extreme violations of assumptions?

Answer

A

box plots or dot plots for skewness or large differences in standard deviations

Question 13

Q

what does the residual standard deviation s represent in ANOVA?

Answer

A

it is the square root of the within-groups variance estimate or mean square error

Question 14

Q

how do you calculate the degrees of freedom for the error in ANOVA?

Answer

A

df2 = N - g

Question 15

Q

what does it mean if a confidence interval comparing two means does not include 0?

Answer

A

it indicates a significant difference between the population means

Question 16

Q

what should you do if the largest standard deviation is more than twice the smallest?

Answer

A

use a confidence interval formula with separate variances instead of pooled variance

Question 17

Q

how many pairwise comparisons are there for g groups in ANOVA?

Answer

A

g(g - 1)/2 = x comparisons

Question 18

Q

what is the main limitation of constructing multiple confidence intervals for mean differences?

Answer

A

the overall confidence level decreases as the number of comparisons increases

Question 19

Q

what is the boneferroni method?

Answer

A

a method that adjusts error probability for each comparison to ensure a high overall confidence level

Question 20

Q

how does the turkey method improve on the boneferroni method?

Answer

A

it provides narrower confidence intervals and maintains the desired overall confidence level

Question 21

Q

when is ANOVA robust to violations of normality?

Answer

A

when sample sizes are large, the normality assumption is less critical