Lecture 41- ANOVA Flashcards

1
Q

When is an ANOVA used?

A

Comparing means on continuous responses between multiple groups (more than 3)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is ANOVA doing in terms of the signal to noise ratio?

A
Signal= difference between treatment means (each group)
Noise= The difference within each of the treatment categories (within the group)

ANOVA quantifies both of these levels of variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why is pairwise comparisons (t-tests) for multiple groups unfavorable?

A
  • It’s extra work (need to do for every pair combination)
  • It can lead to lots of false positives. With every test there is a chance of incorrectly rejecting the null. Therefore, risk increases the more tests we do.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How is the notation set up for ANOVA?

A

Y(ij) means the jth response in the ith group

note: ij is in lower case

The number of different groups is denoted K, and the number of
responses in the ith group is denoted ni

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the model for ANOVA?

A

Y(ij)= ui +eij

µi is the true mean response for the ith group at the population level.
eij is the error term for the jth response in the ith group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is ‘special’ about the error term in the ANOVA model?

A

The error terms are assumed to be independent, and to follow a
N(0, σ2) with constant variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is RSS?

A

A measure of the variation in the data that is not explained by
differences between groups.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is TSS?

A

TSS is a measure of the total amount of variation in the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is GSS?

A

can be interpreted as a measure of the variation that
is explained by differences between groups.

This is the same as ESS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the null hypothesis for ANOVA? What is the alternative hypothesis?

A

Null means there is no difference between the groups
H0 : µ1 = µ2 = · · · = µK

Alternative means there is a difference between the groups (note: not all the means have to be different it just implies the null is not true)
HA : µ1, µ2, . . . , µK not all equal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What do we expect of the GSS when group means are very different as opposed to when group means are very similar?

A

From the previous discussion, we expect GSS to be relatively large
when the group means are very different.

We expect GSS to be relatively small when then group means are
similar.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Graphically what would a significant difference in group means look like?

A

GSS (green) would be large compared to RSS (red)

If there was not a significant difference the lines would be equal in length

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How do you calculate the F statistic for the hypothesis test invovled in an ANOVA?

A

Refer to slide 798 but basically…

the group mean square/ residual mean square

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How are the various sum of squares and related quantities frequently displayed?

A

An ANOVA table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Using an ANOVA table what gives you the F statistic required for a hypothesis test?

A

GMS/ RMS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What would result in the failure of the null hypothesis?

A

⇒ Large differences between group means
⇒ Relatively large value of GSS (top of equation for F stat large)
⇒ Large value of F

17
Q

If the null hypothesis is true what does the F-statistic follow? How many degrees of freedom is associated with this distribution?

A
  • F distribution.
  • The F distribution is specified in terms of two degrees of freedom which are the numerator and denominator of the F stat. They are (K − 1) and (n − K)
18
Q

Where do P values come from?

A

Computing the F stat, same rules as always (less then 0.05= significant)