Lecture 8 - ANOVA Flashcards

1
Q

What is does ANOVA test?

A

The difference between two or more population means

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What kind of test is the ANOVA? parametric or non-parametric

A

Parametric test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the 4 assumptions of an ANOVA?

A

1) Random sampling
2) Homoscedasticity (=equal variances)
3) Independent measurements or observations
4) Normal distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a variable?

A

a variable is what is measured by experimentalist = response or dependent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a Factor?

A

The effect under investigation = independent variable

ie. salinity, temperature, etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are Factor Levels?

A

different treatment levels in an experiment it is something that the experimenter varies
ie. PCB or temp at various levels

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the 2 types (and sub-types) of ANOVA’s?

A

1) Univariate - one variable (response) measured
sub-types: one way (one factor) or multi-way (two or three factors)
2) Multivariate - more than one variable measured

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are two main sources of variation?

A

1) between sample or population means = factor

2) Within samples or populations = error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the variance in ANOVA?

A

is the difference between the population means high or low

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What kind of output do we want with sources of variation in regards to ANOVA?

A

We want to see a high factor variance between factors and a low error within the samples or populations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

When are samples unlikely to come from the same population?

A

If the variation between the sample means is large relative to the variation within the samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does accuracy mean?

A

Accuracy means we know the true value. However this is not often the case

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

If the means are almost the same, what happens to the residuals?

A

The residual becomes zero

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the ability to detect change in the response?

A

sensitivity and is related to the number of levels

high sensitivity is detecting change

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What happens when sample (level) means are close together?

A

high internal variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is precision related to?

A

related to the error, and repeatability of the experiment.

17
Q

What happens when precision is close?

A

high repeatability

18
Q

What happens when precision is far?

A

low repeatability

19
Q

What happens with the null hypothesis when there is high within variance and low between?

A

Get closer to accepting the null that the populations come from the same population (there is no real difference between the populations

20
Q

What happens to the null hypothesis when there is high within variance and higher between variance?

A

End up mid-way between accepting and rejecting (not significantly accepting or rejecting)

21
Q

What happens to the null hypothesis when there is low within variance but higher between variance?

A

Reject the null and the two means/populations are in fact different from each other. No overlap with the between variance means significantly different

22
Q

Why was the Crakenback river Univariate ANOVA criticized?

A

The sites are not independent of each other because they flow into each other and the same statistics that were designed for manipulation were used for a monitoring study

23
Q

What was the result of the Crackenback river sites (factors) not being independent?

A

Huge bias and pseudoreplication

24
Q

What is pseudoreplication?

A

Treating data that is dependent as independent

25
Q

What should we do with faulty ANOVA stats (improper study such as dependent variables)?

A

Use descriptive non-parametric statistics

26
Q

What is a Type I error? How does alpha/critical value relate to this?

A

Type I error is the possibility of rejecting the true null hypothesis.
Critical value relates to this by the fact that you could be rejecting the null the same percent of the time as the critical value stands for
ie if alpha =0.05 then 5% of the time you could be rejecting the true null

27
Q

When do you reject the null hypothesis?

A

When the calculated p-value is less than the critical
or
when the calculated f-value is greater than the critical f (from a table)

28
Q

When do you significantly reject the null hypothesis?

A

When the p-value is very close to zero or the f-calc is far greater than the critical

29
Q

What are the 3 test assumptions for ANOVA regarding a residual diagnostics plot?

A

1) homogeneity of variance (constant variability: Scale location plot and levenes test
2) Independence: Residual vs.Fitted plot and Durbin-Watson test
3) Normality: Normal Q-Q plot and Shapiro-wilk normality test

30
Q

When do we see compliance with homogeneity?

A

When the residuals appear to fluctuate uniformly about zero
ie. uniform dispersion of square root residuals vs. fit
No detectable pattern

31
Q

When do we not see compliance with homogeneity?

A

When there is heterogeneity indicated by a tendency for the residuals to fan or funnel
Fan can face either direction

32
Q

Where is the value of the calculated f when it is greater than the critical f?

A

It is in the rejection region under the area of the curve (calculus)
this means different populations

33
Q

In an ANOVA table what are the residuals?

A

The replicates subtracted by 2?

n # of replicates-2 is the degrees of freedom of the residuals

34
Q

If the data doesn’t pass the parametric assumption tests, what can we do to continue with testing?

A

Log transform the data and test again. If the log data passes, you can continue with parametric statistics

35
Q

What is the TukeyHSD?

A

Honestly Significant Difference and is a pairwise comparison of all possible combinations of factors to determine which factors (ex. Sites) are no different or are different from each other

36
Q

What is the equation for determining how many comparisons are made in a pairwise comparison?

A

k x (k-1)/ 2

37
Q

How do you interpret a TukeyHSD plot?

A

If there is 0 difference between the means of the pairs then the confidence interval will encapsulate 0 and they are similar or the same
If the confidence interval does not touch 0 then the means are different. The further from 0 it lies means the closer the p-value is to 0 and rejecting the null and are significantly different.

38
Q

How do you interpret a TukeyHSD with respect to which sides an interval falls upon?

A

If the interval falls to the left then there is a positive difference
If it falls to the left then there is a negative difference