7. F-Distribution - Assumptions of ANOVA Flashcards

1
Q

What does the Null hypothesis tell us?

A

the proposition that all treatment means are equal

i.e. the IV has no effect on the DV

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what odes the alternative (research) hypothesis tell us?

A

the proposition that at least one treatment mean differs from another treatment mean

i.e. the IV does have an effect on the DV

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is a sampling error

A

when populations have different means even through the null hypothesis is true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what should F equal when null hypothesis is true?

A

1

however this may not always be true due to sampling error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what does simulation study involve?

A

repeatedly drawing a same-sized set of number of random samples of the same n from a single population then calculating F for each test (this is called the Monte Carlo siulation)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is the alpha level?

A

the probability level we adopt for concluding that it is unlikely that sampling error caused the observed difference in means

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is the conventional alpha level

A

a=5% (.05)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what do we conclude if we find that F_observed is greater than our F_critical?

A

then we conclude that there is a significant difference between at least two of the treatment means this reject the null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what is the process of hypothesis testing based on?

A

probability not certainty.

and will therefore sometimes make errors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

how often will we incorrectly reject the null hypothesis?

A

a% of the time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is a type 1 error

A

when we incorrectly reject the null hypothesis when the null is true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is a type II error rate

A

the probability of getting an F value less than the F critical when null is false.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

how are type II errors signified?

A

β - beta

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what happens when we lower the type I error rate?

A

we suffer an increase in the type II error rate (and vice versa)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what are the three assumptions of Individual groups ANOVAs?

A
  1. independence of observations
  2. normality of distributions
  3. homogeneity of variance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what happens when we break assumptions of ANOVAs?

A

will lead to either an inflated or deflated estimate of the true BG or WG variability. thus inflate or deflate the obtained F-value (F=BG/WG)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

what is does the independence assumption state?

A

that it is not possible to predict one score in the data from any other score.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

how can this assumption be adequately met in a between groups experimental design?

A
  • random assignment of participants to groups (level of IV0
  • random selection of participants from population of interests
  • Each participant contributes to only 1 score in the analysis (may be the mean of many observations)
  • each participant’s score is independent - i.e. not influenced by any other participant’s score
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

what does the normality assumption state

A

the normality assumption states that the samples are drawn from normally distributed populations and that the error component is normally distributed within each treatment group (level of IV)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is robust to breaches of the normality assumption?

A

ANOVAs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

what are the conditions for an ANOVA to be robust to normality assumptions?

A
  • there are similar number of participants in each condition
  • there are at least 10-12 participants in each condition
  • the departure from normality (skewness or kurtosis) is similar in each condition
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

how to we determine whether the assumption of normality is breached?

A

can either:

  • inspect frequency histograms for each experimental condition
  • skewness
  • kurtosis
  • use SPSS
23
Q

what do measurements of skewness indicate about normality of a distribution?

A

0 = normally distributed data (or any symmetrically distributed data)
positive/ negative values = distribution negatively or positively skewed

24
Q

what do measures of kurtosis indicate about normality of a distribution?

A

3 = normally distributed data (or any distribution that dont have more outliers than normal distributions)

25
Q

what is a less common method of fixing problems with normality?

A

transformation of the data

26
Q

what is an outlier

A

an extreme score at one or both ends of our distribution

27
Q

what can outliers do you our ANOVA>

A

can influence our results as our anova is based on a ration between and within group variance. Our variance measure is based on our SD from the mean, so an outlier can inflate our measure of variance, and it can also impact on our mean

28
Q

what are the solutions to problems of outliers

A
  • remove from the data
  • transform the data to remove the influence of other outliers
  • use a non parametric test
  • bootstrapping techniques
  • run analysis with and without outliers and see if they affect results. If not report this and report ANOVA as usual (use with caution)
29
Q

what is a way to transform the data to remove an outlier?

A

using the winsorized samples technique

30
Q

what does the winsorized samples tecnhique do?

A

samples replace extreme scores with the most extreme score left in the tail of the distribution

e.g.

original data:

3, 7, 12, 15 … 32, 33, 50, 75

Windsorised

3, 7, 12, 15 … 32, 33, 33, 33

31
Q

what is the homogeneity of variance assumption

A

As MS_within is a pooled error term, we need to ensure that variance within each of treatment conditions / groups is similar.

32
Q

what is the rule of thumb for variance sizes?

A

the largest variance should be no more than 4 times the smallest variance

33
Q

what are breaches of homogeneity of variance assumptions composed of?

A

very unequal cell sizes

34
Q

what can breaches of homogeneity of variance assumptions impact?

A

type 1 error rate

35
Q

what is the test for breaches of homogeneity of variance on SPSS?

A

Levene’s Test

36
Q

how can we deal with breaches of homogeneity of variance?

A
  1. If you have equal cell sizes and breach is minor (i.e. largest group variance less than 4 × smallest), you can run an ANOVA as it is robust to minor breaches of homogeneity
  2. Run the ANOVA but use a lower alpha level to control for the possible impact on the type I error rate
  3. Use an alternate statistical test which does not have the homogeneity assumption (e.g. nonparametric test)
  4. Transform the data to remove the heterogeneity and run the ANOVA on the transformed data
  5. New computer intensive methods such as bootstrapping (not covered in this unit).
37
Q

what does lowering the alpha level do?

A

reduces the type 1 error rate. thus the effect of the breach of homogeneity of variance can be reduced

38
Q

what are parametric tests?

A

tests like t and F which make assumptions about the distribution of scores

39
Q

what are nonparametric or distribution free tests?

A

tests that have less restrictive assumptions about the distributions used

40
Q

how do most nonparametric tests work?

A

by converting each score to a rank thus removing the assumption associated with inferential statistics

the ranks are spread out evenly so the shape of the distribution will always be rectangular (=no normality assumption and no problems with outliers)

there are specific rank-order tests for various hypothesis-testing situations

in rank order tests we are coparing mean ranks rather than mean scores to test the hypothesis. that samples are drawn from identical populations. Hence using medians rather than means is more appropriate when describing group differences

41
Q

what statistic does the Kruskal-Wallis test use?

A

chi square

42
Q

what does a significant chi square indicate?

A

significant difference between groups (because it is smaller than the significnace number)

43
Q

what does data transformation involve?

A

performing an identical mathematical operation on all the scores.

44
Q

what do the transformations do to the distribution of scores

A

it changes the shape of the distribution of scores

45
Q

what can a suitable transformation do>

A

reduce heterogeneity of variance

achieve normality

46
Q

what are some types of common transfomraitons?

A
  • Logarithmic (strong skews, outliers and breaches of homgenity)
  • Square-root (positive skew)
  • Reciprocal or reflect (negative skew )
  • Trimmed samples (outliers or heavy tailed kurtosis)
47
Q

what do logarithmic transformations do to the size of values?

A

Log transformation compress large values but have less effect on small values.

48
Q

what will logarithmic transformation do to the spread of scores

A

it will reduce the spread of scores this reducing skewness and decreasing the variability int he samples with large SDs

49
Q

what are the steps in doing a transformation?

A
  1. Identify problem (heterogeneity or skewness).
  2. Find the transformation which minimises this problem. Check the assumption again on the transformed data.
  3. Do not look for the transform which maximises F, but the one that minimises the assumption breach
  4. Perform ANOVA using these transformed scores as the DV.
  5. Transformed data is harder to interpret because it is no longer expressed in the original units of measurement.
  6. Only do a transform when absolutely necessary.
  7. You can run ANOVA on transformed and original data, and if you get the same result, report the original data as they are easier to interpret.
50
Q

what are the advantages of data transformations?

A

can use parametric techniques on transformed scores

51
Q

what are the disadvantages

A
  • May not successfully normalize scores.
  • Can distort meaning of data and result in loss of information.
  • Risk of Type I and Type II errors (and hence power) is unclear.
52
Q

what are the advantages of non parametric tests?

A
  • Can use regardless of the shape of the original distributions.
  • No parameter estimations so no assumptions.
53
Q

what are the disadvantages of non parametric tests>

A
  • Cannot be used for many complex situations.
  • Logic of ranks does not always work when there are many “tied” scores.
  • Can distort meaning of data and result in loss of information.
  • Risk of Type I and Type II errors (and hence power) is unclear.