7. F-Distribution - Assumptions of ANOVA Flashcards
What does the Null hypothesis tell us?
the proposition that all treatment means are equal
i.e. the IV has no effect on the DV
what does the alternative (research) hypothesis tell us?
the proposition that at least one treatment mean differs from another treatment mean
i.e. the IV does have an effect on the DV
what is sampling error?
when sample means differ even though the null hypothesis is true (i.e. the samples are drawn from populations with identical means)
what should F equal when null hypothesis is true?
approximately 1
however the observed F will vary around 1 due to sampling error
what does a simulation study involve?
repeatedly drawing sets of random samples of the same size (n) from a single population and calculating F for each set (this is called Monte Carlo simulation)
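a minimal Python sketch of such a simulation (the group size, number of groups, and population parameters are illustrative assumptions; scipy.stats.f_oneway supplies the F statistic):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(42)
n_reps, k, n = 10_000, 3, 10          # replications, groups, participants per group

# the null is true by construction: every sample comes from the same population
f_values = np.empty(n_reps)
for i in range(n_reps):
    groups = [rng.normal(loc=100, scale=15, size=n) for _ in range(k)]
    f_values[i] = stats.f_oneway(*groups).statistic

f_crit = stats.f.ppf(0.95, k - 1, k * (n - 1))
print(f"mean F under the null: {f_values.mean():.2f}")                        # close to 1
print(f"proportion exceeding F_critical: {np.mean(f_values > f_crit):.3f}")   # close to .05
```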
what is the alpha level?
the probability level we adopt for concluding that it is unlikely that sampling error caused the observed difference in means
what is the conventional alpha level
α = 5% (.05)
what do we conclude if we find that F_observed is greater than our F_critical?
then we conclude that there is a significant difference between at least two of the treatment means and thus reject the null hypothesis
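a short sketch of this decision rule (the degrees of freedom and observed F are illustrative; scipy.stats.f.ppf gives the critical value):

```python
from scipy import stats

alpha = 0.05
df_between, df_within = 2, 27          # e.g. 3 groups, 10 participants each
f_observed = 4.20                      # illustrative value from an ANOVA

f_critical = stats.f.ppf(1 - alpha, df_between, df_within)
if f_observed > f_critical:
    print(f"F({df_between}, {df_within}) = {f_observed:.2f} > {f_critical:.2f}: reject H0")
else:
    print(f"F({df_between}, {df_within}) = {f_observed:.2f} <= {f_critical:.2f}: fail to reject H0")
```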
what is the process of hypothesis testing based on?
probability not certainty.
and will therefore sometimes make errors
how often will we incorrectly reject the null hypothesis?
α% of the time
what is a type 1 error
when we incorrectly reject the null hypothesis when the null is true
what is a type II error rate
the probability of getting an F value less than the F critical when null is false.
how are type II errors signified?
β - beta
what happens when we lower the type I error rate?
we suffer an increase in the type II error rate (and vice versa)
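a brief sketch of this trade-off (the degrees of freedom and the noncentrality parameter describing a non-null effect are illustrative assumptions; scipy's noncentral F distribution, stats.ncf, gives β directly):

```python
from scipy import stats

df1, df2, ncp = 2, 27, 6.0             # illustrative dfs and noncentrality (effect size)
for alpha in (0.05, 0.01):
    f_crit = stats.f.ppf(1 - alpha, df1, df2)
    beta = stats.ncf.cdf(f_crit, df1, df2, ncp)   # P(F < F_critical | null false) = Type II rate
    print(f"alpha = {alpha:.2f}: F_critical = {f_crit:.2f}, beta = {beta:.2f}, power = {1 - beta:.2f}")
```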
what are the three assumptions of Individual groups ANOVAs?
- independence of observations
- normality of distributions
- homogeneity of variance
what happens when we break the assumptions of ANOVA?
breaking an assumption leads to an inflated or deflated estimate of the true between-groups (BG) or within-groups (WG) variability, and thus inflates or deflates the obtained F-value (F = BG/WG)
what does the independence assumption state?
that it is not possible to predict one score in the data from any other score.
how can this assumption be adequately met in a between groups experimental design?
- random assignment of participants to groups (levels of IV)
- random selection of participants from population of interests
- Each participant contributes to only 1 score in the analysis (may be the mean of many observations)
- each participant’s score is independent - i.e. not influenced by any other participant’s score
what does the normality assumption state
the normality assumption states that the samples are drawn from normally distributed populations and that the error component is normally distributed within each treatment group (level of IV)
What is robust to breaches of the normality assumption?
ANOVAs
what are the conditions for an ANOVA to be robust to normality assumptions?
- there are similar number of participants in each condition
- there are at least 10-12 participants in each condition
- the departure from normality (skewness or kurtosis) is similar in each condition
how do we determine whether the assumption of normality is breached?
can either:
- inspect frequency histograms for each experimental condition
- skewness
- kurtosis
- use SPSS
what do measurements of skewness indicate about normality of a distribution?
0 = normally distributed data (or any symmetrically distributed data)
positive values = positively skewed distribution; negative values = negatively skewed distribution
what do measures of kurtosis indicate about normality of a distribution?
3 = normally distributed data (or any distribution that does not have more outliers than a normal distribution)
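a small sketch of checking these statistics for one condition (the data are simulated for illustration; note that scipy's default kurtosis is the excess form, where 0 = normal, so fisher=False is used to match the 3 = normal convention above):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
# illustrative scores for one experimental condition
scores = rng.normal(loc=50, scale=10, size=30)

print(f"skewness: {stats.skew(scores):.2f}")                     # ~0 for symmetric data
print(f"kurtosis: {stats.kurtosis(scores, fisher=False):.2f}")   # ~3 for normal data
```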
what is a less common method of fixing problems with normality?
transformation of the data
what is an outlier
an extreme score at one or both ends of our distribution
what can outliers do to our ANOVA?
they can influence our results because the ANOVA is based on a ratio of between-group to within-group variance. Our variance measure is based on deviations (SD) from the mean, so an outlier can inflate our measure of variance, and it can also distort the mean
what are the solutions to problems of outliers?
- remove them from the data
- transform the data to remove the influence of the outliers
- use a non parametric test
- bootstrapping techniques
- run analysis with and without outliers and see if they affect results. If not report this and report ANOVA as usual (use with caution)
what is a way to transform the data to remove an outlier?
using the winsorized samples technique
what does the winsorized samples technique do?
it replaces extreme scores with the most extreme score remaining in the tail of the distribution
e.g.
original data:
3, 7, 12, 15 … 32, 33, 50, 75
Winsorised:
3, 7, 12, 15 … 32, 33, 33, 33
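a minimal sketch of winsorizing with scipy (the data mirror the flashcard example; the 25% upper limit is an illustrative choice that replaces the two highest scores here):

```python
import numpy as np
from scipy.stats import mstats

# illustrative data containing two high outliers (matches the flashcard example)
scores = np.array([3, 7, 12, 15, 32, 33, 50, 75])

# replace the top 25% of scores (here, the 2 highest) with the most extreme
# score left in the tail of the distribution (33)
winsorized = mstats.winsorize(scores, limits=(0, 0.25))
print(winsorized)   # [ 3  7 12 15 32 33 33 33]
```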
what is the homogeneity of variance assumption
As MS_within is a pooled error term, we need to ensure that the variance within each of the treatment conditions/groups is similar.
what is the rule of thumb for variance sizes?
the largest variance should be no more than 4 times the smallest variance
what compounds breaches of the homogeneity of variance assumption?
very unequal cell sizes
what can breaches of homogeneity of variance assumptions impact?
type 1 error rate
what is the test for breaches of homogeneity of variance on SPSS?
Levene’s Test
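a short sketch of running Levene's test outside SPSS, with the rule-of-thumb variance ratio alongside it (the group data are illustrative):

```python
import numpy as np
from scipy import stats

# illustrative scores for three treatment groups
group1 = [23, 25, 28, 30, 22, 27, 26, 29]
group2 = [31, 35, 29, 40, 33, 38, 30, 36]
group3 = [20, 45, 25, 50, 18, 48, 22, 44]

stat, p = stats.levene(group1, group2, group3)
print(f"Levene's W = {stat:.2f}, p = {p:.3f}")
if p < 0.05:
    print("variances differ significantly: homogeneity assumption is in doubt")

# rule-of-thumb check: largest variance should be no more than 4 times the smallest
variances = [np.var(g, ddof=1) for g in (group1, group2, group3)]
print(f"largest/smallest variance ratio: {max(variances) / min(variances):.1f}")
```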
how can we deal with breaches of homogeneity of variance?
- If you have equal cell sizes and breach is minor (i.e. largest group variance less than 4 × smallest), you can run an ANOVA as it is robust to minor breaches of homogeneity
- Run the ANOVA but use a lower alpha level to control for the possible impact on the type I error rate
- Use an alternate statistical test which does not have the homogeneity assumption (e.g. nonparametric test)
- Transform the data to remove the heterogeneity and run the ANOVA on the transformed data
- New computer intensive methods such as bootstrapping (not covered in this unit).
what does lowering the alpha level do?
reduces the type 1 error rate. thus the effect of the breach of homogeneity of variance can be reduced
what are parametric tests?
tests like t and F which make assumptions about the distribution of scores
what are nonparametric or distribution free tests?
tests that have less restrictive assumptions about the distributions used
how do most nonparametric tests work?
by converting each score to a rank, thus removing the distributional assumptions associated with parametric inferential statistics
the ranks are spread out evenly so the shape of the distribution will always be rectangular (=no normality assumption and no problems with outliers)
there are specific rank-order tests for various hypothesis-testing situations
in rank-order tests we are comparing mean ranks rather than mean scores to test the hypothesis that the samples are drawn from identical populations. Hence using medians rather than means is more appropriate when describing group differences
what statistic does the Kruskal-Wallis test use?
chi square
what does a significant chi square indicate?
a significant difference between at least two groups (i.e. the p-value associated with the chi-square statistic is smaller than the alpha level)
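a minimal sketch of the Kruskal-Wallis test (the group scores are illustrative):

```python
from scipy import stats

# illustrative skewed scores for three groups, so a rank-based test is used
group1 = [12, 15, 14, 10, 55]
group2 = [22, 25, 28, 24, 80]
group3 = [35, 40, 38, 36, 95]

h, p = stats.kruskal(group1, group2, group3)
print(f"Kruskal-Wallis H (chi-square approximation) = {h:.2f}, p = {p:.3f}")
if p < 0.05:
    print("significant difference between groups (compare medians, not means)")
```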
what does data transformation involve?
performing an identical mathematical operation on all the scores.
what do the transformations do to the distribution of scores
it changes the shape of the distribution of scores
what can a suitable transformation do?
reduce heterogeneity of variance
achieve normality
what are some common types of transformations?
- Logarithmic (strong skews, outliers and breaches of homogeneity)
- Square-root (positive skew)
- Reciprocal or reflect (negative skew )
- Trimmed samples (outliers or heavy tailed kurtosis)
what do logarithmic transformations do to the size of values?
Log transformations compress large values but have less effect on small values.
what will a logarithmic transformation do to the spread of scores?
it will reduce the spread of scores, thus reducing skewness and decreasing the variability in the samples with large SDs
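a small sketch of a log transformation on illustrative, positively skewed reaction-time data:

```python
import numpy as np
from scipy import stats

# illustrative positively skewed reaction times (ms), including one extreme value
rt = np.array([310, 350, 365, 402, 430, 455, 520, 610, 950, 1800])

log_rt = np.log10(rt)   # add a constant first (e.g. np.log10(rt + 1)) if scores can be 0

print(f"skewness before: {stats.skew(rt):.2f}, after: {stats.skew(log_rt):.2f}")
print(f"SD before: {rt.std(ddof=1):.1f}, after: {log_rt.std(ddof=1):.3f}")
```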
what are the steps in doing a transformation?
- Identify problem (heterogeneity or skewness).
- Find the transformation which minimises this problem. Check the assumption again on the transformed data.
- Do not look for the transform which maximises F, but the one that minimises the assumption breach
- Perform ANOVA using these transformed scores as the DV.
- Transformed data is harder to interpret because it is no longer expressed in the original units of measurement.
- Only do a transform when absolutely necessary.
- You can run ANOVA on transformed and original data, and if you get the same result, report the original data as they are easier to interpret.
what are the advantages of data transformations?
can use parametric techniques on transformed scores
what are the disadvantages of data transformations?
- May not successfully normalize scores.
- Can distort meaning of data and result in loss of information.
- Risk of Type I and Type II errors (and hence power) is unclear.
what are the advantages of non parametric tests?
- Can use regardless of the shape of the original distributions.
- No parameter estimations so no assumptions.
what are the disadvantages of non parametric tests?
- Cannot be used for many complex situations.
- Logic of ranks does not always work when there are many “tied” scores.
- Can distort meaning of data and result in loss of information.
- Risk of Type I and Type II errors (and hence power) is unclear.