Chi-Square Tests Flashcards
What does a chi-square goodness of fit test analyse?
Tests whether the observed frequencies of a single categorical variable correspond to the frequencies expected if the null hypothesis is true
What is the statistic of chi-square?
Chi-square = the sum of (O - E) squared / E (where O = observed frequencies & E = expected frequencies)
If the null is true, what value should the obtained chi-square equal?
Close to zero
Explain the process of a chi-square goodness of fit test
State null & alternative hypotheses; calculate expected frequencies; calculate chi-square; compare with critical chi-square; reach a decision; interpret result
What is the alpha value of chi-square?;
What does the df depend on?;
What else do we include when reporting results?
.05 (two-tailed test);
Number of cells, not number of participants (df = k-1);
Sample size
Can chi-square values be negative?;
What happens if we increase the sample size?;
When will chi-square be small?
No, only positive;
We can increase obtained chi-square, but critical chi-square remains the same;
If differences between O & E are small
How do we determine expected frequencies if the null is true?
They can be uniformly distributed (equiprobable distribution); distributed in accord with theory; distributed in accord with previous observed frequencies; normally distributed
When do we use a chi-square test of independence (or contingency chi-square)?
To determine whether two categorical variables are related or independent (tests whether classification on row variable is independent of classification on column variable)
When are two variables considered independent?;
What does this mean?
When the frequency distribution for one variable has the same shape for all levels of the second variable;
Chi-square is not significant
State the conceptual hypotheses
Null: there is no relationship between the two variables (independent of each other); Alternative: there is a relationship between the two variables (frequency distribution on one is contingent on levels of the other)
What is the formula for determining the expected frequencies for each cell?
row total x column total / N
What degrees of freedom do we use for a chi-square test of independence?;
Otherwise, the process is the same as….
(r - 1) (c - 1); where r = row & c = column;
Chi-square goodness of fit
How does the sign test relate to the Wilcoxon’s MP ranked-sign test?;
What is the formula for expected frequencies?
It’s also used for repeated measures & deals with signs of the difference scores, ignoring any zeros; but values are are also ignored (signs only) & observed frequencies are compared to expected frequencies (using chi-square);
E = N / 2 (null expects equal numbers of positive & negative)
What are the assumptions of chi-square?
Independence of observations (subjects must fall into only 1 category so can’t use on repeated measures designs); all expected frequencies should be at least 5 (the greater the df, the more lenient this is); inclusion of non-occurrences (calculations must be based on all subjects in sample)
How is the chi-square test like the z-test, t-test & ANOVA?
Chi-square goodness of fit examines one group like the z or t-test; Chi-square test of independence examines two independent groups like independent groups t-test & also assess 3 or more independent groups like a one-way independent groups ANOVA