Chi Square- Contingency and Analysis of residuals Flashcards

1
Q

What are three things to know about the Chi Square test for contingency?

A
  1. It asks whether observed frequencies reflect the independence of two qualitative variables
  2. It compares the actual observed frequencies of some phenomenon (in our sample) with the frequencies we would expect if there were no relationship at all between the two variables in the larger population.
  3. Two variables are independent if knowledge of the value of one variable provides no information about the value of another.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the steps for the chi square test of contingency?

A
  1. State the research question
  2. State the statistical hypothesis (H1 and H0). E.g. there is an association or there is no association.
  3. Set the decision rule (Reject H0 if…)
  4. Calculate the degrees of freedom= (c-1)*(r-1)
  5. Calculate the test statistic using the chi square formula.
  6. Decide if the result is significant.
  7. Interpret the results in terms of expectations.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the formula for calculating the expected values in a multidimensional table?

A

(row total/overall total)*column total.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What do we know about isolating the source of association?

A
  • A significant chi tells us that there is an association between the variables, however it does not tell us where the association comes from
  • We need to determine in which cell(s) the significance comes from.
  • To locate the source of significance we compute the residual.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a residual in a contingency table?

A

It is the deviation of the observed frequency from the expected frequency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the relationship between the size of the deviation and the size of the sample? And what does this mean for our analysis?

A

Cells with the largest expected values have the largest residuals, therefore the residual needs to standardized in order to control for sample/cell size.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the standardised residual?

A

The standardised residual is a residual that controls for the size of the expected cell by dividing by the root of the expected frequency.
e=(O-E)/sqrt(E)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does the standardised residual indicate?

A
  • The standardised residual indicates each cell’s relative contribution to the significance of the chi square value.
  • It also shows you how far the observed value is from the expected value in terms of standard deviations.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does the size of a standard residual indicate?

A
  • Larger standardised residuals indicate a stronger contribution to the chi-squre statistic.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does the sign (+ or -) of a standardised residual indicate?

A
  • Positive values indicate that the observed count is greater than expected.
  • Negative values indicate that the observed count is less than expected.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why can’t we determine significance from the standardised residual?

A
  • the standard normal distribution: mean=0, stdv=1
  • the standardised residuals have a stdv less than 1, so it underestimates the size of the variance.
  • therefore we look at the adjusted residual to determine the significance of the contribution of the observed values.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the adjusted residual?

A

d= e/sqrt(1-nrow/ntotal)*(1-ncol/ntotal)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How are adjusted residuals interpreted?

A
  • Adjusted residuals are interpreted in the same way as z-scores.
  • The critical value for z when a=0.05 is 1.96 or -1.96
  • Therefore, any adjusted residual larger than 1.96 or smaller than -1.96 is statistically significant.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How do you word your analysis of residuals?

A
  • The number of heart attacks were significantly less than expected in those who took aspirin.
  • The number of heart attacks were significantly more than expected in those who took placebo.
  • A significantly higher than expected number of people with no heart attack took aspirin.
  • A significantly lower than expected number of people with no heart attack took placebo.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Note on the interpreting adjusted residuals.

A
  • Interpret the strength and direction of association using the standardized residuals.
  • Significance should be interpreted in terms of what was expected.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly