week 11 - Chi-Squared test Flashcards

1
Q

What is the Chi-Square?

A
  • a test of association
  • needed when the variables of interest are categorical
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Correlations are used with scale variables, what variable is Chi-Square used with?

A

used with categorical variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

A correlation is a parametric statistic, what statistic is chi-square?

A

Chi-square is a non-parametric statistic

  • does not make assumptions about a distribution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How are Chi-square data represented?

A
  • contingency tables
  • shows how the data is distributed across the variables
  • the numbers in the cells are known as observed frequencies
  • they represent the frequency of people who fall in each combination of levels of the variable
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How do we describe contingency tables?

A

_ x _ design

e.g. 2 variables, and 2 levels of each category

so we have a 2 x 2 design

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What can we conclude if there is no association?

A

if there is no association, the observed values 0 the frequencies in each cell.

  • they do not differ from what we would expect to happen merely by chance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What can we conclude if there is an association?

A
  • if the observed values are significantly different from the expected values, we can conclude there is an association between the variables that differs from what we would expect by chance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How do we know what the expected values are?

A
  • expected values are the values you would expect to see in each cell if no association existed between the 2 variables (null hypothesis is true)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the equation for the expected value?

A

(Row total x column total) / Grand total

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is O-E?

A

observed value - Expected value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the Chi-square (X2) equation?

A

X2 = sum of ((O - E)2 / E)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do we calculate Chi-square?

A

The chi-square statistic measures the degree of difference between the observed and expected frequencies

  1. square each O-E value
  2. divide each result by its own expected value
  3. sum all the results
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How do we interpret the Chi-square value?

A

Chi-square should ALWAYS be either zero or a positive number

  • if not zero, then an association exists
  • is the association statistically significant?
    • we figure this out with the p value
  • we also want to know how strong the association is
    • for this we need an effect size
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How do we calculate the degrees of freedom for a chi-square?

A
  • R = number of rows
  • C = number of columns
    • df = (R - 1) (C - 1)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the two effect sizes for chi-square?

A
  • Phi-coefficient
  • Cramer’s V
  • both give n a value between 0 (complete independence) and 1 (perfect association)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what effect size do we use for 2x2 contingency tables?

A

Phi-coefficient

17
Q

what effect size do we report for contingency tables that are NOT 2x2?

A

Cramer’s V

18
Q

what are the values for small, medium and large effect size’s for Phi and Cramer’s V?

A
  • small: 0.10
  • medium: 0.30
  • large: 0.50
19
Q

how do we report the results of a chi-square test?

A

X2 (1, N = 180) = 30.13, p < .001, φ = .4

X2 (df, N) = chi square statistic, p value, phi (effect size)

20
Q

What assumptions must be met before running a chi-square test?

A
  1. both variables are categorical
  2. categories are mutually exclusive (participants cannot be in more than one category of each variable)
  3. number of cells in the frequency table should have an expected frequency lower than 1
  4. more than 80% of the cells in the contingency table should have an expected frequency of 5 or more

note: 2x2 tables only have 4 cells, so each cell is 25% of the total. that means that if you have a 2x2 table, even one cell with an expected frequency or more means the assumption is violated

21
Q

What if our 4th assumption is not met (the expected count is less than 5)?

A
  • if this happens and the table is 2x2, read the significance value from Fisher’s exact test line
  • if this happens and the table is larger
    • consider pooling categories
    • use exact p-value for the Pearson’s chi-square
    • use other analysis such as likelihood ratio