Chi-square Flashcards

1
Q

What is chi-square?

A

Test of association - variables of interest are categorical
Non-parametric

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are contingency tables?

A

Shows how data are distributed across variables
- observed frequencies are the numbers in the cells - represent the frequency of people who fall in each combination of levels of the variable.
Column and row totals should add up to N

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What kind of contingency table has 2 variables and 2 levels?

A

2x2 design

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How can the association between variables be eyeballed?

A

Descriptive statistics
- do we observe roughly the frequencies we would expect if there were no association
OR
Are our observed frequencies different from expected frequencies?
Need to look at percentage, rather than N of ppts in each category (what % of each category would we expect to fall into each category if there is no association between each variable?)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are expected values?

A

Values you expect to see in each cell if no association exists between the 2 variables (null hypothesis = true)
Want to know these for each individual cell

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the equation for expected values?

A

(row total x column total) / grand total

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the equation for chi-square?

A

X2= ∑ (O-E)squared/E
Square each O-E value in contingency table
Divide each result by expected value
Add up all results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What value should Chi-square always be?

A

0+ - if not 0, an association exists
- Check for stat sig

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How do you check statistical significance?

A

p-value
need to know df
df = (R-1)(C-1)
R = number of rows
C - number of columns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is an effect size?

A

Strength and association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the two effect size measures for chi-square?

A

Phi-coefficient
Cramer’s V
- both give values between 0-1
and take sample size into account

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When should you report phi?

A

2x2 contingency tables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When should you report Cramer’s V?

A

For anything other than 2x2 contingency tables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How should you interpret effect size? (Phi and Cramer’s V)

A
  • check value against standardised cutoffs to judge magnitude of effect
  • value should be equal to or greater than that given on table to fall into that category
  • Values answer the question: what is the association between the 2 variables as a percentage of their max possible variation?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How should you report chi-square?

A

eg. χ2 (1, N = 180) = 30.13, p < .001, φ = .4

df = degrees of freedom

χ2= chi-square statistic

p= significance value

φ = Phi OR V = Cramer’s V (effect size)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What assumptions should be met before running chi-square?

A
  1. Both variables are categorical
  2. Categories are mutually exclusive
  3. No cells in contingency table should have expected frequency of lower than 1
  4. More than 80% of cells should have an expected frequency of 5+
17
Q

What does it mean for categories to be mutually exclusive?

A

Ppts cant be in more than 1 category of each variable

18
Q

What if the expected count is less than 5? (4th assumption is not met)

A
  • If table is 2x2, read significance from Fisher’s exact test (corrects the fact that there is an expected count of less than 5 in one of the cells) - calculates exact probability.
  • If not met for larger tables, consider pooling categories, use exact p-value for persons chi-square, or use other analyses such as likelihood ratio