week 11 - Chi-Squared test Flashcards
What is the Chi-Square?
- a test of association
- needed when the variables of interest are categorical
Correlations are used with scale variables, what variable is Chi-Square used with?
used with categorical variables
A correlation is a parametric statistic, what statistic is chi-square?
Chi-square is a non-parametric statistic
- does not make assumptions about a distribution
How are Chi-square data represented?
- contingency tables
- shows how the data is distributed across the variables
- the numbers in the cells are known as observed frequencies
- they represent the frequency of people who fall in each combination of levels of the variable
How do we describe contingency tables?
_ x _ design
e.g. 2 variables, and 2 levels of each category
so we have a 2 x 2 design
What can we conclude if there is no association?
if there is no association, the observed values 0 the frequencies in each cell.
- they do not differ from what we would expect to happen merely by chance
What can we conclude if there is an association?
- if the observed values are significantly different from the expected values, we can conclude there is an association between the variables that differs from what we would expect by chance
How do we know what the expected values are?
- expected values are the values you would expect to see in each cell if no association existed between the 2 variables (null hypothesis is true)
What is the equation for the expected value?
(Row total x column total) / Grand total
What is O-E?
observed value - Expected value
What is the Chi-square (X2) equation?
X2 = sum of ((O - E)2 / E)
How do we calculate Chi-square?
The chi-square statistic measures the degree of difference between the observed and expected frequencies
- square each O-E value
- divide each result by its own expected value
- sum all the results
How do we interpret the Chi-square value?
Chi-square should ALWAYS be either zero or a positive number
- if not zero, then an association exists
-
is the association statistically significant?
- we figure this out with the p value
- we also want to know how strong the association is
- for this we need an effect size
How do we calculate the degrees of freedom for a chi-square?
- R = number of rows
- C = number of columns
- df = (R - 1) (C - 1)
What are the two effect sizes for chi-square?
- Phi-coefficient
- Cramer’s V
- both give n a value between 0 (complete independence) and 1 (perfect association)