week 11 statistics Flashcards
Test of association
Test of association are need when the variables of interest are categorical
association is relationship
Correlations
used with scale variables
A correlation is a parametric statistic
comes with assumptions about distribution
Chi-square
used categorical variables. Chi-square is non parametric . Does not make assumptions about a distribution
Representing the data: contingency table
shows how the data are distributed across the variable
The number in the cells are known as observed frequencies
Descriptive statistics are an important place to start
we have to look at descriptive statistics: we observe the frequencies looking at the descriptive statistics
How Chi-square works
if there is no association then observed the values and frequency do not differ from what we would expect to happen merely by chance
But if the observed values are significantly different from the expected values then there is an association
How do we know what the expected values are
Expected values: are the values you would expect to see in each cell if no association existed between the two variables
Row total x column total/ grand total
Chi-square
The chi-square statistic measures the degree of difference between the observed and expected frequencies
1) square each O-E value
2) Divide each result by its expected values
3) add up all the results
Interpreting Chi-squared
should always be either zero or a positive number. If it is not zero, then an association exists
But is it a statistically significant association
We also want to know how strong association it is
Chi-square p-value
To get P-value we need to know the degrees of freedom
To calculate the degree of freedom for chi-square
Critical value table
shows you critical value and tell you the value your statistic must be greater than to be significant
Two effect size. Phi-coefficient and Cramer’s V
Both give a value between 0 and 1. Further from 0 the stronger the association between the two variable.
Takes both sample size into account
Phi and Cohen’s V
the value should be equal too or greater than that given on the table to fall into that category.
As a percentage of their maximum possible variation