week 1 SCM (chi-squared) Flashcards
why do we analyse frequencies when using at categorical variables?
- the numerical values you attach to different categories are arbitary
- this means that the mean of a categorical variable is meaningless
- because of this, we analyse frequencies of each category
what are the rows and columns of a contingency table?
- the columns are the conditions (i.v)
- the rows are the categories of the measure (d.v)
what is the general idea of the chi-squared test
it compares the frequencies you observe in certain categories to the frequencies that you might expect to get in those categories by chance
what is the chi-squared equation?
what is the equation for expected values, used in the chi squared eqution?
what is the degrees of freedom formula for chi squared tests?
(row total-1) *(column total-1)
What is the degrees of freedom for a contingency table with two columns?
1
this is because (r-1) * (c-1)
so
(2-1) * (2-1) = 1
what theory is the likelyhood ratio statistic based on?
The maximum likelyhood theory
this means that the probability for obtaining the observed set of data is maximised
this model is then compared to the probability of obtaining those data under the null hypothesis
therefore the resulting statistic is comparing the observed frequencies with those predicted by the maximised model
when would we use a likelyhood ratio statistic over a chi squared?
When the samples are small
what happens to the chi squared distribution as the degrees of freedom increases?
the peak of the curve moves to the right and the distribution spreads out
what does greater degrees of freedom mean in terms of how high the chi squared value has to be
the more degrees of freedom the higher the chi squared value has to be to be statistically significant
what is a problem with the chi squared test?
- the sampling distribution of the test statistic has an approximate chi squared distribution
- the larger the sample the better the approximation becomes
- however, in small samples the approximation is not good enough making the statistical significance test of the chi squared innacurate
what sample size is required for a chi squared?
- the expected frequencies in each cell must be greater than 5 for the chi squared significance test to be accurate
- what is the degrees of freedom of the likelyhood ratio?
the same as chi squared (rows-1)(columns-1)
what is a type 1 error and a type 2 error?
TYPE 1 ERROR= rejecting the null hypothesis when its actually true
TYPE 2 ERROR= failing to reject the null hypothesis when its actually false