Tests Of Nominal Data Flashcards
Examples
Left handedness and creativity.
Gender and criminality.
Gender and course acceptance.
Contingency and associations
Contingency- a relationship between two categories.
A is contingent upon b: a depends on b/ b is necessary for a.
Contingent can mean- a is more likely after b; there is some relationship.
Contingency
Involves two (or more) nominal variables. Works out the percentage of people in one category are associated with the other category. Then construct a contingency table.
Contingency tables
Enter four counts into 2 X 2 table.
Calculate row totals, column totals and overall total got the table as shown below.
Hypothesis testing
H: is an association between the two variables.
We test it as- does the contingency table differ from what we would expect if there was no relationship? Does it differ significantly from what we would expect to see if the null hypotheses were true?
Null hypothesis
We must calculate a contingency for when the null hypothesis is true- we calculate an expected value for each cell of the contingency table.
Expected=(Row total X column total)/overall total.
Work out what numbers would look like in a world there was no association between variable.
Chi square
Chi square (X^2) test of associations. Degrees of freedom= (Nrows-1) X (Ncolumns-1). Look up chi square value and compare to critical value.
Types of statistic
Pearson- standardised test.
Likelihood ratio.
Fisher’s exact test- used when the sample size is too low.
Assumptions of chi square
You must have at least 5 counts in each cell.
If not, use fisher’s exact test.
Goodness-of-fit
When to use-
Looking for how well a set of numbers fit a pattern.
When looking for an odd value from a single population.
When you have nominal data.
Logic is that we compare data obtained with an expected value.
Hypotheses: Ho- data are consistent with a specified distribution.
Ha- data are not consistent with a specified distribution.
Assumption of the goodness-of-fit test:
Sampling method is simple random sampling.
Variable understudy is categorical.
Expected value of the number of sample observations in each level of the variable is at least 5.