Week 6 (Chi square) Flashcards
Explain degrees of freedom?
The Degrees of Freedom refers to the number of values involved in the calculations that have the freedom to vary
What are inferential statistics
Inferential statistics makes inferences about populations using data drawn from the population. Instead of using the entire population to gather the data, the statistician will collect a sample or samples from the millions of residents and make inferences about the entire population using the sample
What is a chi square goodness of fit?
The goodness of fit test is used to test if sample data fits a distribution from a certain population
it tells you if your sample data represents the data you would expect to find in the actual population.
Chi square goodness of fit and null hypothesis?
Default assumption is that the proportion is uniformly distributed across categories
What are the chi square goodness of fit data assumptions?
Data must be counts (categorical or ordinal)
Counts in each category must be independent (can’t contribute to more than one category)
Sample size must be large enough (have at least 5)
Chi square formula goodness of fit?
Similar to variance
Deviation scores are calculated for every cell and summed
X^2 = sum of all cells (observed count-expected count)^2/expected count
What chi square value indicated a perfect fit to the expected deviation and aligns with the null hypothesis?
0 means a perfect fit
Closer to 0, closer to null hypothesis
How do you calculate degrees of freedom for a chi-square goodness of fit?
Number of cells - 1
Explain cramers v in relation to chi-square goodness of fit?
Like a correlation coefficient
Possible range 0,1
Measure of association between variables
Square root of X^2/N(k-1)
N= total sample size
K-1 = degrees of freedom
Explain cramers V values and strength of effect?
>.5 = large .3-.5 = medium .1-.3 = small 0-.1 = trivial
What does a chi-square test of independence measure?
Statistical independence or association between two or more categorical variables
How do you calculate expected frequencies for a chi-square test of independence?
(Row total x column total)/N total
How do you calculate degrees of freedom for a chi-square test of independence?
df = (# rows - 1) x (# colums -1)
What causes a chi-square test of independence to become robust?
If 20% has small (less than five) expected frequencies
Solution is to collect more data!
What type of data are chi-square tests used to analyse?
Categorical