final exam Flashcards
what do we use when we want to test if two variables are different from each other
two-way chi-square
formula for chi-square
TOTAL OF (Observed - Expected)2 / Expected
when you run the formula for X2 (chi-square) and you get the final number, what is that called
test statistic
degrees of freedom for two-way chi-squared
(number of rows minus 1) * (number of columns – 1)
what do you plug into R-Studio to check p-value for two way chi-square
pchisq(X2 , df = x , lower.tail = F)
then check if the p-value is greater or less than 0.05. If it’s lower, you reject the null hypothesis.
What does “power” mean?
it’s the likelihood of finding a particular difference as statistically significant
what variable are you trying to find in power analysis?
what is n (sample size) you need to reject the null hypothesis and the findings to be significant
Why do you want to aim for ~ 80% for power?
you want a 80% probability of rejecting the null hypothesis with a 95% confidence
when we want an area to be 80%, the z value that corresponds with that is
qnorm(0.8) = 0.84
formula for SE
σ/n(squarerooted)
what are we trying to minimize in power analysis?
Type II error (β): This error occurs when the test fails to reject the null hypothesis when it is actually false. To minimize this, we aim to decrease the overlap between the two distributions (the H0 and Ha curves)
what are we trying to maximize through power analysis?
Distance between the null hypothesis and alternative hypothesis distributions
Maximizing the area of the alternative distribution that falls in the rejection region of the test (where we reject the null hypothesis) increases the chances of detecting a true effect.
what does ANOVA stand for?
ANalysis Of VAriance
What is ANOVA used for?
ANOVA is used to determine if a numeric variable differs across different groups
What are the three assumptions for ANOVA?
The observations are independent within and across groups
The data within each group are nearly normal
The variability across the groups is about equal
Hypothesis testing for two-way chi-squared and ANOVA – what null are they testing?
chi-square Ho = there is no meaningful difference between observation and expected value (from random chance)
ANOVA Ho = all group means are equal
𝜇_x=𝜇_y=𝜇_c
when we reject the null hypothesis in ANOVA it doesn’t mean every mean is different from one another it just means
that there is at least one difference.
Mean Squared Between Groups (MSG) in ANOVA
conceptually represents the amount of variation between groups (how much group means deviate from the overall mean).
If it’s high, there’s more of a difference. If it’s low, the group means are similar to the overall mean - the groups aren’t different.