Lecture 6 REVISED Flashcards
hypothesis = ?
informed speculation to be tested
e.g., possible relationship between 2 or more variables
association = ?
patterned/systematic variation
tools for discrete variables in demonstrating an association?
descriptive statistics: contingency tables
inferential statistics: chi-squared test
alpha = ?
threshold for statistical significance
usually 5%
what are the necessary steps to establishing causality?
- rationale linking cause and effect
- demonstrate cause came before outcome
- show association between cause and outcome
- remove any other factors that could be related to the outcome
marginal distribution = ?
focus on one variable at a time
conditional distribution = ?
focusing on a variable and how it’s affected by another variable
how do you calculate degrees of freedom?
n-1
what is chi-squared (x squared) about?
comparing two differences
differences we observe in our sample vs differences we expect if null hypothesis were true
how is chi-squared calculated?
(observed difference - expected difference) / expected difference
p value
helps us weigh the inference value of descriptive evidence
between 0 and 1, tells us how probable the null hypothesis is
high p value means…
low p value means…
high p value = sample data are compatible with true null hypothesis
low p value = the sample data aren’t compatible with true null hypothesis
what do you do with your p value?
compare it to alpha/significance level to decide whether to reject or fail to reject null hypothesis
does chi squared test work with all samples?
no, doesn’t work well with very small samples