Week 7: Logistic Regression and Odds Ratio Flashcards
What type of data is logistic regression used for?
Dichotomous/binary outcomes
How is “odds” defined in logistic regression?
Odds = number with the event / number without the event
How is the odds ratio (OR) interpreted?
- OR < 1: Outcome is less likely to occur
- OR > 1: Outcome is more likely to occur
- OR = 1: No difference in likelihood
What does it mean if the 95% CI for OR includes 1?
The result is not statistically significant at p < 0.05
How are logistic regression models fitted in statistical software?
Using the natural log of the odds (ln(odds)) as the outcome variable. Ln(odds) can take any positive or negative value. In Stata, ln(odds) is referred to as coefficient.
Why use logistic regression instead of chi-square tests?
Logistic regression allows the inclusion of multiple explanatory variables and continuous variables
What is the rule of thumb for the number of explanatory variables in logistic regression?
One explanatory variable for every 10 participants in the smallest outcome group
What assumptions must be met?
- Data must be independent
- Explanatory variables should not be highly correlated
How do you calculate a 95% CI for ln(odds)?
Lower limit = coefficient - (SE1.96), upper limit = coefficient + (SE1.96)
What is the significance of the Wald test?
Tests the H0 that there is no relationship between variables by examining if true parameter ln(OR) = 0
If z is greater than +/- 1.96 then it is statistically significant
How does logistic regression handle categorical explanatory variables with more than two categories?
It assigns one category as the reference and calculates ORs for the others relative to it
How can logistic regression coefficients be interpreted for continuous variables?
They represent the change in ln(odds) for a one-unit increase in the variable
What is the formula to convert coefficients and ORs to ORs for larger units (e.g., 10-unit changes)?
If using coefficients, multiply by 10 then exponentiate i.e., exp(coefficient*10)
If using ORs, raise to the power of 10
What is ln?
Natural logarithm (sometimes written log or log_e, although log has a different mathematical meaning). Ln(odds) are not usually reported in papers and reports (ORs are presented instead)
What is exp?
Exponentiate - inverse function to natural logarithm