Week 7: Logistic Regression and Odds Ratio Flashcards

1
Q

What type of data is logistic regression used for?

A

Dichotomous/binary outcomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How is “odds” defined in logistic regression?

A

Odds = number with the event / number without the event

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How is the odds ratio (OR) interpreted?

A
  • OR < 1: Outcome is less likely to occur
  • OR > 1: Outcome is more likely to occur
  • OR = 1: No difference in likelihood
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does it mean if the 95% CI for OR includes 1?

A

The result is not statistically significant at p < 0.05

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How are logistic regression models fitted in statistical software?

A

Using the natural log of the odds (ln(odds)) as the outcome variable. Ln(odds) can take any positive or negative value. In Stata, ln(odds) is referred to as coefficient.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why use logistic regression instead of chi-square tests?

A

Logistic regression allows the inclusion of multiple explanatory variables and continuous variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the rule of thumb for the number of explanatory variables in logistic regression?

A

One explanatory variable for every 10 participants in the smallest outcome group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What assumptions must be met?

A
  • Data must be independent
  • Explanatory variables should not be highly correlated
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How do you calculate a 95% CI for ln(odds)?

A

Lower limit = coefficient - (SE1.96), upper limit = coefficient + (SE1.96)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the significance of the Wald test?

A

Tests the H0 that there is no relationship between variables by examining if true parameter ln(OR) = 0
If z is greater than +/- 1.96 then it is statistically significant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How does logistic regression handle categorical explanatory variables with more than two categories?

A

It assigns one category as the reference and calculates ORs for the others relative to it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How can logistic regression coefficients be interpreted for continuous variables?

A

They represent the change in ln(odds) for a one-unit increase in the variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the formula to convert coefficients and ORs to ORs for larger units (e.g., 10-unit changes)?

A

If using coefficients, multiply by 10 then exponentiate i.e., exp(coefficient*10)
If using ORs, raise to the power of 10

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is ln?

A

Natural logarithm (sometimes written log or log_e, although log has a different mathematical meaning). Ln(odds) are not usually reported in papers and reports (ORs are presented instead)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is exp?

A

Exponentiate - inverse function to natural logarithm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How do cohort and case-control studies differ?

A
  • Cohort study - select two groups of people based on exposure (e.g., smokers versus non-smokers), compare outcome (e.g., cancer/non-cancer)
  • Case-control study - select two groups of people based on outcome (cancer versus non-cancer), compare exposure (e.g., smoking/non-smoking)
17
Q

How do you calculate OR?

A

Odds in those with a characteristic / odds in those without a characteristic

18
Q

How is Z calculated?

A

Z = coefficient / SE

19
Q

What is the equation for ln(odds)?

A

ln(odds) = β0 + β1 * x

20
Q

How do you calculate odds from ln(odds)?

A

Exponentiate ln(odds)
Odds = exp(ln(odds))

21
Q

On ln(odds) scale, what does it mean if the 95% CI includes 0?

A

There is not enough evidence to reject H0

22
Q

What does ‘i’ signify in logistic regression syntax?

A

Ensures a separate OR for each category (applicable to variables with > 3 categories)

23
Q

What is ‘b1’ used for in syntax?

A

Switch reference category without recoding (applicable to variables with 2 categories)