Binary Logistic Regression Flashcards

1
Q

What is a contingency table?

A

Summarises frequency distribution of each of two categorical variables and the association between them

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What can a contingency table allow us to check?

A

▪️How each potential explanatory variable is related to the DV
▪️The categories for explanatory variables are large enough (at least 5)
▪️How many missing cases for each

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the risk of an outcome?

A

The number of times the outcome of interest occurs divided by the total number of possible outcomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How do you calculate the risk ratio (relative risk)?

A

Risk of one outcome divided by risk of the other outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does a risk ratio of 1 indicate?

A

No difference between the groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do you calculate the odds of an outcome?

A

The number of times it occurs divided by the number of times it doesn’t occur

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How do you calculate the odds ratio?

A

Odds of outcome in one group compared to odds of outcome in the other group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does an odds ratio of 1 mean?

A

▪️Outcome occurs half of the time
▪️Exposure doesn’t affect the odds (no difference between groups)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does an odds ratio <1 indicate?

A

Outcome occurs less than half the time

Indicates exposure is associated with reduced risk of developing outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does an odds ratio >1 indicate?

A

Outcome occurs more than half the time

Indicates exposure is associated with increased risk of developing outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Which ratio can be used for case-control studies?

A

Odds ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When can you NOT use risk ratios and why?

A

In case control studies where selection of subjects is based on the outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What happens to RR and OR in rare outcomes?

A

They will be approximately equal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What can you assume about a relationship where the outcome is binary?

A

It is nonlinear, S-shaped (sigmoid) relationship

Linear cannot make sense!
▪️Outcome is limited to 0,1
▪️Goal is to separate groups, not minimise mean squared error
▪️Linear regression is highly sensitive to influential cases
▪️Assumptions of linear are violated, particularly homogeneity of variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the link function?

A

A way of relating x and Y by way of a function in a logistic regression

To model the relationship between left and right hand side of the equation

g(y) = β0 + β1x1 + ε

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is maximum likelihood?

A

An iterative process that estimates the best fitted equation

Find the coefficient that makes the observed data most likely

(try lots of models until tweaking it further doesn’t improve the fit)

17
Q

What does the coefficient in a logistic regression mean?

A

An increase of X of 1 unit will change the ODDS (log) by β

18
Q

How do you calculate the odds ratio for X?

A

The anti-log of the coefficient

(e to the power of the coefficient)

19
Q

When do you use the binary logistic regression?

A

To test whether there’s an association between variables, where the dependent variable is categorical on a binary scale

20
Q

What assumptions are needed for the binary logistic regression?

A

▪️Binary dependent variable has a Bernoulli distribution
▪️It’s linearly related to the predictor only after transforming into the logit scale
▪️observations are independent
▪️continuous variables have a linear effect on the log-odds scale

21
Q

If you have a continuous outcome/DV, what type of regression should you use?

A

Linear regression

22
Q

If you have a binary (categorical) outcome/DV, what type of regression should you use?

A

Binary logistic regression

23
Q

What ratio can you calculate in a case-control study?

A

Odds ratio

(NOT risk!)

24
Q

What ratio can you calculate in a prospective cohort study?

A

Odds OR risk ratio

25
Q

What is incidence?

A

Aka risk

e.g., number of people who got disease when exposed over the total number of people who were exposed