L5 - Logistic Regression Flashcards

1
Q

What is the aim of Logistic Regression?

A

To find the best combination of variables which correctly classify cases according to their membership in the two categories defined by the dependent measure

(0,1) or (1,2).

i.e. binary logistic regression is figuring out the best predictor of whether you belong to a particular group​

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

When do you use Logistic Regression?

A

When the dependent measure is binary

  • (two possible outcomes, i.e. yes or no)*
  • e.g. Are you a smoker? - Are you depressed?*
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Does Logistic Regression give an outcome based on their estaimated or actual score, or a probability?

A

Probability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the probability outcome of logistic regression telling us?

A

The probability that the person is in the higher-value group

(i.e., 1 if the pair were coded 0,1 or 2 if it were 1,2).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is this formula for?

A

Logistic Regression

Determining the probability that a person belongs to the higher variable group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Can Logistic Regression have multiple IV’s?

A

Yes

Only the DV is binary

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

The first thing to do when conducting a logistic regression is identify the significant predictors.

What test/statistic do we use to identify which IV is the most significant predictor of the DV?

A

The Wald Statistic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the Wald Statistic?

A

Statistical test that tells us whether the predictor (IV) is a significant predictor in our logistic regression

Like a t-test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the “Exp (B)” stand for in logistic regression?

A

The Odds-ratio.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does the Odds-ratio (Exp (B)) tell us in logistic regression?

A

Indicates the change in probability (P) (E, i.e., membership in the higher value group) resulting from a one unit change in the value of that predictor variable (IV) score.

For example, 3.42 for gender, means that a one unit change in Gender, i.e., going from Male to Female (assuming male =1, female=2 for this variable), leads to a 3.42 greater probability of being in the higher of the two groups defined by the dependent measure.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

If males = 1 and females = 2 and Exp (B) is 3.42; this means that females (higher IV) are 3.42 times more likely to engage in the behaviour than males.

True or False?

A

True.

Girls are 3.42x more likely to engage in the behaviour when compared to males.

It refers to how much the probability of them engaging in the behaviour increases if they belong to that IV.

UNSURE double check

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a Z equation in logistic regression?

A

The linear combination of the variables.

i.e., Z = B 0 + X 1 .B 1 + X 2 .B 2 + ……+ X n .B n .

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does an odds-ratio (OR) of “1” indicate?

A

Chance

(no predictive value, variable makes no difference)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

In the Z equation below, what does B<strong>0,</strong> B and X’s refer to?

Z = B0 + X 1 .B 1 + X 2 .B 2 + ……+ X n .B n .

A

B<strong>0 </strong>= the constant

All other B values are the regression coefficients

X’s are the actual values each person scored on each variable in the equation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Z equation = Z = B 0 + X 1 .B 1 + X 2 .B 2 + ……+ X n .B n

Imagine gender were X2

Holding everything else constant, we could calculate a Z value with X2/gender = 1

This value is entered in the P(E) calculation below.

We get an overall probability of membership in the ‘higher’ group defined by the DV of 24% of being in the higher group)

If the odds-ratio is 3.42 for gender. What would the value of P(E) become if we increased X2 value from 1 to 2?

A

P(E) becomes 3.42 times larger to 82%

24% x 3.42 = 82%

  • If P(E) were 3 it would be 24x 6.84*
  • (remember, 1 is chance)*
  • important*
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

In this logistic regression example, which of the 3 predictor variables are significant predictors of being part of the higher DV?

What difference in probability does each increase in the IV have on the DV?

A

Age and Gender are significant as p < .05.

SES has a p > .05 and so is not significant.

Increase of IV on DV for

Age = 21%

Gender = 342%

SES = 86%

17
Q

What does the outcome variable (DV) tell us in logistic regression?

A

The best predictors for if someone belongs in a particular group.

Outcome measure is a grouping (binary) variable.

Trying to predict which group one should be in.

18
Q

When predicting the DV in logistic regression, typically the more IV’s you belong to the ___ likelihood of belonging to the group.

A

greater

e.g. with problem gamblers; if they belong to the IV groups of 1. get to casino early; 2. use a credit card and not cash; 3. drink while playing - they are likely to be a problem gambler

19
Q

What percent do you have of being part of a group in a “chance” model of logistic regression?

A

50%

20
Q

What is considered a good logistic regression model predictor rate?

A

70-80% classified correctly

21
Q

What statistic do you use to determine whether a probability variable is significant?

A

Wald statistic

p < .05​

22
Q

If you get a logistic regression model that finds independent variables that are not significant, what should your next step typically be?

A

Re-run the model without the insignificant predictor variables

23
Q

After looking at the Wald statistic, where do you look in the logistic regression model?

A

Odds-Ratio

24
Q

Why would the Odds-Ratio of age be much smaller than gender?

(why can’t you necessarily compare all odds-ratios?)

A

Because 1 unit of age is 1 year (not much difference), whereas 1 unit of gender is a completely different sex (large difference)

25
Q

What does a 1.0 Odds-Ratio indicate?

A

That the predictor variable makes no difference whether or not you are likely to be part of the group of interest

26
Q

Why is Logistic Regression and understanding Odds-ratios useful?

A

You can profile people

  • E.g. when identifying if they might have cancer*
  • if they are over 50*
  • Smoker*
  • Parents had cancer*
  • The more of these that are ‘yes’ the more likely they are to have cancer*