Lecture 6 - Binary Logistic Regression Flashcards

1
Q

Logistic Regression is…?

A

….a modified form of the linear regression framework we have learned about so far. Logistic regression modifies the output from the linear regression model to transform it from a linear line to a sigmoidal curve, which has bounds of 0-1. Normal linear regression does not have bounds and in theory can run to infinity in either direction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

The point of logistic regression is…?

A

…to model a criterion variable which is binary i.e. one that can only take on one of two values. Just like in linear regression, the predictor variables can be continuous or categorical, and multiple predictors are fine.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The output from a logistic regression is usually interpreted as…?

A

…providing a probabilistic ‘guess’, based on the predictor variable scores, as to whether the criterion variable will be a 0 or a 1. So, if for a given person the output of the model is 0.9, this can be considered a 90% guess that that individual has a ‘1’ on the criterion variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

In the SPSS output, where is the difference in -2LL between the model and model without predictors?

A

In the Omnibus table, under chi-square.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

In the SPSS output, where is the equivalent of SSE-left, but for -2LL, between the model and model without predictors?

A

In the Model Summary table, under -2 log-likelihood.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the Likelihood ratio?

A

The difference in -2LL between the model and model without predictors.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the name of the equivalent for Likelihood ratio in linear regression?

A

SSE-reduced.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Using the SPSS output from Logistic regression. How can you calculate the -2LL of the model with no predictors (the equivalent of SSE-total)?

A

Add the likelihood ratio from the Omnibus table with the -2LL (SEE-left equivalent) from the Model Summary table.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the -2LL?

A

-2 Log Likelihood. It is -2 multiplied with the Log likelihood.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why do we use -2LL and not only the Log Likelihood?

A

Because the -2LL has a known distribution, the chi-square distribution, making it possible to compare two values of -2LL and calculate a p-value for the difference between these two values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How do you calculate Hosmer & Lemeshow’s r^2?

A

(Likelihood ratio) / (-2LL of the model with no predictors )

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does the Likelihood ratio divided by -2LL of the model with no predictors calculate?

A

Hosmer & Lemeshow’s r^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does Hosmer and Lemmeshows r^2 mean?

A

What proportion of -2LL we reduced by including predictors in the model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

In the “Variables in the Equations” table, what does the “B” column denote?

A

The b0(bottom) and b1(top) of the model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the “Wald” number/value and how is calculated in logistic regression?

A

Wald is a measure that can be thought of as the equivalent to the t-value in linear regression. It compares a predictor’s b1 to the predictor’s SE (being the “b1” of the no-predictor model. This generates a p-value. Wald is calculated by:

(b1 for the predictor / SE for the predictor) ^2 = Wald

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Formula for odds?

A

Odds=P/(1-P) P = probability

17
Q

Formula for probability from odds?

A

Probability=Odds/(1+Odds)

18
Q

what is Exp(B)

A

Exp(B) tells us how much the odds of Y go up/down for every 1 increase in X.

19
Q

How do you get from the probability of Y when X = 9.0 to the probability of Y when X = 10.0, using calculations involving odds?

A
  1. Convert the probability of Y when X = 9.0 into Odds.
  2. Apply Exp(B)(10-9) to the Odds:
    ((10-9)
    Exp(B)*Odds) = New Odds
  3. Convert New Odds back to probability
20
Q

When would a “Chi-square test” be used in traditional psychology?

A

When you have BOTH a binary criterion variable AND a Binary predictor.

21
Q

When can Chi-square tests NOT be used?

A

Chi square can’t handle a continuous predictor variable, or multiple predictors, so in that situation students would then be taught to use logistic regression

22
Q

What is the general linear model called after it has been transformed?

A

the generalized linear model

23
Q

Name two advantages of using -2LL instead of SSE as a measure of goodness of fit.

A
  1. -2LL can be compared directly to another -2LL. When using SSE, we first have to calculate F.
  2. Using -2LL allow us to compare models with the same number of predictors, which is much harder to do using SSE or F-values.
24
Q

What is the validity of that measurement tool?

A

to what extent the tool is measuring what we really want to measure, rather than something else

25
Q

what are Nuisance Variables?

A

things that we do not want to measure but also affect how people respond to our measurement tools.

26
Q

Describe Nominal Data

A

Options have:
No order
No specific interval
No true zero

e.g. colours, gender, ethnicity

27
Q

Describe Ordinal Data

A

Options have:
- Order

(No specific interval)
(No true zero)

e.g. 1st 2nd 3rd (race positions)

28
Q

Describe interval Data

A

Options have:

  • Order
  • Specific interval

(No true zero)

eg. likart scale: bad—neutral—good

29
Q

Describe Ratio Data

A

Options have:

  • Order
  • Specific interval
  • True zero

e.g. 0-100%, reaction time.

30
Q

How does the public often interpret interval scales like a “5-star system”?

A

Public interpret these points as ordinal rather than interval data. 5 means good, not perfect, and 4 means something is wrong, not almost perfect. Cultures of what each point on the scale ‘means’ can develop.

31
Q

What type of test needs to be used to interpret ordinal data?

A

non-parametric tests