M2 - Logistic Regression Flashcards

1
Q

What differentiates LR from MR? pick best answer.

  1. LR is used for predicting group membership.
  2. LR only uses binary independent variables.
  3. LR has a dependent variable that is binary.
  4. LR output are graphs that have a straight line for the line of best fit.
A
  1. LR has a dependent variable that is binary.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Part B - Question 2: The Independent variables in a Logistic Regression should be:

  1. Binary.
  2. Continuous or binary.
  3. Ordinal.
  4. Continuous or metric.
  5. Metric.
A
  1. Continuous or Binary
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Part B - Question 3: What are key differences between ANOVA and logistic regression?

  1. The DV is binary for LR and not for ANOVA.
  2. The DV is binary for both LR and ANOVA.
  3. The DV is continuous for ANOVA and not for LR.
  4. The IV and DVs are binary for both LR and ANOVA
A
  1. The DV is binary for LR and not for ANOVA.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Part B - Question 4: Is it possible to use a continuous variable for use in Logistic regression?

  1. Continuous dependent variables can be used in a regression analyses if the scale includes a 1 and zero in the metric scale.
  2. No, a dependent variable that is continuous can only be used in a multiple regression analyses.
  3. Only if the continuous variable has more than 2 values.
  4. A cut-point can be identified on a continuous variable, and this can be used to form a binary variable.
A
  1. A cut-point can be identified on a continuous variable, and this can be used to form a binary variable.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Part C - Question 1: What is the general shape of a plotted logistic regression formula?

  1. A straight line.
  2. A parabola.
  3. An S shape.
  4. A U shape
A
  1. An S shape.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Part C - Question 2: What is the scale of the DV for the logistic regression model?

A parabolic scale.
A decimal scale.
A hexadecimal scale.
A logarithmic scale

A

A logarithmic scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Part C - Question 3: Is binary logistic regression a linear model?

Yes, it has the function of y=c+mx.
Yes, it has the formula of y=bx+c
Both a and b.
No, it is non-linear function.

A

No, it is non-linear function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Part D - Question 1: Can the approach used for MR model building be used for LR?

No, it can only use, standard model building.
No, it must use no- linear strategies.
Yes, it can use standard, sequential and statistical model building.
Yes, it must use ordinal strategies

A

Yes, it can use standard, sequential and statistical model building.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Part D - Question 2: How many ordinal outcomes can a multinomial logistic regression predict?

One continuous category.
Three or more categories.
One multivariate variable.
Two categories

A

Three or more categories.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Part D - Question 3: Which type of LR is the 4th year lecture covering?

Multinomial.
Ordinal.
Binary.

A

Binary

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Part D - Question 4: Sequential logistic regression is what?

Binary variables are randomly entered in blocks.
Binary and continuous variables are entered in blocks and pre-specified by the researcher.
Binary variables are entered in blocks and pre-specified by the researcher.
Binary and continuous variables are randomly entered in blocks

A

Binary and continuous (predictor) variables are entered in blocks and pre-specified by the researcher.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Part E - Question 1: What does target variable mean?

This is the independent variable with the most number of categories.
This is the outcome category of the dependent variable that is the focus of the research question.
This is the dependent variable with the least number of categories.
This is the variable that can be used as a independent or dependent variable

A

This is the outcome category of the dependent variable that is the focus of the research question.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Part E - Question 2: What is the reference category in logistic regression analyses?

It is the reference category used to interpret the categories of a categorical variable.
It is the reference for logistic regression.
It is the same as the Target category.
The variable that the research question is focussed on

A

It is the reference category used to interpret the categories of a categorical variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Part E - Question 3: How does SPSS choose the target category for an outcome variable in a logistic regression?

It chooses the highest numeric value.
It chooses the lowest numeric value.
SPSS does not choose the DV target category, the analyst always needs to select this to run the analysis

A

It chooses the highest numeric value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Part E - Question 4: Can a categorical DV have more than one category in logistic regression?

Categorical DVs in a logistic regression must be continuous.
Categorical DVs in logistic regression can only have two values.
Categorical DVS in a logistic regression must be ordinal.
Categorical DVs in a logistic regression can have two or more values.

A

Categorical DVs in logistic regression can only have two values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Part E - Question 5: Can a categorical IV have more than one category?

Categorical IVs in a logistic regression can have two or more values.
Categorical IVS in a logistic regression must be ordinal.
Categorical IVs in a logistic regression must be continuous.
Categorical IV variables in logistic regression can only have two values.

A

Categorical IVs in a logistic regression can have two or more values.

17
Q

Part E - Question 6: For an IV with more than one category, can any category be the reference category?

Yes, any category can be the reference variable.
No, SPSS will force a reference category, and this cannot be changed.
No, the lowest category should be the reference category.
No, the highest category should be the reference category

A

Yes, any category can be the reference variable.

18
Q

Part G - Question 1: When checking the linearity assumption in logistic regression we are doing the following:

There is linear relationship between the independent variables.
There is log linear relationship between continuous independent variables and the dependent variable.
There is a log linear relationship between categorical variables.
There is a linear relationship between all independent variables and the dependent variable

A

There is log linear relationship between continuous independent variables and the dependent variable.

19
Q

Part G - Question 2: What statistic is recommended to interpret model fit and can also be used as pseudo measure of R2?

The Cox and Snell statistic.
The Chi-square classification.
The Wald statistic.
The -2 Log Likelihood

A

The -2 Log Likelihood

20
Q

Part G - Question 3: An odds ratio > 1 means what?

Outcome is more likely for target level of IV.
No difference between groups.
Outcome is less likely for target level of IV

A

Outcome is more likely for target level of IV.

21
Q

Part G - Question 4: An odds ratio < 1 means what?

Outcome is more likely for target level of IV.
Outcome is less likely for target level of IV.
No difference between groups

A

Outcome is less likely for target level of IV.

22
Q

Part G - Question 5: An odds ratio = 1 means what?

No difference between groups.
Outcome is more likely for target level of IV.
Outcome is less likely for target level of IV

A

No difference between groups.

23
Q

Part G - Question 6: The odds ratio is also referred to as what?

Tetrachoric correlation.
Exp(B).
The standard error.
Wald statistic

A

Exp(B).

24
Q

Part G - Question 7: Odds ratios should always be interpreted in the context of what?

Sample size.
Number of correct classifications.
Number of false classifications.
Prevalence

A

Prevalence

25
Q

In what circumstances should logistic regression be used?

A
  • predicting group membership
  • DV is binary / categorical
  • continuous variables can be converted to binary using cut offs
  • Multiple IVs can be categorical or continuous
26
Q

Outline the differences between logistic regression and multiple regression in terms of

  • types and number of IVs and DVs
A

LR
IVs - multiple continuous or categorical
DVs - single binary or categorical (binomial, multinomial or ordinal)

MR
IVs - multiple continuous or categorical
DVs - single continuous

27
Q

Outline the differences between logistic regression and multiple regression in terms of

  • approach for entering variables
A

LR
Standard (entered altogether)
Sequential (entered in blocks) - theory directed
Statistical (forward to backward)

MR
Standard (forced entry)
Hierarchical (entered in blocks) - theory directed
Stepwise (statistics based, forward or backward) - only user for exploration

28
Q

Outline the differences between logistic regression and multiple regression in terms of

  • assumptions
A

LR

  • Independence of errors - errors should not be correlated (clustered data)
  • linearity - IV should have linear relationship with log of the DV
  • distribution normality - distribution should be normal and outliers should be dealt with (transformed or removed) using Standardised residuals Cook Distance
  • samples size –> 5 cases per possible combination required
  • singularity and multicollinearity
29
Q

How does the logistic function differ compared to other functions

A
log function =
log(Y/1-Y) = b0 + b1x1 + b2x2 + e
log function is an exponential function that is shaped liked an S curve
\+ve coefficients will increase Y
-ve coefficients will decrease Y
interpret using Odds Ratio

linear function = Y = bx + c
linear function is a straight line
as DV increase by 1 unit IV increases by b units

quadratic function = 
Y =  ax2 + bx + c
quadratic function is a parabola
\+ve a is happy face
-ve a is sad face
30
Q

When is categorical coding useful?

A

Useful to deal with categorical variables with 2 or more outcomes

31
Q

What is binomial, multinomial and ordinal categorical coding?

A

Binomial is 2 (Yes/No)
Multinomial is for nominal groups eg brown = 1, blue = 2, green = 3
Ordinal is multinomial moving in a progressive way eg education level achieved 1 = high school, 2 = grad school, 3 = postgrad

32
Q

Which group of DV category should be the referent in binomial logistic regression and how should it be coded?

A

Referent group should be coded lower and should be the group you that is not the target of interest ie control group

33
Q

For IVs with 2 and 3 categories, how should they be coded?

A

Binomial categorical IV -SPSS will assign automatically

  • target should be the group of interest
  • referent should be the group not of interest

Multinomial categorical IVs

  • can be anyway
  • needs to make sense
  • set the referent as the variable you are most interested in so other groups can be compared directly to that one
34
Q

What does the choice of referent group in categorical coding impact?

A

interpretation of the DV

interpretation of the coefficient o the IVs

35
Q

Name the model overall approaches for logistic regression interpretation

A

Model Improvement

  • 2LL change
  • 2LL proportion (% improvement of model fit)

Classification Accuracy
% correct
% improvement relative to baseline

36
Q

Describe model improvement methods for interpreting LR

A
-2LL change
= -2LLbase - -2LLnew
= -2LL change (Omnibus test)
used for nested models only
significant at p =.05 if 1 df > 3.84

-2LL proportion
= x2 model / -2LLbase
then transform to improvement of model fit

37
Q

Describe classification accuracy methods for interpreting LR

A

Classification accuracy
1. % correct
= the # of correctly predicted to be in one group and not in another group

  1. % improvement
    = the # of correctly predicted relative to if everyone was predicted to be in the category with the most outcomes
    = hits + correct rejection -nmax / sample n - nmax
    then transform to % for model improvement over baseline
38
Q

Name and explain the individual predictors of LR interpretation

A

b weight = change in log odds of Y =1 for 1 unit change in the IV
Odd Ratio = change in likelihood of Y = 1 for 1 unit change in IV
<1 = less likely, > 1 is more likely, 1 = no difference
Significance test - Wald’s test

39
Q

Where do you find the log odds and odd ratio in SPSS output?

A

log odds - b weight = B
For every unit increase level of group, the log odds of being in the outcome increase by B units

Odd ratio - Exp(B)
likelihood of increase relative to referent group