7A Flashcards

1
Q

What are the main topics of this lecture?

A
  1. Linear probability models
  2. Logistic regression models
  3. Effect size in logistic regression
  4. Non-binary categorical dependent variables
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the two types of categorical variables?

A
  1. Binary categorical variables – Two categories (e.g., Voted = 1, Did not vote = 0).
  2. Non-binary categorical variables – More than two categories (e.g., religion: Catholic, Protestant, Muslim).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why should the dependent variable (Y) in linear regression be continuous?

A

Linear regression assumes Y follows a continuous distribution, making coefficient interpretation meaningful.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a linear probability model (LPM)?

A

A linear regression model where the dependent variable is binary (coded as 0 or 1).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the problems with linear probability models?

A
  1. The model assumes linearity, which may not hold for binary outcomes.
  2. It can predict probabilities outside the range of 0 to 1.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

When are linear probability models less problematic?

A
  1. When the mean of Y is close to 0.5.
  2. When independent variables are binary or have a limited range.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How is the regression coefficient interpreted in a linear probability model?

A

As the predicted change in probability that Y = 1 for a one-unit increase in X.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why is R² difficult to interpret in a linear probability model?

A

The binary nature of Y means points do not lie close to a regression line, making the R² less meaningful.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What should you do when using a linear probability model?

A

Compare results with logistic regression to check for consistency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is logistic regression?

A

A statistical model that estimates the probability of a binary outcome (Y = 1 or 0) using a non-linear function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why use logistic regression instead of a linear probability model?

A
  1. Ensures predictions stay between 0 and 1.
  2. Handles non-linearity in probability changes.
  3. Provides odds ratios for easier interpretation.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How does logistic regression estimate coefficients?

A

It uses maximum likelihood estimation (MLE) instead of OLS regression.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is maximum likelihood estimation (MLE)?

A

A process where the model searches for parameters that maximize the likelihood of observing the given data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the Wald test in logistic regression?

A

A statistical test used to determine if a regression coefficient is significantly different from zero (similar to the t-test in linear regression).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does the R² value mean in logistic regression?

A

There are two types of R² values in logistic regression that indicate how much variation in Y is explained by the model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the Chi² test in logistic regression?

A

A test that evaluates whether all independent variables combined explain any variation in Y.

17
Q

How do effect sizes differ between linear and logistic regression?

A

In linear regression, effect size is measured by standardized coefficients. In logistic regression, standardized coefficients do not exist, so we use odds ratios.

18
Q

What are odds ratios in logistic regression?

A

They measure how much the odds of Y = 1 change with a one-unit increase in X.

19
Q

How are odds ratios calculated?

A

The odds ratio for an independent variable is Exp(B) in SPSS output.

20
Q

What does an odds ratio of 2 mean?

A

The odds of Y = 1 double for a one-unit increase in X.

21
Q

What does an odds ratio of 0.5 mean?

A

The odds of Y = 1 are halved for a one-unit increase in X.

22
Q

What does an odds ratio of 1 mean?

A

There is no effect of X on Y.

23
Q

How do odds ratios behave for negative effects?

A

A strong negative effect produces an odds ratio less than 1, which is the inverse of a corresponding positive effect.

24
Q

How should odds ratios NOT be interpreted?

A

Odds ratios should not be confused with probability ratios.

25
What is an example of an odds ratio interpretation?
If the odds ratio for distrust in politicians is 0.564, it means that for a one-unit increase in distrust, the number of Yes-voters for every No-voter is almost cut in half.
26
How can logistic regression effect sizes be visualized?
1. Using odds ratios. 2. Making a probability graph to show how Y changes with X.
27
How should non-binary categorical dependent variables be handled?
They can be analyzed using multinomial logistic regression, but a simpler approach is to create binary dummy variables for each category.
28
How does binary recoding of non-binary dependent variables work?
Instead of using vote choice as the dependent variable, create separate binary variables for each party: Voted for this party = 1, all others = 0.