7A Flashcards

1
Q

What are the main topics of this lecture?

A
  1. Linear probability models
  2. Logistic regression models
  3. Effect size in logistic regression
  4. Non-binary categorical dependent variables
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the two types of categorical variables?

A
  1. Binary categorical variables – Two categories (e.g., Voted = 1, Did not vote = 0).
  2. Non-binary categorical variables – More than two categories (e.g., religion: Catholic, Protestant, Muslim).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why should the dependent variable (Y) in linear regression be continuous?

A

Linear regression assumes Y follows a continuous distribution, making coefficient interpretation meaningful.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a linear probability model (LPM)?

A

A linear regression model where the dependent variable is binary (coded as 0 or 1).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the problems with linear probability models?

A
  1. The model assumes linearity, which may not hold for binary outcomes.
  2. It can predict probabilities outside the range of 0 to 1.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

When are linear probability models less problematic?

A
  1. When the mean of Y is close to 0.5.
  2. When independent variables are binary or have a limited range.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How is the regression coefficient interpreted in a linear probability model?

A

As the predicted change in probability that Y = 1 for a one-unit increase in X.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why is R² difficult to interpret in a linear probability model?

A

The binary nature of Y means points do not lie close to a regression line, making the R² less meaningful.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What should you do when using a linear probability model?

A

Compare results with logistic regression to check for consistency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is logistic regression?

A

A statistical model that estimates the probability of a binary outcome (Y = 1 or 0) using a non-linear function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why use logistic regression instead of a linear probability model?

A
  1. Ensures predictions stay between 0 and 1.
  2. Handles non-linearity in probability changes.
  3. Provides odds ratios for easier interpretation.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How does logistic regression estimate coefficients?

A

It uses maximum likelihood estimation (MLE) instead of OLS regression.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is maximum likelihood estimation (MLE)?

A

A process where the model searches for parameters that maximize the likelihood of observing the given data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the Wald test in logistic regression?

A

A statistical test used to determine if a regression coefficient is significantly different from zero (similar to the t-test in linear regression).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does the R² value mean in logistic regression?

A

There are two types of R² values in logistic regression that indicate how much variation in Y is explained by the model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the Chi² test in logistic regression?

A

A test that evaluates whether all independent variables combined explain any variation in Y.

17
Q

How do effect sizes differ between linear and logistic regression?

A

In linear regression, effect size is measured by standardized coefficients. In logistic regression, standardized coefficients do not exist, so we use odds ratios.

18
Q

What are odds ratios in logistic regression?

A

They measure how much the odds of Y = 1 change with a one-unit increase in X.

19
Q

How are odds ratios calculated?

A

The odds ratio for an independent variable is Exp(B) in SPSS output.

20
Q

What does an odds ratio of 2 mean?

A

The odds of Y = 1 double for a one-unit increase in X.

21
Q

What does an odds ratio of 0.5 mean?

A

The odds of Y = 1 are halved for a one-unit increase in X.

22
Q

What does an odds ratio of 1 mean?

A

There is no effect of X on Y.

23
Q

How do odds ratios behave for negative effects?

A

A strong negative effect produces an odds ratio less than 1, which is the inverse of a corresponding positive effect.

24
Q

How should odds ratios NOT be interpreted?

A

Odds ratios should not be confused with probability ratios.

25
Q

What is an example of an odds ratio interpretation?

A

If the odds ratio for distrust in politicians is 0.564, it means that for a one-unit increase in distrust, the number of Yes-voters for every No-voter is almost cut in half.

26
Q

How can logistic regression effect sizes be visualized?

A
  1. Using odds ratios.
  2. Making a probability graph to show how Y changes with X.
27
Q

How should non-binary categorical dependent variables be handled?

A

They can be analyzed using multinomial logistic regression, but a simpler approach is to create binary dummy variables for each category.

28
Q

How does binary recoding of non-binary dependent variables work?

A

Instead of using vote choice as the dependent variable, create separate binary variables for each party: Voted for this party = 1, all others = 0.