Classification Flashcards

1
Q

Why doesn’t linear regression work for qualitative data?

A

Theoretically, we can assign numerical values to qualitative response variables, but this would imply an ordering on the outcomes, insisting the difference between each of the variables are the same. Interpreting the estimates in this case would be very difficult.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Logistic Regression

A

Helps with modeling qualitative data with binomial categorical responses. Want our model to provide predictions between 0 and 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What will help fit a logistic regression?

A

Maximum Likelihood

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Formula for logistic regression

A

P(x) = e^B0 + B1 (x) / 1 + [e^B0 + B1 (x)]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Z-Statistic

A

Large absolute value of z-statistic indicates evidence that our coefficient is significantly different than zero – keep in the model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

P-values

A

Looking for values less than 0.05

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Difference between logit and probit models

A

Logistic uses the cumulative distribution function of the logistic distribution and probit uses the cumulative distribution function of the standard normal distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Heteroskedasticity

A

Variance of error terms is not constant across different observations. Probit models generally account for this.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Rand Accuracy

A

The test to use to evaluate the test sample performance of the logistic and probit models. Not make sense to use R^2 since logit and probit models are non-linear.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly