Week 1 Flashcards

Question 1

Q

What is the Linear Probability Model?

Answer

A

A model that estimates the probability of a binary outcome with ordinary OLS, i.e. P(y_i = 1 | x_i) = x_i'β.

Question 2

Q

What is the disadvantage of using a Linear Probability Model?

Answer

A

It can estimate probabilities outside the [0, 1] range.
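
A minimal sketch of this problem, using made-up data (the values of `x` and `y` are assumptions for illustration only). OLS is fitted to a binary outcome with one regressor, and the fitted "probabilities" at the extremes fall below 0 and above 1:

```python
# Simple OLS of a binary outcome on one regressor (a linear probability model).
# The data below are invented for illustration.
x = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
y = [0, 0, 0, 1, 1, 1]

n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n

# Closed-form OLS slope and intercept for a single regressor.
s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
s_xx = sum((xi - x_bar) ** 2 for xi in x)
slope = s_xy / s_xx
intercept = y_bar - slope * x_bar

# Fitted values are meant to be probabilities, but nothing bounds them.
fitted = [intercept + slope * xi for xi in x]
out_of_range = [p for p in fitted if p < 0.0 or p > 1.0]

print("fitted:", [round(p, 3) for p in fitted])
print("outside [0, 1]:", [round(p, 3) for p in out_of_range])
```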

Question 3

Q

What are two models we can use to predict probabilities of a binary variable? On what are they based?

Answer

A

We can choose the following two models:

Logit model, based on a standard logistic distribution (not a Student t distribution, although its heavier-than-normal tails resemble one)
Probit model, based on a standard normal distribution

Question 4

Q

What is a latent variable?

Answer

A

A latent variable is a variable that is not directly observed but can be inferred from observed ones. For example, we can introduce a latent variable y_i* behind the observed binary y_i.

Question 5

Q

Derive the probability P(y_i = 1 | x_i) in a binary probit/logit model for the case that y_i = 1{y_i* > 0}.

Answer

A

Note: in the image, G(.) is the CDF of the standard logistic or standard normal distribution.
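
A sketch of the standard derivation, assuming the latent model y_i* = x_i'β + e_i, where e_i is independent of x_i and has CDF G(.) (this notation is an assumption and may differ from the image). The last step uses the symmetry of G around zero, which holds for both the standard normal and the standard logistic distribution:

```latex
\begin{aligned}
P(y_i = 1 \mid x_i) &= P(y_i^* > 0 \mid x_i) \\
                    &= P(x_i'\beta + e_i > 0 \mid x_i) \\
                    &= P(e_i > -x_i'\beta \mid x_i) \\
                    &= 1 - G(-x_i'\beta) \\
                    &= G(x_i'\beta)
\end{aligned}
```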

Question 6

Q

Why can we assume e_i ~ N(0, 1) without loss of generality? (i.e. why do we not assume e_i ~ N(μ, σ²) instead?)

Answer

A

We can set: (see image).

This means that the intercept, μ and σ are not separately identified (only ratios such as β/σ are). Even with infinitely many observations we could not observe any difference, so we can set μ = 0 and σ = 1 without loss of generality.
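
A sketch of the argument, assuming a latent model with an explicit intercept, y_i* = β₀ + x_i'β + e_i with e_i ~ N(μ, σ²) (the symbol β₀ is an assumption and may not match the image). Standardising e_i shows that the data only depend on the parameters through two ratios:

```latex
\begin{aligned}
P(y_i = 1 \mid x_i) &= P(\beta_0 + x_i'\beta + e_i > 0 \mid x_i) \\
                    &= P\!\left(\frac{e_i - \mu}{\sigma} > -\frac{\beta_0 + \mu + x_i'\beta}{\sigma} \,\Big|\, x_i\right) \\
                    &= \Phi\!\left(\frac{\beta_0 + \mu}{\sigma} + x_i'\frac{\beta}{\sigma}\right)
\end{aligned}
```

Any combination of (β₀, β, μ, σ) that gives the same (β₀ + μ)/σ and β/σ yields exactly the same probabilities.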

Question 7

Q

What is the marginal effect of e.g. x₁ on the estimated value (in a binary probit/logit model)?

Question 8

Q

What is the PEA?

Answer

A

Partial Effect at the Average: the derivative of the probability function with respect to x, evaluated at the sample mean of x.
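
In formula form, assuming P(y_i = 1 | x_i) = G(x_i'β) with density g = G' and sample mean x̄ (the notation is an assumption), the PEA of regressor x_j can be sketched as:

```latex
\mathrm{PEA}_j
  = \left.\frac{\partial P(y = 1 \mid x)}{\partial x_j}\right|_{x = \bar{x}}
  = g(\bar{x}'\hat{\beta})\,\hat{\beta}_j
```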

Question 9

Q

What is the APE?

Answer

A

Average Partial Effect: the partial effect is computed for every observation and then averaged over the sample.
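
In formula terms this is APE_j = (1/n) Σ_i g(x_i'β̂) β̂_j, with g the density belonging to G. A minimal sketch for a logit model with one regressor follows. The coefficients `beta0`, `beta1` and the sample `xs` are hypothetical values chosen for illustration, not estimates from any real data. The PEA is computed alongside for comparison:

```python
import math

def logistic_pdf(z):
    """Density of the standard logistic distribution, g(z) = G(z) * (1 - G(z))."""
    cdf = 1.0 / (1.0 + math.exp(-z))
    return cdf * (1.0 - cdf)

# Hypothetical logit estimates: intercept and one slope.
beta0, beta1 = -1.0, 0.8
# Hypothetical sample values of the single regressor x.
xs = [-2.0, -1.0, 0.0, 1.0, 2.0, 3.0, 4.0]

# APE: average the individual partial effects g(x_i'beta) * beta1 over the sample.
ape = sum(logistic_pdf(beta0 + beta1 * x) * beta1 for x in xs) / len(xs)

# PEA: evaluate the partial effect once, at the sample mean of x.
x_mean = sum(xs) / len(xs)
pea = logistic_pdf(beta0 + beta1 * x_mean) * beta1

print("APE:", round(ape, 4))
print("PEA:", round(pea, 4))
```

The two numbers differ because g(.) is nonlinear, so the average of the effects is not the effect at the average.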

Question 10

Q

What are the advantages and disadvantages of the APE compared to the PEA?

Question 11

Q

What is a disadvantage of both the PEA and APE?

Question 12

Q

Derive the likelihood and log-likelihood of the binary probit and logit model.
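
A sketch of the standard derivation, assuming n independent observations and P(y_i = 1 | x_i) = G(x_i'β), with G the standard normal CDF for probit or the standard logistic CDF for logit (the notation is an assumption). Each observation is a Bernoulli draw, so:

```latex
\begin{aligned}
L(\beta)      &= \prod_{i=1}^{n} G(x_i'\beta)^{y_i}\,\bigl[1 - G(x_i'\beta)\bigr]^{1 - y_i} \\
\log L(\beta) &= \sum_{i=1}^{n} \Bigl\{ y_i \log G(x_i'\beta) + (1 - y_i)\log\bigl[1 - G(x_i'\beta)\bigr] \Bigr\}
\end{aligned}
```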

Question 13

Q

What is the z-statistic and how do we use it?

Answer

A

The z-statistic is similar to the t-statistic, but since we use MLE it is only asymptotically valid. The z-statistic is z = β̂_j / SE(β̂_j). The 95% confidence interval for β_j is β̂_j ± 1.96 · SE(β̂_j).

If 0 is not within this interval (equivalently, if |z| > 1.96) we reject H₀: β_j = 0 at the 5% level.
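
A minimal sketch of the test with made-up numbers (`beta_hat` and `se` are hypothetical, not estimates from any real model). The interval is built around the estimate as β̂_j ± 1.96 · SE, and checking whether it contains 0 gives the same decision as comparing |z| with 1.96:

```python
# Hypothetical ML estimate and standard error for one coefficient.
beta_hat = 0.42
se = 0.15

# z-statistic for H0: beta_j = 0 (asymptotically standard normal under H0).
z = beta_hat / se

# 95% confidence interval around the estimate.
ci_low = beta_hat - 1.96 * se
ci_high = beta_hat + 1.96 * se

# Two equivalent ways to state the 5%-level decision.
reject_by_z = abs(z) > 1.96
reject_by_ci = not (ci_low <= 0.0 <= ci_high)

print("z =", round(z, 3))
print("95% CI =", (round(ci_low, 3), round(ci_high, 3)))
print("reject H0:", reject_by_z)
```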

Question 14

Q

How can we compare estimated values between the binary logit and probit models?

Answer

A

We could divide them, e.g.

Question 15

Q

How can we compare parameters between the binary probit and logit models?

Answer

A

We can compare the β_i by matching the marginal effects at zero, i.e. the ratio of the two densities at 0: multiply the β of the probit model by 1.5958 (= 0.3989 / 0.25).
Alternatively, we can compare the β_i by matching the standard deviations of the standard logistic and standard normal distributions: multiply the β of the probit model by 1.8138 (= π/√3).
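
Both scaling factors can be reproduced from the two distributions. A short sketch, using only the known density of the standard normal at zero, the density of the standard logistic at zero, and the standard deviation π/√3 of the standard logistic (the variable names are illustrative):

```python
import math

# Densities at zero: standard normal phi(0) and standard logistic lambda(0).
normal_pdf_at_0 = 1.0 / math.sqrt(2.0 * math.pi)  # about 0.3989
logistic_pdf_at_0 = 0.5 * 0.5                     # G(0) * (1 - G(0)) = 0.25

# Factor that equalises the marginal effects at index zero.
marginal_effect_factor = normal_pdf_at_0 / logistic_pdf_at_0

# Standard deviations: 1 for the standard normal, pi / sqrt(3) for the standard logistic.
std_dev_factor = math.pi / math.sqrt(3.0)

print(round(marginal_effect_factor, 4))
print(round(std_dev_factor, 4))
```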