Binary responses Flashcards

1
Q

what does the line B0 +B1X mean?

Linear regressor with a single regressor
Y = B0 +B1X+ u

A

The conditional mean of Y given X

E[Y|X]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what does B1 mean?

Linear regressor with a single regressor
Y = B0 +B1X+ u

A

The partial affect of X on E[Y|X] (on conditional mean of Y given X)

^E[Y|X] / ^X

^ = average

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What hoes the predicted value ^Y mean?

^Y(estimation of Y)

A

A predicted conditional mean of Y given X

^E[Y|X] - ^b0 +^b1X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what does the line B0 +B1X mean?

E[Y|X] - b0+b1X

A

P(Y=1|X)

Probability of Y +1 is conditional on X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Linear probability model (advantages)

A
  • Simple to estimate and to interpret

- inference is the same as for multiple regression (need heteroskeddasticity - robust standard errors)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Linear probability model (disadvantages)

A
  • A LPM says that the change in the predicted probability for a given change in X is the same for all values of X
  • Also, LPM predicted probabilities can be < 0 or > 1!

These disadvantages can be addressed by using a nonlinear probability model: probit and logit regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

a nonlinear functional form is used when we want:

A
  1. P(Y=1|X) to be increasing in X for b1 >0,

2. 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Differences between linear and nonlinear functions (FORMULAS)

A

Linear: P (Y+1|X) = b0 + b1X

Non-linear: P (Y+1|X) = F(b0 + b1X)

F is a cumulative distribution function of a random variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Probit regression model

A

P (Y=1|X) = o| ( b0 + b1X)

o| is the cdf of standard normal distribution

b0 + b1X is the “z value” of the probit model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why use normal cdf? (probit regression)

A
  • two properties are satisfied
  • easy to use
  • straightforward interpretation of “z value”
  • a latent variable model with normal error u
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Probit regression model (formula

A

P (Y=1|X) = o| ( b0 + b1X)

F(x) = [ 1 / 1 + e^ (b0 + b1X) ]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

cdf of logistic distribution

A
  • logit and probit use different probability functions, the coeficients (b’s) are different in logit and probit
  • in practice, logit and probit are very similar - since empirical results typically don’t hinge on the logit/ probit choice, both tend to be used in practice
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

P (Y=1|X) = o| ( b0 + b1X)

how can we estimate b0 and b1 ?

A
  • Non linear least square

- Maximum likelihood estimation (MLE)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is the OLS estimator?

A

It is the coefficient values that minimize the squared residuals.

min E [ Y - ( b0 + b1X) ]^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Nonlinear lest square

A
  • no explicit solution exist in general
  • Solved numerically using the computer (specialized minimization algorithms)
  • In practice, MLE is used more ( a more efficient estimator )

min E [ Y - o| ( b0 + b1X) ]^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

likelihood function

A
  • The conditional density of Y1,…,Yn given X1,…,Xn, treated as a function of the unknown parameter b0 and b1
  • the probability that data was generated with the given parameter values
17
Q

Maximum likelihood estimator (MLE)

A
  • The values of (b0, b1) that maximizes the likelihood function
  • “the probability thqat data was generated with the parameter values of MLE is the largest.”
  • MLE is consistent, asymptotically normal, and efficient ( with the minimum variance)