Tutorial 6 - Binary Dependent Variables Flashcards

Question 1

Q

What are different approaches to estimate binary dependent variables?

Answer

A

linear probability model
logit
probit
complementary log-log

Question 2

Q

What is a Linear Probability model?

Answer

A

can be estimated by OLS (only the error variance differs)
Var(ε_i | x_i) = x_i'β (1 − x_i'β)
-> the error term is heteroscedastic, so the standard errors have to be adjusted (e.g. robust standard errors)
fitted values ŷ can take any value and are not restricted to the interval [0, 1] (not what we want for a probability)
the linear probability model is still often used because it is easy to interpret and keeps the flexibility of the linear model
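
The points above can be illustrated with a small simulation. This is only a sketch: the data-generating process, the sample size and all variable names are made up for illustration, and the robust standard error shown is the simple HC0 variant for a single regressor.

```python
import random

random.seed(0)

# Simulated data (illustrative assumption): P(y=1|x) = 0.5 + 0.3*x, clipped to [0, 1]
n = 500
x = [random.uniform(-3.0, 3.0) for _ in range(n)]
p_true = [min(1.0, max(0.0, 0.5 + 0.3 * xi)) for xi in x]
y = [1 if random.random() < pi else 0 for pi in p_true]

# OLS with intercept and one regressor (closed form)
x_bar = sum(x) / n
y_bar = sum(y) / n
sxx = sum((xi - x_bar) ** 2 for xi in x)
b1 = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) / sxx
b0 = y_bar - b1 * x_bar

fitted = [b0 + b1 * xi for xi in x]
resid = [yi - fi for yi, fi in zip(y, fitted)]

# Slope standard error: conventional (assumes homoscedasticity) vs. robust (HC0)
se_conv = (sum(e ** 2 for e in resid) / (n - 2) / sxx) ** 0.5
se_hc0 = sum((xi - x_bar) ** 2 * e ** 2 for xi, e in zip(x, resid)) ** 0.5 / sxx

# Fitted "probabilities" that fall outside [0, 1]
n_outside = sum(1 for f in fitted if f < 0.0 or f > 1.0)

print(f"slope = {b1:.3f}, conventional SE = {se_conv:.4f}, robust SE = {se_hc0:.4f}")
print(f"fitted values outside [0, 1]: {n_outside} of {n}")
```

The slope can be read directly as the change in P(y = 1) per unit of x, which is the easy interpretation mentioned above.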

Question 3

Q

What are latent variables?

Answer

A

latent variables are variables that are not directly observed but are rather inferred (through a mathematical model) from other variables

Question 4

Q

What does the latent variable model look like?

Answer

A

y*: latent variable, a continuous and unobserved variable driving the dependent variable
y: binary outcome variable
- y = 1 if y* > 0
- y = 0 if y* ≤ 0
e.g. preferred working time in hours (behind working full time) or ability to cover a credit in euros
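
A minimal simulation of this threshold rule, assuming a linear index y* = β0 + β1·x + ε with a standard normal error. The coefficients and the error distribution are hypothetical choices for illustration and are not given on the card.

```python
import random

random.seed(1)

beta0, beta1 = -0.5, 1.0  # hypothetical coefficients


def simulate(n):
    """Draw (x, y_star, y) triples; only x and y would be observed in practice."""
    rows = []
    for _ in range(n):
        x = random.gauss(0.0, 1.0)
        eps = random.gauss(0.0, 1.0)       # assumed standard normal error
        y_star = beta0 + beta1 * x + eps   # latent, continuous, unobserved
        y = 1 if y_star > 0 else 0         # observed binary outcome
        rows.append((x, y_star, y))
    return rows


rows = simulate(10_000)
share_ones = sum(y for _, _, y in rows) / len(rows)
print(f"share of y = 1: {share_ones:.3f}")
```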

Question 5

Q

What is the probability that the dependent variable y equals one under the latent variable model?
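
A sketch of the standard derivation, under assumptions that are not spelled out on the cards: the latent variable is y* = x'β + ε, and the error ε has distribution function F and is independent of x.

```latex
\begin{align*}
P(y = 1 \mid x) &= P(y^* > 0 \mid x) \\
                &= P(x'\beta + \varepsilon > 0 \mid x) \\
                &= P(\varepsilon > -x'\beta \mid x) \\
                &= 1 - F(-x'\beta) \\
                &= F(x'\beta) \quad \text{if the distribution of } \varepsilon \text{ is symmetric around zero}
\end{align*}
```

The last step holds for the normal (probit) and logistic (logit) error distributions, which are both symmetric around zero.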

Question 6

Q

Which question does F(x) answer?

Answer

A

F(x) answers the question of which share of the distribution (described by the density f(t)) is smaller than or equal to the value x
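
As a numerical illustration, F(x) can be approximated by integrating f(t) up to x. The sketch below uses the standard normal density as an example f(t); the lower integration bound of -10 and the step count are arbitrary choices.

```python
import math


def f(t):
    """Standard normal density, used here as an example f(t)."""
    return math.exp(-0.5 * t * t) / math.sqrt(2.0 * math.pi)


def F_numeric(x, lower=-10.0, steps=20_000):
    """Approximate F(x), the integral of f(t) up to x, with the trapezoid rule."""
    h = (x - lower) / steps
    total = 0.5 * (f(lower) + f(x))
    for i in range(1, steps):
        total += f(lower + i * h)
    return total * h


def F_exact(x):
    """Closed form of the standard normal distribution function via erf."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


for x in (-1.0, 0.0, 1.96):
    print(f"x = {x:5.2f}: numeric = {F_numeric(x):.6f}, exact = {F_exact(x):.6f}")
```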

Question 7

Q

What is the distribution function for probit?

Answer

A

Φ(x) = ∫_{−∞}^{x} (1/√(2π)) exp(−t²/2) dt, where Φ(x) is the distribution function of the standard normal distribution -> probit

Question 8

Q

What is the distribution function for logit?

Answer

A

Λ(x) = exp(x) / (1 + exp(x)), the standard logistic distribution function -> logit

Question 9

Q

What is the complementary log-log model?

Answer

A

A third alternative, next to logistic regression and probit analysis, for binary response variables.
Frequently used when the probability of an event is very small or very large -> has advantages in cases with average probabilities close to zero or one.
Unlike logit and probit, the complementary log-log function is asymmetric.

Question 10

Q

What is the distribution function for the complementary log-log model?

Answer

A

extreme value distribution function: F(x) = 1 − exp(−exp(x))
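
A short comparison of the three distribution functions may help here. The sketch uses the usual textbook forms (standard normal, standard logistic, and 1 − exp(−exp(z)) for the complementary log-log); the function names are illustrative.

```python
import math


def probit_cdf(z):
    """Standard normal distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def logit_cdf(z):
    """Standard logistic distribution function."""
    return 1.0 / (1.0 + math.exp(-z))


def cloglog_cdf(z):
    """Complementary log-log (extreme value) distribution function."""
    return 1.0 - math.exp(-math.exp(z))


print("    z   probit    logit  cloglog")
for z in (-2.0, -1.0, 0.0, 1.0, 2.0):
    print(f"{z:5.1f} {probit_cdf(z):8.4f} {logit_cdf(z):8.4f} {cloglog_cdf(z):8.4f}")

# Symmetry check: F(z) + F(-z) equals 1 for probit and logit, but not for cloglog
for name, F in (("probit", probit_cdf), ("logit", logit_cdf), ("cloglog", cloglog_cdf)):
    print(f"{name}: F(1) + F(-1) = {F(1.0) + F(-1.0):.4f}")
```

The symmetry check makes the asymmetry visible: probit and logit give 0.5 at z = 0, while the complementary log-log function gives 1 − exp(−1) there.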

Question 11

Q

What type of estimator do the linear probability, logit, probit and complementary log-log models have?

Question 12

Q

What are the distribution functions for the linear probability, logit, probit and complementary log-log models?

Question 13

Q

What are the two ways of interpreting probabilistic models?

Answer

A

average marginal effect (marginal effect computed for each observation, then averaged)
marginal effect evaluated at the average (of the covariates)
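
The two quantities generally differ in nonlinear models. Below is a minimal sketch for a logit model with one covariate, where the marginal effect at a point is the logistic density times the coefficient; the coefficients and data are made-up numbers, not estimates from any real sample.

```python
import math


def logistic_pdf(z):
    """Density of the standard logistic distribution: p * (1 - p)."""
    p = 1.0 / (1.0 + math.exp(-z))
    return p * (1.0 - p)


beta0, beta1 = -1.0, 0.8                # hypothetical fitted logit coefficients
x = [-2.0, -0.5, 0.0, 1.0, 3.0, 4.5]    # made-up covariate values

# Average marginal effect: compute the effect for each observation, then average
ame = sum(logistic_pdf(beta0 + beta1 * xi) * beta1 for xi in x) / len(x)

# Marginal effect at the average: evaluate the effect once, at the mean of x
x_bar = sum(x) / len(x)
mem = logistic_pdf(beta0 + beta1 * x_bar) * beta1

print(f"average marginal effect:        {ame:.4f}")
print(f"marginal effect at the average: {mem:.4f}")
```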

Question 14

Q

Which goodness-of-fit tests can you use for models with binary dependent variables?

Answer

A

Pearson’s test
Hosmer-Lemeshow test

(both use the same form of test statistic, but form the groups differently)

Question 15

Q

How can you apply Pearson’s test for goodness of fit?

Answer

A

form M groups according to covariates:
- n_j: number of observations in group j
- Y_j: number of observations in group j with y = 1
- p̂_j: predicted probability of y = 1 in group j
The sum of squared Pearson residuals (group residuals), Σ_j (Y_j − n_j p̂_j)² / (n_j p̂_j (1 − p̂_j)), is approximately χ²-distributed with M − K degrees of freedom
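
The statistic can be computed directly from the grouped quantities. This is a sketch with made-up group data; `pearson_chi2` is an illustrative helper, and comparing the result with a χ² critical value is left out.

```python
def pearson_chi2(groups):
    """Sum of squared Pearson residuals over groups.

    groups: iterable of (n_j, y_j, p_hat_j) with
      n_j     number of observations in group j
      y_j     number of observations with y = 1 in group j
      p_hat_j predicted probability of y = 1 in group j (strictly between 0 and 1)
    """
    stat = 0.0
    for n_j, y_j, p_hat_j in groups:
        expected = n_j * p_hat_j
        variance = n_j * p_hat_j * (1.0 - p_hat_j)
        stat += (y_j - expected) ** 2 / variance
    return stat


# Made-up example with M = 3 covariate groups
groups = [(50, 10, 0.2), (40, 22, 0.5), (30, 24, 0.75)]
stat = pearson_chi2(groups)
print(f"Pearson chi-squared statistic: {stat:.3f}")
```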

Question 16

Q

How can you apply the Hosmer-Lemeshow test for goodness of fit for models with binary dependent variables?

Answer

A

form M groups according to the predicted probability p̂ (typically ten equally large groups, with 0.0-0.1, 0.1-0.2, …):
- n_j: number of observations in group j
- Y_j: number of observations in group j with y = 1
- p̂_j: predicted probability of y = 1 in group j
The test statistic is χ²-distributed with M − K degrees of freedom
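
A rough sketch of the grouping step and the statistic. It assumes equal-sized groups formed by sorting on p̂ and uses the mean p̂ within each group as p̂_j. The function name and the simulated data are illustrative only, and the comparison with a χ² critical value is omitted.

```python
import random


def hosmer_lemeshow_stat(y, p_hat, g=10):
    """Grouped Pearson-type statistic with g equal-sized groups sorted by p_hat."""
    pairs = sorted(zip(p_hat, y))
    n = len(pairs)
    stat = 0.0
    for j in range(g):
        chunk = pairs[j * n // g:(j + 1) * n // g]
        n_j = len(chunk)
        if n_j == 0:
            continue
        y_j = sum(y_i for _, y_i in chunk)            # observed number of ones
        p_bar_j = sum(p_i for p_i, _ in chunk) / n_j  # mean predicted probability
        # Assumes 0 < p_bar_j < 1, otherwise the denominator is zero
        stat += (y_j - n_j * p_bar_j) ** 2 / (n_j * p_bar_j * (1.0 - p_bar_j))
    return stat


# Simulated check: well-calibrated predictions vs. predictions that are too low
random.seed(2)
p_true = [random.uniform(0.05, 0.95) for _ in range(2000)]
y = [1 if random.random() < p else 0 for p in p_true]

stat_good = hosmer_lemeshow_stat(y, p_true)
stat_bad = hosmer_lemeshow_stat(y, [0.5 * p for p in p_true])
print(f"calibrated predictions:    {stat_good:.2f}")
print(f"miscalibrated predictions: {stat_bad:.2f}")
```

A large statistic signals that predicted and observed shares of ones diverge across the groups.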