Tutorial 7 - Censored Variables Flashcards
What is censored data?
- Random sample
- But partial information about the value of a variable—we know it is beyond some boundary, but not how far above or below it (or 0)
What is truncated data?
Data are truncated when the data set does not include observations in the analysis that are beyond a boundary value. Having a value beyond the boundary eliminates that individual from being in the analysis.
- -> no random sample!
- -> no information on censored variables!
What is the Tobit model used for?
The Tobit model is used in situations in which the dependent variable y is censored
What are corner solutions?
The dependent variable y has bunches at certain points (typically zero) due to individual behaviour (corner solution)
- Hours worked for women
- Monthly spending on cigarettes
- Amount invested in a project
Here, the term “censoring” is also used for corner solutions.
What are the Tobit model assumptions for the dependent variable?
The latent variable y* = xβ + ϵ ϵ | x ~ N(0; σ²) iid
is assumed to :
- be a linear function of the covariates,
- with normally distributed, homoscedastic errors
How can you express “left-censoring” or “censoring from below” (at zero) for observed variable y as a function of latent variable y*?
What is the expected value of the observed variable y in a Tobit model?
What is the probability that y is positive in censored data?
with
- Φσ= cdf of normal distribution with a standard deviation of σ and a mean of zero
- Φ (.) = cdf of standard normal distribution (a standard deviation of one and a mean of zero)
In the tobit model, what is the expected value of y, given that y is positive?
In the tobit model, what is the expected value of y?
What is the partial effect of continuous regressor xK on the observed variable y in the Tobit model?
What is the partial effect of continuous regressor xK on the latent variable y* in the Tobit model?
Which partial effect is more interesting for which case of censored data?
- for “real” censoring model, it is in most cases the effect on the latent variable
- for corner solution model, it is the observed variable
What is the average partial effect (APE) of a continuous variable xKon y?
What is the average partial effect (APE) of a discrete variable xKon y?