Tutorial 4 - Frisch-Waugh Theorem, Omitted Variables, Instrumentral Variables Estimation Flashcards
Say we are interested in β₂: the effect of x on y, after controlling for w (=holding w constant).
How can we obtain the estimator ^β₂ according to Frisch-Waugh?
- Regress y on a constant and w. Take the residuals from this regression -> yres,w (the variation in y that cannot be explained by w)
- Regress x on a constant and w. Take the residuals from this regression -> xres,w (the variation in x that can not be explained by w)
- Regress yres,w on xres,w. The coefficient of xres,w is the same as the coefficient of x in the multivariate regression
This procedure illustrates the idea of filtering out the effect of w when estimating the effect of x on y.
Which conditions are met when the OLS estimator ^β is unbiased?
Unbiased: E(^β) = β
Note: If it is unbiased, that also means it is consistent:
What does this mean?
The OLS estimator is consistent:
How can you apply the Frisch Waugh theorem here?
- Regress log(invest) on a constant and trend and take residuals (“de-trended housing investment”)
- Regress log(price) on a constant and trend and take residuals (“de-trended house prices”)
- Regress the de-trended investment on de-trended prices.
Show that under the assumption of mean independence of the error term given the regressors, the OLS estimator is unbiased!
Show that the OLS estimator is consistent (under the assumption that the regressors and the error term are uncorrelated)!
When is the OLS estimator unbiased?
if E(ϵ|X) = 0
i.e. if the error term and the regressors are mean independent.
When is the the OLS estimator consistent?
if Cov(x, ϵ) = 0,
i.e. if the error term and the regressors are uncorrelated.
How are unbiasedness and consistency related with each other?
Note that mean independence implies uncorrelatedness, but not vice versa.
Thus: if the OLS estimator is unbiased, it is also consistent. But if the OLS estimator is consistent, it need not be unbiased.
How do you call regressors that are correlated with the error term, Cov(x, ϵ) ≠ 0, i.e. where the OLS estimator will be inconsistent and biased?
the regressors x are endogenous
Why can regressors be endogenous?
- Omitted variables: x and y are both driven by a third, unobserved, variable
- Simultaneity: y causes x, rather than vice versa
- Measurement Error: x is measured with error
- Dynamic Models with Serially Correlated Errors
Which conditions does an instrumental variable z have to fulfill?
- Instrument relevance
- Instrument exogeneity
What is “instrument relevance”?
The instrument has to be correlated with the endogenous variable Cov(zₖ, xₖ ) ≠ 0.
In the present case with multiple covariates, a stronger condition is that in the regression below, the parameter 𝛿ₖ ≠ 0, i.e. the instrument zₖ has to be correlated with xₖ also when controlling for all exogenous covariates!
What is instrument exogeneity?
The instrument has to be uncorrelated with the error term. It has to affect the outcome only via the endogenous variable, i.e. it should not have a direct effect on the outcome:
How can you test relevance of the instrument with instrumental variables?
It can be tested by estimating the first stage regression (below) and use a usual t-test for H₀: 𝛿ₖ = 0. -> the more significant, the better.
For multiple instruments z₁, …, zₗ: use a test of joint significance of the instruments in the first stage. Rule of thumb: F-statistic >10 to have a “strong” instrument