Lecture 5 (Instrumental variables (IV)) Flashcards
Explain qualitatively the basic IV-setup. What are we doing to achieve a causal effect?
IV isolates only the part of the variation in the endogenous variables that is due to exogenous factors (i.e., the variation in the instrument). We then use only this part of the variation to identify the effect of the endogenous variable on the outcome.
What are the basic IV stages
The basic stages:
Z → X = First stage
Z → Y = Reduced form = ITT
X → Y = Structural equation (second stage)
What are the identification we need to think about in IV?
Identification:
- Relevance: $cov(X_i,Z_i)\neq0$
- Validity: $cov(u_i, Z_i )=0$
- Randomization
- Exclusion
How do we test for randomization in IV?
- Show a balance table for pre-treatment covariates across different values of our instrument.
- Z is uncorrelated with the other covariates in our study (this is the same thing as above but from PPL).
- Add controls to show that the estimates do not change (as above but from PPL)
How do we test for exclusion in IV?
- Can not directly be tested!
- Comes from economic reasoning and carefully thinking about violations!
- Specific to each instrument and outcome variable.
- Run a refutability test (PPL)
- If we find a sample where there is no first stage, we should have no reduced form effect. If we do, there is a violation of the exclusion restriction.
- Show that no other studies use the same instrument (PPL)
- If we have collective use of the same instrument, there will be a violation of the exclusion restriction.
- The same instrument can’t be used by anyone else.
How can we test for instrument relevance in IV?
- See if there exists a first stage!
- Run first-stage regression and look at F-value
- Should be > 10 or > 105 depending on the literature.
Can also look at tF- stat and AR-test as an alternative to F-stat.
- Should be > 10 or > 105 depending on the literature.
- Run first-stage regression and look at F-value
Use method of moments to derive the IV estimator
We can then derive $\hat \beta$ using the method of moments. Starting with the assumption:
$$
E[z’e] =0
$$
$$
E[z’(y-x’\beta)] =0
$$
$$
E[z’y]=E[z’x]\beta
$$
$$
\hat\beta =E[z’x]^{-1}E[z’y]
$$
$$
\hat\beta =(Z’X)^{-1}Z’Y
$$
Explain what is meant by “exclusion restriction”?
The only way our instrument affects the outcome is through our variable of interest. That is, Z has no direct effect on the outcome variable or takes no other route to the outcome variable.
What is 2SLS and how to we estimate it?
2SLS works by regressing the endogenous variables on the instruments (and exogenous variables) to isolate this part of the variation. Actually, there are many ways to “isolate the variation” due to exogenous factors, 2SLS is only the most popular.
The two-step procedure in 2SLS is the following:
- Project (regress) each of the exogenous and endogenous variables on
the set of instruments and endogenous variables (the “first stage”) - Regress the outcome Y on the fitted values obtained in the first step
Use method of moments and the the projection matrix to derive the 2SLS
See notes
What are mistakes that we need to avoid with 2SLS
- Estimating 2SLS manually incorrectly calculates the standard errors!
- We should not use probit or logit in the first stage!
Show that 2SLS is consistent
See notes
Can we test and see if our instrument is really needed? what is the consequence of including an instrument that is not needed?
We employ a “Hausman test for endogeneity”.
If we have an exogenous independent variable, it is better to just run an OLS.
What do we mean by “indirect least squares”?
When can we use this?
How does it connect with the Wald estimator?
This is basically just dividing the reduced form coefficient form with the first-stage coefficient. This is only valid in the just identified case.
The reduced form yields the intention to treat (ITT) effect. When scaling the reduced form with the first stage, we get the average treatment effect on the treated (ATTE). If the first stage = 1, we get LATE!
Using indirect least squares in the case with a binary instrument yields the Wald estimator
What is two-sample IV?
We can still use IV if we use different data sets. That is, having data on the instrument and first stage in the big dataset 1 and data on the instrument and the outcome of interest in the smaller dataset 2, lends itself to a “Two-sample IV”(T2IV). The important thing is that the instrument exists in both sets. We create the first stage with dataset 1 and the reduced form with dataset 2. We then decrease the risk of a weak instrument, since we create the first stage with the big dataset.