F9 Instrumental Variables Flashcards
What is the advantage and disadvantage of IV?
Advantage: It’s possible to deal with unobserved confounders.
Disadvantage: It relies heavily on the exclusion restriction, which is difficult to defend.
Draw the DAG of IV
Z –> D –> Y
+
D <– U –> Y
What is the exclusion restriction?
The instrument can only influence Y through D. It’s important with correlation between the instrument and the outcome (the only through)
The instrument cannot affect confounders or Y directly.
No causal relation.
Cov(Z, epsilon) = 0
What is a good instrument?
Quasi-random (easier to defend the exclusion restriction).
What are the four important groups in IV
Compliers: Treatment status is affected by the instrument in the correct direction (treatment units is treated). Complied with the draft of the lottery.
Defiers: Treatment status is affected by the instrument in the wrong direction (treatment units is not treated). Military-deserters
Never takers: Unit never take treatment regardless of the instrument. Medical exempt for military.
Always takers: Always take the treatment regardless of the instrument. Patriots - military no matter what.
How can an estimate be biased by confounders?
Biased estimate: delta-hat = delta + gamma * Cov(u,x)/Var(x).
Bias: gamma * Cov(u,x)/Var(x).
(1) Gamma: The effect of unobserved confounder - positive or negative - cov(u,y).
(2) Cov(u,x)/Var(x): The relation between x and unobserved confounders
What direction can bias have if affected by confounders?
Negative-negative or positive-positive: Upward bias.
Negative-positive or positive-negative: Downward bias
What is a strong instrument?
Highly correlated with the independent variable (non-zero at first stage)
What is 2SLS?
Two stages least squares regression. Two steps:
(1) X_i = γ + βZ_i + ε_i. The endogenous independent variable as a function of the instrument.
(2) Y_i = α + δX_i-hat + ϵ_i. The fitted values from the first stage is used in the primary regression
The estimate becomes (first stage replaced with fitted values):
Cov(βZ,Y)/Var(βZ) = Cov(X-hat,Y)/Var(X-hat)
What are the assumptions behind 2SLS?
(1) Exclusion restriction (instrument cannot be correlated with the error term). Necessary but not sufficient (must be a strong instrument as well).
(2) Non zero first stage (instrument must be correlated with the endogenous independent variable D).
What is the reduced form?
The correlation between the instrument and the outcome.
Must be different from zero and thus significant (to make sure that it’s different from zero) if we want to estimate an effect.
What is a weak instrument and how can you test whether an instrument is strong or weak?
Weak instrument: Not highly correlated with the endogenous independent variable.
Problem: Inconsistent and large SE
Test: The F-statistic on the first stage of 2SLS.
What is the F-statistic?
The joint significance of all independent variables in a regression. We’re punished by:
Every independent variable added
Weak instruments.
Relevant if you have more instruments.
F > 10 for the first stage.
Can you have more than one instrument?
Yes as long as all instruments meet the exclusion restrictions.
What is the beta-estimate with IV?
Cov(y,z)/Cov(x,z) with z being the instrument.
So, if the effect of Z on Y (the reduced form) is 2 and the effect of Z on X is 3, the effect of X and Y is 2/3 = 0,66