Causal Analysis Flashcards
causal relationship only if _ assumption holds
exogeneity
unobserved factors (ui) unrelated to regressor X
exogenous regressor
Xi and ui correlated
endogenous regressor
techniques to overcome endogeneity
randomisation and IVs
limitations of randomisation in OLS
unethical, estimate only those who chose to take part, limited duration
assumptions of an IV
relevance and exogeneity
relevance assumption
Z must be correlated with X
exogeneity assumption (exclusion restriction)
Z must not be correlated with error term u
instrument is used to
observe exogenous variation in Xi through movements in Zi to identify causal effect of Xi on Yi
IV estimation can be done by
2SLS
2SLS stages
- regress X on Z to get predicted Xs
2. regress Y on predicted Xs
weak instruments problem
2SLS estimator will be biased towards OLS estimator if correlation is close to zero
doesn’t explain sufficient variation in endogenous variable X
testing strength of instrument
F-stat
F-stat (actual value, not p-value)
<10 = weak >10 = strong
testing validity of instrument
can only use IV if same number of instruments and endogenous regressors
LATE
Local Average Treatment Effect - effect of receiving treatment on w/e
ITT
Intention to Treat effect - captures causal effect of receiving treatment = underestimate
causal effect =
ITT/difference in compliance rates btw treatment and control group
Regression Discontinuity (RD) designs
situation in which treatment D depends on an observed continuous variable Q (running variable)
example of RD design
babies w/ both weight below certain value receiving extra neonatal care
Sharp RD design
treatment status (D) is a deterministic function of q. e.g MLDA = 21 - status changes at some point in continuous function
Fuzzy RD design
treatment status (D) is NOT a deterministic function of q. change in the probability of treatment. e.g raising of school leaving age affect on earnings = indirect effect
IV
correlated with one of the variables but not correlated with anything in error term that affects dependent variable