Panel Data and Difference-in-Differences Flashcards
What assumption does DnD require?
Common trends assumption: requires assumption that T would have the same trend as C were it not treated.
What are three magnitudes that will be key for a formal representation of diff-in-diff identification?
- the group-level fixed effects of the T and C groups
- the common trend (in the absence of treatment)
- the causal effect (aka the difference-in-differences)
Two formal assumptions of DnD
- E(Y1 −Y0|s,t)=δ ⇒ conditional treatment effect is a constant
- E(Y0 |s,t)=γs +λt ist ⇒ potential outcomes in the untreated state have an additive structure. Specifically, they must consist of
- a unit-specific fixed effect γs
- a period effect (or trend) λt that is common across all units
This assumption is violated if the potential outcomes are such that in the absence
of treatment their true modelling requires state-specific time effects, as e.g. when
the expected potential outcomes are given by E (Y 0 |s, t ) = γs + λt + κst .
DnD as a Regression
Diff-in-diff estimates can also be implemented as a regression. Recall equation (1) above:
Yist = γs + λt + δDst + εist . It’s straight-forward to formulate a regression that estimates the key parameters as coefficients
Yist = α+β1INJ +β2TNov +δ(INJ ×TNov)+εist, where INJ and TNov are state and time dummies, respectively (IPA and TFeb are the reference categories for which no dummy is included). INJ × TNov is the same as Dst above. The coefficients contain the following magnitudes (as specified on previous slide), and can be interpreted accordingly:
- α=γPA+λFeb - β1=γNJ−γPA
- β2 =λNov −λFeb - δ=δ
Advantage of DnD Regression?
- One key advantage of regression diff-in-diff is the possibility to include controls: Y =α+βI+βT+X′β+δ(I×T)+ε. ist 1 NJ 2 Nov ist 3 NJ Nov ist
- Another key advantage of regression diff-in-diff is the possibility to estimate effect of a non-binary treatment.
Dnd Controls in Minimum Wage example?
- chain of restaurant (McD, KFC, Wendy’s, etc)
- ownership (company-owned vs franchise-taker)
- region within NJ and PA (to capture regional demand shocks)
Robustness Check for DnD
If the number of time periods is at least > 2 (in practice it must be much larger), a good robustness check of diff-in-diff results is to estimate a model with unit-specific trends: Y = α+I′β +T′β +Istτ+X′ β +D δ+ε , ist s 1 t 2 ist 3 st ist where
- its interaction with the unit dummies Is allow each unit to have an individual linear trend
- τ are the trend coefficients to be estimated.
⇒ If δ remains largely unaffected by the inclusion of Is′ Tt τ , then this is strong support for trends being common.
Interpret the general DnD Regression: Y = α+I′β +T′β +X′ β +D δ+ε ,
- Is and Tt are vectors of state and time dummies;
- Xist is a vector with variables that can differ across subjects, states, and time;
- Dst can either be the a binary indicator of treatment of unit s at time t, or it can be a continuous variable capturing treatment intensity of unit s at time t.
Main requirement of DnD
- availability of panel data with at least two pre-treatment observations
- treated and untreated subjects
- common pre-treatment trends of the untreated and (later) treated
- constant treatment effect
- no shocks coinciding with treatment
When is DnD most valuable to evaluate Policy?
- is implemented at an organizational level whose units contain many subjects (e.g. students within schools or classes, firms in regions, households in cities, etc), and
- is applied to some, though not all of the units (some but not all schools, regions, cities)
- staggered or (randomly) phased-in treatment can also be exploited (→ natural
experiment)
Policy areas for DnD
- education policy
- labour market policy
- but e.g. also reform effectiveness in organizations