F10 Difference-in-Difference Flashcards

1
Q

What are the elements of the most basic DiD design?

A

Two groups in two time periods - pretreatment and post treatment period (2x2)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the mathematical specification of a simple 2x2 design?

A

delta = (yt-bar^post - yt-bar^pre) - (yc-bar^post - yc-bar^pre)

In other words a difference between two differences. The temporal difference in the control group and the treatment group.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the central assumption?

A

Parallel trends assumption: Up until the intervention groups have the same trajectory on an outcome of interest.

No time-variant group specific unobservables.
No anticipation effects.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does the parallel trends assumption mean? How is it measured?

A

The control group is used as a counterfactual development for the treatment group, as we assume they would have had parallel trends in absence of treatment.

Potential outcomes: Comparison of the potential outcome pre- and post treatment had both groups not been treated (unobserved)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What type of effect are we estimating?

A

ATT: Average treatment effect on the treated (+ selektionbias)

Treatment is not randomization. The Y1 is not assume to be the same.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the regression for a simple 2x2 DiD? And what do the different parts express?

A

Y = α + λT + γD + δTD + ε

α: Control in pretreatment
α + λ: Control in post treatment
α + γ: Treatment in pretreatment
α + λ + γ + δ: Treatment in post treatment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a triple DiD?

A

Three dimensions. Not only between time and unit but also within a unit (e.g. subpopulations, locations, or policies). Can be used for placebo.

8 groups are now relevant e.g. control group pretreatment high income.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How does a regression for a triple DiD look?

A

Y = α + T + D + Z + TD + TZ + DZ + δTDZ + ε

So all categories for themselves, as interactionterms and then as three-way interaction.

Delta is still the effect estimate (when all dummies are =1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What kind of estimate is the triple DiD?

A

ATT: Average treatment effect on the treated for a subgroup of units (we can’t rule out heterogeneous treatment effects).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a staggered DiD?

A

Two-way fixed effects differential timing in treatment.

Treatment occur at diffrent times for different units (multiple time periods).

E.g. US states implementing medicaid at different points in time. Or EU countries implementing WLBD in different tempi.

A weighted ATT based on an average of all possible ATTs

Staggered is NOT a two-way fixed effects estimator (staggered has much stronger assumptions)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is problematic in a staggered DiD? 3 things

A

Units treated late might be compared to units that were always treated or treated early.

Bacons decomposition:
1) Treatment group can be used as both control and treatment

2) Assign weights to individual 2x2 DiD depending on time of treatment (in the middle). Shortening/lengthening the panel changes the estimate - theoretical nonsense

3) Assume no timevariant ATT heterogeneity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When is a staggered DiD problematic?

A

When treatment effect is heterogeneous or dynamic over time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is important for coefficients for time periods leading up to the intervention?

A

They must de insignificant otherwise the effect is triggered by something else than the intervention.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is serial correlation and why is it a problem?

A

Also known as autocorrelation. Serial correlation (or autocorrelation) occurs when the residuals (errors) in a regression model are correlated across time.

Often occurs in times series data. With more observations for the same unit, I kind of artificially inflate the number of observations and thereby the power (effective sample size is smaller).

This probably result in much lower p-values than what can be justified (type I errors or false positives)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Does DiD look at changes og levels?

A

We study changes (within variation)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How is panel data and repeated cross-sections different i DiD?

A

Panel data: FE is possible
Repeated cross-section: Not the same units - potential bias.

17
Q

What happens if treatment in DiD is endogenous?

A

Parallel trends are always violated

18
Q

Explain the event study test for DiD

A

You compare placebo pre-treatment leads of the DD coefficient. If DD coefficients in the pre-treatment periods are statistically zero, then the difference-in-differences between treatment and control groups followed a similar trend prior to treatment.

If they had been similar before, then why wouldn’t they continue to be post-treatment?

BUT assumes the future is like the past. Neither sufficient or necessary for parallel trends.

19
Q

How can you test parallel trends?

A

Event study test (evaluate pre-treatment leads)
Empirical inspection of development pre treatment
Placebo on the outcome

20
Q

What is first and second stage regarding DiD?

A

First stage (treatment’s effect on usage). Second stage (treatment’s effect on the outcomes of interest).

Important to examine both.

21
Q

Happens when you take the first difference in simple 2x2 DiD?

A

You eliminate unit-specific effects (FE)

22
Q

What can be iffy with inference and DiD?

A

Use cluster robust on treatment group-level. Due to autocorrelation + correlation within in cluster

23
Q

What is the logic for placebos?

A

The reasoning goes that if we find, using our preferred research design, effects where there shouldn’t be, then maybe our original findings weren’t credible in the first place

24
Q

What is the main concern using repeated cross-sections instead of panel data in two-way fixed effects?

A

We risk compositional changes - an omitted variable bias built into the sample itself caused by time-variant unobservable.

We assume they are independent draws from the population - i.i.d. (cross-sectional independece)

25
Q

What does Cunningham say about specification for DiD?

A

I could be a source of bias. We always need to carefully think about the DGP for the outcome - linear, non-linear, poisson, binary etc.