Regression Discontinuity Designs (RDD) Flashcards

1
Q

idea of RDD

A

exploit particularities in laws and institutions, usually thresholds

  • treatment occurs just above (below) threshold
  • control group are observations just below (above) threshold
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

identification requirement: RDD

A

discontinuity (‘jump’) in the mapping Z → D at some threshold z0 (necessary requirement)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

identifying assumption:

A

continuity of Z |D → Y (in potential outcomes) and of other variables (Z → X ) around threshold z0

⇒ ‘identifying requirement’ and ‘identifying assumption’ are necessary and sufficient, respectively, for identification of causality

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Two types of RDD

A
  1. ‘sharp’ or ‘clean’ RDD
  2. ‘fuzzy’ RDD
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

clean RDD estimation

A
  • D is deterministically related by Z
  • i.e. all subjects above (below) threshold z0 are treated …
  • … while all subjects below (above) threshold z0 are

Implications:

  • comparison of observations just above and just below threshold yields estimate of
    causal effect
  • several complications in model specification and estimation (see below)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

fuzzy RDD estimation and implication

A
  • D is probabilistically related by Z
  • i.e. treatment probability is higher for subjects above (below) threshold z0 than for those just below (above)

Omplications

  • being above (zi ≥ z0) or below (zi < z0)threshold is used as instrument for actual treatment status di
  • estimation analogue to intention-to-treat design with Wald or 2SLS-IV estimation
  • ‘double local’ interpretation of estimated effect
    a. local in the LATE sense (applies only to compliers)
    b. local because only estimated at threshold
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Formalization of RDD

A

1) zi≥z0 di= 0 if zi<z0></z0> E[Y0|Z] = α+βz (1)
E[Y1|Z] = E[Y0|Z] + δ (2)
Y1 = Y0+δ (3)
where δ is the causal effect of interest, assumed to be locally constant.

Note that model 1 describes observed outcomes for all z < z0 and potential outcomes for all z ≥ z0. In turn, 3 describes observed outcomes for all z ≥ z0 and potential outcomes for all z < z0.
Because for every i either Y 1 or Y 0 is observed, one can write the regression Yi =α+βzi +δdi +εi,
where di is an ‘above threshold’ dummy variable. Inclusion of other controls (+X′γ)
possible but ideally matters little (continuity assumption).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Main problems with RDD estimation

A

P1 observations from treatment group influence prediction of control group outcomes, and vice versa
P2.a observations far from z0 change yˆ close to z0 which can lead to bias
P2.b complex functional forms necessary to adequately model the data, which often
creates bias near threshold
P3 there may be very few observations close to the threshold

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

SOlution of Estimation Problems for RDD

A

P1 estimate separate regressions on either side of the threshold
P2 narrow z-range of observations considered (so called ‘bandwidth’) to reduce bias P3 avoid too small bandwidths
⇒ there is an unavoidable trade-off between bias and precision

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Testing RDD Assumptions

A
  1. in-depth knowledge of laws and institutions necessary
    - check whether there is an actual jump in d; is it sharp or fuzzy?- check whether d and y do not display jumps at other points of z where there is no threshold. - estimate ‘effects’ using preferred RD specification at many placebo thresholds. - rule of thumb: should only be able to reject the null of ‘no jump’ in 5% of cases
  2. continuity harder to test
    - assumption cannot be verified (because potential outcomes by definition unobserved)
    - but can be falsified because assumption is usually likely violated if
    (a) y or d are very ‘jumpy’ along z,
    (b) there is sorting around z0 (e.g. due to manipulated population numbers),
    (c) or if there are other changes at the same threshold
    → check whether one of them applies
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Checks to falsify continuity assumption

A

(a) check if there are discrete jumps in y or d at values of z where there are no thresholds (placebo tests)
(b) check for smooth distribution of other variables x around z0 (works for observables)
(b) check for smooth density around threshold, e.g. by visual inspection and McCrary (2007) test (also speaks to unobservables)
(c) in-depth knowledge of legal and institutional environment necessary

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Advantages of RDD

A

Advantages

  • often only way for causal identification of certain effects
  • particularly high internal validity if continuity holds
  • assumptions fairly well and transparently testable (e.g. compare to IVs)
  • graphical display of results often intuitive
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Disadvantages of RDD

A
  • similar to any natural experiment
  • opportunities for RDD cannot be created, just discovered
  • method influences what is been studied
  • local identification only → external validity
    (how far away from threshold is δ still constant?)
  • other pitfalls to be aware of when using population threshold RDDs → seminar next week
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Basic idea of non-paramtric identification?

A
  • data divided into many slices or ‘bins’
  • many local means estimated
  • kernel-weighting within slices possible
How well did you know this?
1
Not at all
2
3
4
5
Perfectly