Threats and Analysis Flashcards
True treatment effect
True treatment effect is a concept that applies to each individual. So the impact of this program on me is X, the impact on you is Y. Just like the true counterfactual, the true treatment effect cannot be observed, we can only estimate it with a certain degree of confidence.
Average Treatment Effect
ATE is also hypothetical, and assumes everyone is treated, but averages across me, you and everyone else in the sample (or population). This too, is impossible to measure if there is any non-compliance.
Intention to Treat:
ITT is similar to ATE, however if we assume no spillovers from compliers to non-compliers in the treatment group
we can think of it as the treatment effect on compliers and spillover effect on non-compliers within the treatment group.
Local Average Treatment Effect (LATE) or Complier Average Treatment Effect (CACE):
CACE or LATE basically just limits the sample to compliers and can be estimated (using the Wald estimator) if there are no spillovers; or alternatively, CACE could be estimated if we knew the spillover effect.
Noncompliance
when some members of the treatment group don’t receive treatment and/or control group receive treatment, etc.
Attrition
whether or not you have outcomes for your subjects; unable to find your subjects many years later, etc.
And/or the control group is less likely to feel like they need to engage in outcome measures (taking a test, survey, etc) because they didn’t receive the beneficial treatment
Spillovers
to what extent to the treatments assigned to some people leak over to the controls nearby
Generalizability
how can you translate your results to policy & behavioral recommendations within the limitations of your results?
Unit level treatment effect & average treatment effect (ATE)
How would an experimental subject have responded if treated? How would same subject have responded if untreated? Difference between these two potential outcomes is the unit-level treatment effect;
Average unit-level treatment effect is the ATE (average treatment effect in the subject pool)
Why is the average treatment effect important?
Randomization pulls a random sample of the treated and untreated potential outcomes. We can’t know the individual level causal effect, but we can estimate in an unbiased manner, under core assumptions, the average treatment effect
Core assumptions of the potential outcomes model
Random assignment of subjects to treatments, non-interference, excludability
Core assumptions - Random assignment of subjects to treatments
receiving treatment statistically independent of subjects’ potential outcomes
We must compare only randomly assigned groups & resist the temptation to compare the groups that actually take the treatment to the groups that don’t take the treatment - because they aren’t necessarily randomly assigned
Core assumptions: non-interference
subject’s potential outcomes reflect only whether they receive the treatment themselves; they are unaffected by how treatments happened to be allocated/how the randomization pans out
Core assumptions: excludability
subject’s potential outcomes respond only to defined treatment, not other extraneous factors that may be correlated with treatment – Importance of defining treatment precisely and maintaining symmetry between treatment and control groups (e.g., through blinding)
Must maintain symmetry in everything that you do in design and analysis - Enumerators have to stay the same for treatment and control, for example (otherwise could be a violation of symmetry)
What 3 things are missing from the ‘core assumptions’ of the potential outcomes model?
No assumptions about shape of outcome distribution (e.g., that responses are normally distributed) - We’re not assuming that we can generalize our inference about the ATE in the subject pool
The issue of “external validity” is a separate question that relates to the issue of whether the results obtained from a given experiment apply to other subjects, treatments, contexts, and outcomes.
Random sampling of subjects from a larger population is not a core assumption, but aids generalizability
When core assumptions are met…
…an experiment generates unbiased estimates of the average treatment effect (ATE)
Sampling distribution
collection of possible ways that an experiment could have come out, under different random assignments
estimator
procedure for generating guesses about a quantity of interest (e.g., the average treatment effect)
Under simple or complete random assignment, the difference-in-means estimator is…
Under simple or complete random assignment, the difference-in-means estimator is unbiased – Any given estimate may be higher or lower than the true ATE, but on average, this procedure recovers the correct answer
3 possible reasons for non-compliance
Sometimes there is a disjunction between the treatment that is assigned and the treatment that is received.
(1) Miscommunication and administrative mishaps - Send radio ads but sometimes they just don’t play them; things just don’t happen as you intend sometimes
(2) Subjects may be unreachable - People don’t answer/aren’t home/they moved/they died, etc.
(3) Encouragements sometimes don’t work - Can’t force people to do anything; Encouragement doesn’t push them to enroll. Or, people who are in control group show up anyways
When addressing “noncompliance” - how should we think about the ‘excludability’ assumptions?
Are outcomes affected only by the treatment? Or by
both the assignment and the treatment?
Excludability is going to mean that the only thing that will affect people is the treatment itself, and not something other factor - this is a potentially fallible assumption. To the extent that you can design things in a way that doesn’t tip people off to their assignment is important – otherwise things could get distorted a bit.
Can you switch non-compliers between treatment and control (depending on their behavior) and/or remove them from the sample?
No! This leads to bias! You ONLY want to compare the randomly assigned groups, do NOT change what you are comparing!
What should estimation strategy be based on - in terms of treatment and control groups? What’s the risk of altering group structure?
Subjects you fail to treat are NOT part of the control group! Do not throw out subjects who fail to comply with their assigned treatment
Base your estimation strategy on the ORIGINAL treatment and control groups, which were randomly assigned and therefore have comparable potential outcomes. Groups/individuals should always be analyzed based on their original assignment.
Certain types of individuals may be more likely to take up the treatment than others; comparing only those who are actually treated (whether in the treatment group or overall) with those who aren’t would run the risk of introducing selection bias.