How to Randomize Flashcards
Random sample
Target population is not statistically different than the entire population
Random assignment
Randomly assigns a group to treatment and control;
Each individual group remains statistically consistent with the entire population’s characteristics
Simple randomization
Fixed probability, typically 50-50 would maximize your power
May end up with slightly more in one group & fewr in the other - but with a large enough sample, it’s not a problem (law of large numbers)
Spot randomization
Complete randomization
Fixed proportion
Determine the number of individuals in treatment & control (sample frame) and then pull out of a hat/random number generator
Why might you want to have the unit of randomization at the school vs. the student level?
1) Might be impossible to have half a school do something & the other half not
2) Risk of contamination/co-mingling, so those in control could benefit from the treatment
Multiple treatment arms
A more cost-effective (usually) way to answer multiple questions in the same study; but the sample size has to be large enough
Cross-cutting treatments (factorial design)
Involves interaction of 2 or more treatments; requires at least 2 treatments & four groups (1 group per treatment, 1 interaction arm, and 1 control group)
Varying the intensity of treatment can help to measure….
Dosage, sensitivity, elasticity, spillovers
Give a reason why it might be unethical to conduct a randomized evaluation
We already know an intervention works - so doing a study would needlessly consume resources while also denying people the intervention
Why might service providers having trouble distinguishing between treatment and control be an issue? What is a possible solution?
Creates crossovers, no longer a pure counterfactual
Solution: Assign to different service providers; randomly assign to teams - this way a caseworker is trained to treat (or not), doesn’t have to decide on treatment
Solution: Change the unit of randomization - randomize at the cluster level and have all providers in a cluster provide treatment (or not)
Why is it a challenge if (a) the control group finds out about treatment? Or if (b) the control group benefits from treatment? Or if (c) the control group is harmed by treatment? (Or if treatment is harmed) What is a possible solution?
(a) If control finds out about treatment, they might get upset…creating a lot of attrition.
(b) If the treatment group knows the control group - they could help them out & contaminate the control group
(c) If treatment and control are competing (for jobs, e.g.) the control could be harmed and the differential between the groups will be larger.
Solution: Vary the unit of randomization to limit spillovers or create a buffer
Why is it a challenge if you have the resources to treat everyone? Possible solution?
If you had no capacity constraints, you could treat everyone but have no control group.
Solution: Phase-in treatment
What are some of the concerns with a randomized phase-in design?
with a phase-in design, long-term outcomes can be harder to measure impacts on since the control group receives the program in the long run, doing away with the comparison group. This is of especial concern when the phase-in occurs on too short of a timeline for impacts to materialize and be measured.
We may want to choose randomization on the bubble when
the eligibility criteria are well defined, but can be changed
If all applicants are equally eligible, there is no marginal set of applicants amongst whom the randomization can be carried out (i.e. there is no “bubble”.) If there are no clear eligibility criteria, the bubble becomes hard to define. However, even when there are strict eligibility criteria, an inability to change these criteria would render a randomized evaluation infeasible since there would be no additional applicants amongst whom the treatment can be randomized (i.e. all applicants would either be eligible or ineligible according to existing criteria - one may want to consider a regression discontinuity design in this situation.)
Why is it a concern when everyone is eligible for a program? What is the solution?
Sometimes we are unable to randomly assign access to a program itself, but we can randomly assign encouragement to take up the program. This approach is useful when we want to evaluate a program that is already open to all of the eligible recipients but only some are currently using it. The program continues to be open to all the eligible people, but only some of them (the treatment group) will receive extra encouragement to take up the program
Here, you measure the effect of encouragement vs. the program