PSM Flashcards
What is exact matching?
Comparing individuals for whom the values of x are identical
rarely an option in practice since it’s often difficult to find T and C groups with identical values
What is the purpose of matching?
To reproduce the treatment group among the non-treated
What two conditions must be met to implement matching estimators?
- Conditional independence assumption (CIA): There exists a set x of observable covariates such that after controlling for these covariates, the potential outcomes are independent of T status
- Common support assumption: For each value of x, there is a positive probability of being both treated and untreated (you can find a treated unit to match with an untreated unit)
What is the CIA?
Conditional independence assumption (CIA): There exists a set x of observable covariates such that after controlling for these covariates, the potential outcomes are independent of T status
What is common support assumption?
Common support assumption: For each value of x, there is a positive probability of being both treated and untreated (you can find a treated unit to match with an untreated unit)
How is the CIA used to construct a counterfactual for the treatment group?
It implies that after controlling for x, the assignment of units to T is “as good as random”
What assumption does the CIA require?
That all variables relevant to the probability of receiving treatment may be observed and included in x
Why is PSM called a “data-hungry” method?
You need a lot of data for this method
What is the propensity score?
The probability that a unit in the combined sample of treated and untreated units receives the T, given a set of observed variables
What does the propensity score theorem say?
You only need to control for the probability of treatment, because if conditional on x, Ti and (Y1i, Y0i) are independent, then conditional on the propensity score p(xi), Ti and (Y1i, Y01) are independent
What is the formula for the ATE conditional on propensity score?
ATE conditional on propensity score=E[Y1i-Y01|p(xi)]
Three steps for estimating program impact using PSM?
- Estimate propensity score
- Choose matching algorithm
- Estimates impact of intervention with matched sample
True or false: Use flexible functional form to estimate propensity score
True–want to allow for possible nonlinearities in the participation model (i.e., include higher-order terms and interaction terms)
With or without replacement-which is better?
Without replacement-can only be matched with one treated unit
Estimators are more stable if a number of comparison cases are considered for each treated case–ie usually should use replacement
What is nearest neighbor matching?
Individual from comparison group with closest propensity score is chosen–note that this can be done with or without replacement