PSM Flashcards by Hannah Kistler

What is exact matching?

Comparing individuals for whom the values of x are identical

rarely an option in practice since it’s often difficult to find T and C groups with identical values

How well did you know this?

Not at all

Perfectly

What is the purpose of matching?

To reproduce the treatment group among the non-treated

How well did you know this?

Not at all

Perfectly

What two conditions must be met to implement matching estimators?

Conditional independence assumption (CIA): There exists a set x of observable covariates such that after controlling for these covariates, the potential outcomes are independent of T status
Common support assumption: For each value of x, there is a positive probability of being both treated and untreated (you can find a treated unit to match with an untreated unit)

How well did you know this?

Not at all

Perfectly

What is the CIA?

Conditional independence assumption (CIA): There exists a set x of observable covariates such that after controlling for these covariates, the potential outcomes are independent of T status

How well did you know this?

Not at all

Perfectly

What is common support assumption?

Common support assumption: For each value of x, there is a positive probability of being both treated and untreated (you can find a treated unit to match with an untreated unit)

How well did you know this?

Not at all

Perfectly

How is the CIA used to construct a counterfactual for the treatment group?

It implies that after controlling for x, the assignment of units to T is “as good as random”

How well did you know this?

Not at all

Perfectly

What assumption does the CIA require?

That all variables relevant to the probability of receiving treatment may be observed and included in x

How well did you know this?

Not at all

Perfectly

Why is PSM called a “data-hungry” method?

You need a lot of data for this method

How well did you know this?

Not at all

Perfectly

What is the propensity score?

The probability that a unit in the combined sample of treated and untreated units receives the T, given a set of observed variables

How well did you know this?

Not at all

Perfectly

What does the propensity score theorem say?

You only need to control for the probability of treatment, because if conditional on x, Ti and (Y1i, Y0i) are independent, then conditional on the propensity score p(xi), Ti and (Y1i, Y01) are independent

How well did you know this?

Not at all

Perfectly

What is the formula for the ATE conditional on propensity score?

ATE conditional on propensity score=E[Y1i-Y01|p(xi)]

How well did you know this?

Not at all

Perfectly

Three steps for estimating program impact using PSM?

Estimate propensity score
Choose matching algorithm
Estimates impact of intervention with matched sample

How well did you know this?

Not at all

Perfectly

True or false: Use flexible functional form to estimate propensity score

True–want to allow for possible nonlinearities in the participation model (i.e., include higher-order terms and interaction terms)

How well did you know this?

Not at all

Perfectly

With or without replacement-which is better?

Without replacement-can only be matched with one treated unit

Estimators are more stable if a number of comparison cases are considered for each treated case–ie usually should use replacement

How well did you know this?

Not at all

Perfectly

What is nearest neighbor matching?

Individual from comparison group with closest propensity score is chosen–note that this can be done with or without replacement

How well did you know this?

Not at all

Perfectly

What is radius matching?

Study These Flashcards

Specify a caliper (maximum propensity score difference)

Implication for bias and variance of reducing caliper?

Study These Flashcards

Reduces the bias

Increases variance

How do you implement kernel method?

Study These Flashcards

Choose a kernel function, specify bandwidth parameter

Compare each treated unit to a weighted average of the outcomes of all untreated units, with higher weights placed on untreated units with scores closer to that of treated individual

Implications for bias and efficiency of choosing only one neighbor for nearest neighbor matching?

Study These Flashcards

Minimize bias by using most similar observation

Ignore information–>reduced efficiency

Conventional method for calculating standard errors from PSM estimates?

Study These Flashcards

Bootstrapping-sample from analysis sample with replacement, and replicate multiple times

You need to be sure that measures to generate PSM score are not confounded with outcomes or anticipation of treatment–what types of measures should you use?

Study These Flashcards

stable over time or
deterministic (ie age) or
measured before participation

How to check specification of your model re CIA?

Study These Flashcards

balancing tests (does the estimated propensity score adequately balance characteristics between T and C group units?)

How to check specification of your model re common support?

Study These Flashcards

visual inspection of densities of propensity scores
comparison test such as Kolmogrov-Smirnov
are there big differences between maxima and minima of density distributions?

What are we doing when we use propensity score to calculate ATT?

Study These Flashcards

For each propensity score, we calculate the difference in mean outcomes for the treated and untreated with that p(X)

We then take a weighted average of these over the different propensity score values

Two advantages of PSM over regression that controls for x?

1. Matching does not require assumptions about functional form (eg linear relationship) 2. Regression runs risk of extrapolating onto a space where there is little common support

5 requirements for covariate selection

1. Choose x's so that unconfoundedness holds 2. Should be correlated with treatment (Di) and outcome 3. Selection should be based on theory 4. x's should be measured before treatment and not affected by it 5. x;s should not be too good at predicting treatment--we are relying on common support

Implications for bias and standard errors of implementing nearest neighbor with replacement?

better matches--> possibly less bias | higher standard errors

Implications for bias and standard errors of implementing nearest neighbor without replacement?

worse matches-->possibly more bias | lower standard errors

Why would NN matching without replacement lead to lower standard errors?

Using more variation

Formula for propensity score matching estimator for ATT?

E[Y(1)|D=1, P(x)] - E[Y(0)|D=0, P(x)] (treated-untreated)--note that the second term subs in for the unobserved term that we really want to know, which is E[Y(0)|D=1, P(x)]

stratification and interval matching

Paritions the common support into intervals (strata) and then calculates mean differences within these strata

PSM Flashcards

(31 cards)