PROPENSITY SCORE Flashcards

1
Q

What does a propensity score represent?

A

The predicted probability of exposure in a particular individual based on a set of relevant characteristics (because confounders increase the propensity of being exposed)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the role of the propensity score?

A

Estimates treatment effects by controlling for confounding in observational cohort studies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What’s the best distribution of exposure/outcome for a propensity score?

A

Common exposure, rare outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Which variables should we
a) include
b) not include
in a logistic regression to estimate the propensity score?

A

a) All variables related to the outcome, whether or not they are related to the exposure
b) UNLESS they are a consequence of the exposure (collider); also exclude variables unrelated to the outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the impact of including variables that are unrelated to the exposure despite being related to the outcome?

A

None, it’s fine.

It decreases the variance without increasing the bias.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the impact of including variables that are unrelated to the outcome despite being related to the exposure?

A

It increases the variance without decreasing the bias.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the main 2 steps of creating/using propensity scores?

A
  1. Model the exposure variable in a regression as a function of potential confounders. This calculates the predicted probability of exposure for every individual as a function of these covariates.
  2. Apply the propensity score by matching, stratifying, controlling or weighting.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

For a given propensity score, what is the chance of control/experimental arm the same as?

A

The choice of control or experimental arm is the same as a random process, given that the patient had a real choice

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the C statistic?

A
  • Means: concordance
  • A measure of model discrimination, but cannot judge adequacy
  • Estimates the probability that a patient randomly selected from the treatment arm has a higher propensity score than a patient randomly selected from control arm (should be high)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What should the area under a ROC curve be for a propensity score?

A

0.5 (random)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Summary of propensity score matching?

A
  • Finding individuals with similar propensity score in both arms and matching them (e.g., 1:1 nearest-neighbor-matching, with a match of 0.1 distance)
  • Calculate treatment effect with matched pair analysis
  • Equivalent to simulated randomization (distributes confounding evenly)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Summary of propensity score adjusting?

A
  • Including the propensity score as a covariate in a regression model (ideally, with the individual covariates, leading to a doubly robust model)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Summary of propensity score stratifying?

A
  • Splitting the dataset on the basing of the propensity score alone, and then estimating the treatment effect in each stratum and taking the weighted average for overall effect
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Summary of inverse probability weighting?

A
  • Re-weighting individuals from the whole dataset to increase the weight of those with unexpected exposure
  • Equivalent to producing additional observations for where there is few observations
  • Creates a pseudopopulation with near-perfect covariate balance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Weight given to treatment and control arm in inverse probability weighting? Any problem with that?

A

Tx: 1/PS
Control: 1/(1-PS)

Problem: when the PS is close to 0 for the tx arm, and close to 1 for control arm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Which methods eliminate systematic differences the best?

A
  • Matching

- IPW

17
Q

What is a potential problem with propensity score adjustment?

A
  • May bias if the assumption about the functional relationship between the propensity score and the outcome is wrong (e.g., linearity)
18
Q

Which methods are potentially doubly robust? Which are not?

A

Yes: Propensity score adjustment (if adding also individual covariates), IPW (how?)
No: Matching, stratifying (N is too small)

19
Q

What are the pros and cons of traditional covariate adjustment?

A

Pros: performs well, provides prognostic model for outcome of interest
Cons: not good for small n and many covariates

20
Q

What are the pros and cons of propensity score matching?

A

Pros: reliable, good balance of covariates, simple
Cons: unmatched subjects are not analysed, less precise due to small n

21
Q

What are the pros and cons of propensity score adjustment?

A

Pros: performs well
Cons: very similar to traditional adjustment, without necessarily being better

22
Q

What are the pros and cons of propensity score stratifying?

A

Pros: keeps all the data, can look at interactions
Cons: not as good with few outcome events, does not account for strong confounding

23
Q

What are the pros and cons of inverse probability weighting?

A

Pros: keeps all the data, easy, the pseudopopulation has a perfect covariate balance
Cons: Unstable when extreme weights

24
Q

What are the limitations to propensity score methods in general?

A
  • Only binary treatment/exposures
  • No balancing of unmeasured covariates
  • Can’t get the effect of the covariates that are adjusted for
  • Not for prediction
  • Cannot handle time-varying exposures or confounder
25
Q

4 sources of residual confounding

A
  1. Bad definition
  2. Imperfect proxy
  3. Not included
  4. Misclassified
26
Q

What is an instrumental variable?

A

One that is:

1) Causes the exposure
2) Affects the outcome only through the exposure (=’exclusion criteria’)
3) Not associated with any confounders (=’independence criteria’)

27
Q

Looking at instrumental variable is equivalent to…

A

Intent to treat estimate in RCTs

28
Q

What are endogenous regressors? Exogenous regressors? And what are instrumental variables?

A

Endogenous are correlated with the exposure, while exogenous are not (often proxy, for example extreme weather for stress during pregnancy)

Instrumental variables are exogenous

29
Q

An instrumental variable analysis trades…

A

The ignorability of the treatment variable
for
the ignorability of the instrument (that we think is more plausible)

30
Q

What if the instrumental variable is weak?

A

It’s biased toward the null due to non-differential misclassification

31
Q

What are the two stages to calculate IV estimates?

A

1 . Each endogenous explanatory variable is regressed on all exogenous variables in the model (both in the equation of interest and the excluded instruments). We take the predicted values of these regressions.

  1. We estimate the regression of interest, but we replace each endogenous covariate with the predicted value from the first stage.