Potential Outcomes Framework Flashcards

1
Q

What is the fundamental problem in causal inference?

A

We want to observe the outcomes of each individual under 2 concurrent experimental conditions 1) the treatment condition 2) the counterfactual condition. However it is impossible to observe this!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the Individual Treatment Effect

A

If we could observe Y_i1 and Y_i0 for each individual and subtract the difference between the individual’s two outcomes.

ITE = Y_i1 - Yi0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

In an Individual Treatment Effect framework what is Y_i0 and Y_i1?

A

Y_i0 = Value of ith individual’s outcome if the individual where assigned to the treatment condition

Y_i1 = value of the ith individual’s outcome if the individual where assigned to the control condition

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What would the Average Treatment Effect equal if we could observe the Individual Treatment Effect for every individual in our population?

A

The average of all the individual ITEs

ATE = E[ Y_i1 - Y_i0]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does Rubin’s causal model state?

A

That it is possible to estimate the ATE when individuals have been randomly assigned to treatment (because both groups should be equal in expectation)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the estimated Average Treatment Effect?

A

The difference between the average outcomes of the sample treatment and the average outcomes of the sample control groups.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How can the estimated ATE be expressed in a regression framework?

A

E[Y_i | D = 1] = B_0 + B_1

E[Y_i | D = 0] = B_0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is regression?

A

A technique used to estimate the conditional expectation function for Y_i given the values of one or more other variables X_ik

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the population regression function?

A

The line that best fits the population distribution of (Y_i, X_i) in that it minimizes the sum of squared errors in the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does the Conditional Expectation Function equal the Population Regression Function?

A

When the CEF happens to be linear (unlikely)
When there is joint normality
In saturated regression models

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

The Population Regression Function is the best (blank) predictor of Y_i given X_i

A

linear

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is B_1 in the Population Regression Function?

A

B_1 =

Cov(Y_i, X_i)
/
v(X_i)

Covariance of Y_i and X_i divided by the variance of X_i

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is B_0 in the Population Regression Function?

A

B_0 = E[Y_i] - B_1 * E[X]

Average of Y_i minus the average of X_i multiplied by the population slope coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a saturated regression model?

A

A regression with discrete explanatory variables where the model includes a separate parameter for every possible combination of values taken on by the explanatory variables

Basically every dummy variable + all possible interactions between all dummy variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

When will the Population Regression Function slope coefficient have a causal interpretation?

A

When the Conditional Expectation Function that it approximates is causal, this is true when the CEF describes bifferenes in the average potential outcomes between the treatment and control groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the counterfactual?

A

Outcome for an individual in a different state

17
Q

What is ceteris paribus?

A

The change in (expected) outcomes across states, holding all else equal

18
Q

Does the Population Regression Function always represent the Conditional Expectation Function?

A

No. The CEF tells us how the mean of y varies with X in the population. This relationship may not be linear, so the PRF estimated may not represent the CEF in all cases.

19
Q

What is the omitted variables bias formula?

A

B_s = B_1 + Pie_1 * Gamma

Pie_1 = the slope of the coefficient from a regression on the omitted variable on the included variable (the relationship between the omitted variable and the other variable in the model)

Gamma = the slope coefficient of the omitted variable in the “long” regression where you do not have omitted variable bias

20
Q

Why can you not test whether the error term is correlated with an included covariate?

A

Because by construction, OLS models chose an intercept and a slope such that Xi is uncorrelated with the estimated hat_Epislon_i

C( hat_Epsilon_i, x_i ) = 0 mathematically

21
Q

Why does a randomized experiment ensure that E[ Epislon_i | X ] = 0

A

Because treatment level is distributed independently of any other determinants of the outcome

Random assignment ensures that other characteristics that might influence an outcome are randomly, and equally distributed between participants in the treatment and control groups.

22
Q

Is it a problem if characteristics are not equally distributed between participants in the treatment and control groups in the sample?

A

No. What is important is that potential members of the treatment and control group are identical on all observable and unobservable dimensions on average in the population

Due to idiosyncrasies in sampling the actual treatment and control groups may differ slightly on observable and unobservable dimensions but such idiosyncrasies are accounted for in the margin of error built into the statistical analysis

23
Q

What is selection bias/omitted variable bias?

A

Difference in potential outcomes (under no treatment) between those who get treated and those who dont

24
Q

An estimator is unbiased if

A

E[ hat_Beta_1 ] = Beta_1

The estimated parameter equals the population parameter

25
Q

An estimator is efficient (precise) if

A

It has the lowest variance of all unbiased estimators