Assumptions Flashcards
What is the SUTVA assumption?
The potential outcomes of an individual i do not depend on the treatments received by other individuals.
ATT Definition
Mean difference between observed outcomes and counterfactual outcomes for the treatment group: E[Y1 - Y0 | D = 1]
ATC Definition
Mean difference between observed outcomes and counterfactual outcomes for the control group: E[Y1 - Y0 | D = 0]
ATE Definition
Mean treatment effect across the entire population, whether or not they actually participate: E[Y1 - Y0]
Mean Independence Assumption (MIA) and what does it mean if it holds?
E[Y0|D = 0] = E[Y0 |D = 1] and E[Y1|D = 0] = E[Y1|D = 1].
If it holds, the data behave as if generated by an experiment: potential outcomes are mean-independent of treatment, so there is perfect balance between treated and untreated.
What happens to the treatment effects under randomisation? And why?
DIM = ATT (no baseline bias) = ATE (no baseline or DTE bias) = ATC
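A minimal numpy sketch (illustrative simulated data, not from the source) of why this holds: when treatment is randomly assigned, the difference-in-means (DIM) recovers the ATE.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200_000
y0 = rng.normal(0, 1, n)          # potential outcome without treatment
y1 = y0 + 2.0                     # constant treatment effect of 2
d = rng.integers(0, 2, n)         # randomly assigned treatment
y = np.where(d == 1, y1, y0)      # observed outcome

dim = y[d == 1].mean() - y[d == 0].mean()
ate = (y1 - y0).mean()
print(round(ate, 2))              # 2.0 by construction
print(round(dim, 2))              # close to 2.0: DIM recovers the ATE
```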
What are some issues with experiments, and threats to internal validity?
They can be costly, impractical and sometimes impossible.
Threats to internal validity:
Hawthorne effect - People react to being observed
John Henry effect - People react to being in the control group
What is the CIA?
The CIA (Conditional Independence Assumption) holds when the potential outcomes are independent of treatment once X is controlled for.
What is the CMIA, and what does it mean if it holds?
If it holds, selection bias disappears after conditioning on the observed characteristics X, as treatment is then as good as random: conditional on X, treated and untreated individuals will, on average, have the same potential outcomes.
E[Y0|D = 1, X] = E[Y0|D = 0,X]
What are the 2 assumptions to calculate the treatment effect from observational data?
E[Y1|D = 1, X] = E[Y1|D = 0, X]
E[Y0|D = 1, X] = E[Y0|D = 0, X]
Different types of individuals based on the potential treatments
Always Takers: D(1) = 1 and D(0) = 1
Never Takers: D(1) = 0 and D(0) = 0
Compliers: D(1) = 1 and D(0) = 0
Defiers: D(1) = 0 and D(0) = 1
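As a quick illustration (a hypothetical helper, not from the source), the four types above are just a lookup on the pair of potential treatments (D(0), D(1)):

```python
# Classify an individual from their potential treatments:
# d0 = treatment taken if Z = 0, d1 = treatment taken if Z = 1.
def compliance_type(d0: int, d1: int) -> str:
    return {
        (1, 1): "always-taker",
        (0, 0): "never-taker",
        (0, 1): "complier",
        (1, 0): "defier",
    }[(d0, d1)]

print(compliance_type(0, 1))   # complier
print(compliance_type(1, 1))   # always-taker
```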
What are 2 assumptions for making calculations on ‘Always takers’ etc?
No defiers (monotonicity)
The instrument is independent of potential outcomes and potential treatments (as good as randomly assigned)
What makes a valid instrument?
- Has a causal effect on treatment (First Stage)
- It’s as good as randomly assigned
- It affects outcomes only through treatment (Exclusion Restriction)
Benefits of the 2SLS
1) Allows use of multiple instruments
2) Controls for exogenous observable characteristics X
What do the elements in this regression mean?
M_bar[a] = alpha + rho*D[a] + gamma*a + e[a]
M_bar[a] - average mortality at age a
alpha - intercept
rho - estimate of the jump exactly at the threshold
gamma - slope coefficient on the running variable
D[a] - treatment dummy (1 once a passes the threshold)
a - running variable
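A hedged numpy sketch of this RD regression (simulated data; the jump size, slope, and age range are illustrative assumptions): we build average mortality with a known jump at the cutoff and recover rho by least squares.

```python
import numpy as np

rng = np.random.default_rng(1)
ages = np.linspace(19, 23, 49)            # running variable a
cutoff = 21.0
d = (ages >= cutoff).astype(float)        # treatment dummy D[a]
true_rho, true_gamma = 10.0, 1.5          # assumed jump and slope
m = 80 + true_gamma * (ages - cutoff) + true_rho * d \
    + rng.normal(0, 0.5, ages.size)       # average mortality with noise

# Regress M_bar[a] on a constant, D[a], and the (centred) running variable
X = np.column_stack([np.ones_like(ages), d, ages - cutoff])
alpha, rho, gamma = np.linalg.lstsq(X, m, rcond=None)[0]
print(round(rho, 1))                      # close to the true jump of 10.0
```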
Why might a simple linear model produce misleading estimates? Regression discontinuity
If the relationship between the two variables is not linear
If relationship between variables is not the same on each side of the discontinuity.
What identification assumptions need to be made for an estimate to be causal? RD
- Independence of potential outcomes either side of the discontinuity
- No OVB in the estimating equation, implies rho is causal
- No other ‘jumps’ at D (threshold)
How do you test the causal estimate assumptions? RD
2 possible tests
- See if other observable characteristics are balanced either side of the discontinuity
- Ensure that there is no manipulation of the running variable. (If there is no manipulation, the density of a should be smooth around the discontinuity)
What is manipulation?
Manipulation is when individuals can influence the running variable and sort themselves around the threshold -> ideally the running variable is something hard to manipulate, like age
Pros and cons of narrower bandwidths
Pros - Less likely to be misspecified, so closer to the true value of rho
Cons - Less data, so a less precise estimate
What is the difference between standard error and standard deviation?
Standard error is a measure of how much the sample mean would vary if it were estimated from lots of different samples.
Standard deviation is a measure of how much observations vary from one another.
What is the relationship between Regression and CEF, and what would be saturated model mean?
Regression is an approximation to the CEF. If the regression model is saturated, it has the same number of parameters as the CEF has distinct values. (Another way of estimating a naive comparison of means)
CEF: E[Y|D]
The CEF is just an average -> it does not mean the relationship is causal
Baseline Bias
Difference in average outcome, in absence of treatment, between the treated and untreated.
E[Y0|D=1] - E[Y0|D=0]
DTE Bias
The benefit of the treatment (causal effect), for those who are treated and untreated is not the same.
If positive the treated gain more.
(1-pi){ E[Y1-Y0|D=1] - E[Y1-Y0|D=0]}
Where pi is the proportion of the sample that is treated
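The two bias cards above combine into the decomposition DIM = ATE + baseline bias + DTE bias. A numpy sketch (illustrative simulated data with self-selection, not from the source) verifies that this identity holds exactly in-sample:

```python
import numpy as np

rng = np.random.default_rng(2)
n = 100_000
y0 = rng.normal(0, 1, n)
gain = rng.normal(2, 1, n)            # heterogeneous treatment effect Y1 - Y0
y1 = y0 + gain
d = ((y0 + gain) > 2).astype(int)     # self-selection into treatment
y = np.where(d == 1, y1, y0)
pi = d.mean()                         # share treated

dim = y[d == 1].mean() - y[d == 0].mean()
ate = gain.mean()
bb = y0[d == 1].mean() - y0[d == 0].mean()                    # baseline bias
dte = (1 - pi) * (gain[d == 1].mean() - gain[d == 0].mean())  # DTE bias
print(abs(dim - (ate + bb + dte)) < 1e-8)  # True: identity holds exactly
```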
Why does matching and regression produce different results?
Matching 'matches' individuals who have the same observable characteristics and computes the ATE/ATT within each group.
OLS is simply a weighted average of the ATE of these groups.
OLS uses all observations, even those off common support. OLS is much quicker and easier.
What happens if there is OVB?
If there is OVB, the CIA doesn’t hold and therefore regression estimates will be biased.
What is the difference between Long and Short Regression?
Long regression controls for selection, so includes a dummy (control) variable, whereas the short regression does not. This means the short regression gives a biased estimate, so the difference between the two is the OVB - baseline and DTE bias occur.
OVB formula and explain each element?
Effect of D in short (biased) =
Effect of D in long (unbiased)
+ relationship between omitted and included (pi1 in the auxiliary regression X = pi0 + pi1 D)
x effect of omitted in long (gamma in the long regression)
So OVB: Beta(short) - Beta(long) = pi1 x gamma
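The OVB formula is an exact in-sample identity for OLS, which a numpy sketch can confirm (simulated data; all coefficient values are illustrative):

```python
import numpy as np

rng = np.random.default_rng(3)
n = 50_000
d = rng.normal(0, 1, n)
x = 0.5 * d + rng.normal(0, 1, n)     # omitted variable, correlated with D
y = 1.0 + 2.0 * d + 3.0 * x + rng.normal(0, 1, n)

def ols(X, y):
    """Least-squares coefficients of y on the columns of X."""
    return np.linalg.lstsq(X, y, rcond=None)[0]

ones = np.ones(n)
b_short = ols(np.column_stack([ones, d]), y)[1]        # short regression
coefs = ols(np.column_stack([ones, d, x]), y)          # long regression
b_long, gamma = coefs[1], coefs[2]
pi1 = ols(np.column_stack([ones, d]), x)[1]            # auxiliary regression

print(abs(b_short - b_long - pi1 * gamma) < 1e-8)      # True: OVB = pi1 x gamma
```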
Advantages of Regression and what does OLS give?
We can add observable characteristics (X) and use them as control variables and if CIA holds, OLS gives estimator of the Average Treatment Effect (ATE).
What are potential omitted variables? In the aux regression?
Potential omitted variables are anything that is correlated with the treatment.
Then in X = Pi0 + Pi1 D, if pi1 is significant, X is correlated with the treatment.
What is a bad control?
A bad control is a variable which is itself an outcome variable: something which might be affected by the treatment.
Be careful with these - more controls are usually better, but never control for variables that are themselves outcomes of the treatment.
Residuals properties
Variance - How well the regression fits the data (R squared)
Residuals have mean zero and are uncorrelated with the regressors by construction.
e(i) = Y(i) - Y_hat(i)
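Both properties can be checked directly with numpy (illustrative simulated data):

```python
import numpy as np

rng = np.random.default_rng(4)
n = 10_000
x = rng.normal(0, 1, n)
y = 1 + 2 * x + rng.normal(0, 1, n)

X = np.column_stack([np.ones(n), x])
beta = np.linalg.lstsq(X, y, rcond=None)[0]
e = y - X @ beta                       # e(i) = Y(i) - Y_hat(i)
print(abs(e.mean()) < 1e-10)           # True: residuals have mean zero
print(abs((e * x).mean()) < 1e-10)     # True: uncorrelated with the regressor
```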
Regression standard errors
For a sample, we estimate Beta with Beta_hat:
SE(Beta_hat) = sigma(e)/sqrt(n) x 1/sigma(X)
sigma(e) is the standard deviation of the residuals; sigma(X) is the standard deviation of the regressor X.
How to calculate an IV estimate?
You have to calculate the Wald ratio (lambda = rho/phi), which equals the reduced form divided by the first stage.
Z->Y / Z->D
How do you calculate the first stage in IV?
Z -> D
P[Di = 1|Zi = 1] - P[Di = 1|Zi = 0] = phi
How do you calculate the reduced form in IV?
Z -> Y
E[Yi|Zi = 1] - E[Yi|Zi = 0] = rho
What is one thing to remember about lambda (Wald Ratio)?
It is a Local Average Treatment Effect (LATE), meaning it is only an average for a certain group, this group being the compliers - those who obey their lottery outcome.
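The three IV cards above can be sketched in numpy with a simulated lottery instrument (always-taker and complier shares, and the effect size, are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(6)
n = 200_000
z = rng.integers(0, 2, n)                  # instrument (lottery)
always = rng.random(n) < 0.1               # always-takers
comply = rng.random(n) < 0.5               # compliers (if not always-takers)
d = np.where(always, 1, np.where(comply, z, 0))  # no defiers by construction
y = 3.0 * d + rng.normal(0, 1, n)          # true treatment effect = 3

phi = d[z == 1].mean() - d[z == 0].mean()  # first stage: Z -> D
rho = y[z == 1].mean() - y[z == 0].mean()  # reduced form: Z -> Y
lam = rho / phi                            # Wald ratio = LATE estimate
print(round(lam, 1))                       # close to the true effect of 3.0
```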
3 assumptions to identify a causal effect with DiD
1) Treatment and control group
2) These treatment and control groups are comparable
3) There is info on treatment and control group, before and after the treatment occurs.
Common (Parallel) Trends Assumption
In the absence of treatment, the difference between the treatment and control group remains constant over time.
How would you violate the common (parallel) trends assumption?
Add an unobserved variable (X) into the regression that is correlated with the treatment and changes at the same time as the treatment -> causes OVB.
4 Pros for using DiD Regression
1) Can easily calculate standard errors for DiD
2) Treatment can be continuous, not just binary
3) Easily add control variables
4) Easily add additional time periods
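A minimal 2x2 DiD regression sketch in numpy (illustrative simulated data): the coefficient on the Treat x Post interaction is the DiD estimate.

```python
import numpy as np

rng = np.random.default_rng(7)
n = 40_000
treat = rng.integers(0, 2, n)          # treatment group dummy
post = rng.integers(0, 2, n)           # post-treatment period dummy
effect = 1.5                           # assumed true DiD effect
y = 2 + 0.5 * treat + 1.0 * post + effect * treat * post \
    + rng.normal(0, 1, n)

# Y = b0 + b1*Treat + b2*Post + b3*(Treat x Post); b3 is the DiD estimate
X = np.column_stack([np.ones(n), treat, post, treat * post])
b = np.linalg.lstsq(X, y, rcond=None)[0]
print(round(b[3], 1))                  # close to the true effect of 1.5
```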
MLDA example vs real effect vs spurious effect
MLDA - parallel trends assumption holds -> simple DiD model works
Real - trends aren't parallel; there is a DiD effect as well as a differential time effect
Spurious - trends aren't parallel; there is no DiD effect, only a differential time effect
2 Problems with normal standard errors in DiD, how to fix
For panel data (repeated observations on the same units over time), they can be a poor estimate of the uncertainty of our estimate.
1) Heteroskedasticity -> Use robust SE
2) Serial correlation -> Use clustered standard errors - they relax the assumption that observations are independent (need a reasonable number of clusters)
Why do we use 2SLS and steps involved in the process?
Used when we combine multiple instruments (and/or add controls) to produce a single IV estimate.
1) First stage is a regression
2) Calculate fitted values (no residual)
3) Estimate second stage
4) Finally, can add control variables simply by adding them to first and second stage.
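The steps above can be sketched in numpy with two simulated instruments (all coefficient values and the confounder are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(8)
n = 100_000
z1 = rng.integers(0, 2, n)                 # instrument 1
z2 = rng.integers(0, 2, n)                 # instrument 2
u = rng.normal(0, 1, n)                    # unobserved confounder
d = 0.5 * z1 + 0.3 * z2 + 0.5 * u + rng.normal(0, 1, n)
y = 2.0 * d + u + rng.normal(0, 1, n)      # true effect of D = 2

# 1) First stage: regress D on the instruments
Z = np.column_stack([np.ones(n), z1, z2])
d_hat = Z @ np.linalg.lstsq(Z, d, rcond=None)[0]   # 2) fitted values

# 3) Second stage: regress Y on the fitted values
X2 = np.column_stack([np.ones(n), d_hat])
beta = np.linalg.lstsq(X2, y, rcond=None)[0]
print(round(beta[1], 1))                   # close to the true effect of 2.0
```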
How can you test whether randomisation was successful?
Can use a t-test on pre-treatment observable characteristics, to test whether there is a statistically significant difference in averages between the treatment and control groups. A p-value < 0.05 (t-statistic > 2) rejects the null of balance.
Confidence Intervals
[Y_bar - 2 x SE(Y_bar), Y_bar + 2 x SE(Y_bar)]
T - test
Want to test whether E[Y] = mu
t(mu) = (Y_bar - mu) / SE(Y_bar)
Or t(0) = Y_bar / SE(Y_bar)
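The two cards above in numpy (illustrative sample with true mean 5; SE(Y_bar) is the sample standard deviation over sqrt(n)):

```python
import numpy as np

rng = np.random.default_rng(9)
y = rng.normal(5, 2, 10_000)               # sample with true mean 5

y_bar = y.mean()
se = y.std(ddof=1) / np.sqrt(y.size)       # standard error of the mean
ci = (y_bar - 2 * se, y_bar + 2 * se)      # rule-of-thumb 95% CI
t0 = y_bar / se                            # t-statistic for H0: E[Y] = 0
print(t0 > 2)                              # True: strongly reject E[Y] = 0
```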
What is non-compliance, and what does it do to randomisation ?
Non-compliance occurs when participants do not adhere to their assigned treatment or control group, leading to a deviation from the intended random assignment.
Causes selection bias which affects results.
What can we do to combat non-compliance?
We could use the Intention to Treat Analysis, which is the impact of offering the treatment, as opposed to the impact of the treatment itself.
ITT maintains the benefits of random assignment. So provides an unbiased estimate for the causal effect of the treatment.