Instumental Variables Flashcards

1
Q

! A good instrument must fulfil the following conditions:

A
  1. Relevance: Cov (z, x) ≠ 0. This means that the IV is correlated with the endogenous RHS variable.
  2. Exogeneity: Cov (z, u) = 0. This means that the IV should not be correlated with the error term. In other words, the IV only affects the dependent variable through the endogenous RHS, not directly.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Consider a simple model to estimate the effect of personal computer (PC) ownership on college grade point average for graduating seniors at a large public university, where PC is a binary variable indicating PC ownership:

GPA = b0 + b1PC + u

i) Why might PC ownership be correlated with u?

A

a) Measurement error in PC:
If we have a random error when observing PC ownership, then this error would add to the error term and cause a correlation between the measured PC ownership dummy and the residual.

b) Simultaneous relationship:
It is logical to assume that PC ownership may affect grades, but would grades also affect PC ownership? If we have reasons to believe this, then PC and u should be correlated.

c) Omitted variable bias:
If there is another variable that affects performance in school and correlates with PC ownership as well, then we should expect that u is correlated with PC. It has been well established that socioeconomic status affects student performance. The error term u contains, among other things, family income, which has a positive effect on GPA and is also very likely to be correlated with PC ownership.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Endogeneity through Simultaneous relationships is:

A

If X affect Y but also Y affects X then Corr (X, u) ≠ 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Endogeneity problem is when…

A

… the independent variable is correlated with the error term.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the econometric problem if X is an endogenous explanatory variable and the model is estimated with Ordinary Least Squares (OLS).

A

The econometric problem is that if Innovation is, therefore, correlated with the error term E(u|Firmsize, Innovation)≠0, it causes OLS to be a biased or inconsistent estimator.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How are OLS and IV estimates of a coefficient different?

A

The OLS estimates of a model that requires an IV estimate will be biased, but precise, while the IV estimate will be consistent, but the standard errors incorrect. This is because the IV estimates variance is based only on the variation in the IV.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How are proxy and instrumental variables different in their relationships to x?

A

A proxy variable uses a direct relationship between z and x, whereas an instrumental variable looks at the indirect relationship between z and x.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

If z is correlated with y, can it be a valid instrument?

A

Yes, as long as it’s not directly correlated.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Write the reduced form equation for X means…

A

…write down a linear regression model explaining the endogenous explanatory variable by all exogenous variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Do x and u need to be uncorrelated in order to use z as a valid instrument for x?

A

No.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Do z and u need to be uncorrelated to use z as a valid instrument for x?

A

Yes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

While a proxy variable must be highly correlated/uncorrelated with the omitted variable, a good instrumental variable must not be correlated / uncorrelated with the ommited variable.

A

A good proxy must be highly correlated with the ommited (through the error term) where as the a good instrumental must not be correlated with the ommited.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Which gets you higher standard errors? OLS or IV?

A

Standard errors of the estimated parameters with IV are substantially larger than those of OLS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the consequence of overidentification

A

Adding more instumnets than need can cause severe bias in the 2SLS estimators.

The standard errors of the estimated regression parameters will become smaller.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What conclusion can be drawn if H0 (no overidentification) of the (Hensen) Sargan test is rejected?

A

If we reject the H0 of the Sargan test then Overidentification is a stat. sig. probelm and we cannot trust the IV (2SLS) estimates because they are inconsistent.

They do not solve the bias problems we already knew existed when estimating by OLS.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Please describe how you would test for over-identification in the model outlined above. Why is it important to test for over-identification?

A

(Hensen) Sargan test:

Save the residuals of the second stage, and regress these on all the instruments and all exogenous variables.

Compute the Chi-square test statistics using the N*R2 of that regression.

H0: No overidentification*

(there may be overidentification however it is not a statistical problem.)

H1: Overidentification

Reject H0 if N*R2 > chi2

It is important, as it is a test for the validity of the instruments. If H0 (no overoveridentification) is
rejected we cannot trust the 2SLS estimates.

15
Q

Please describe how you would test if X (eg Innovation) is an endogenous explanatory variable in the model outlined above. Why is it important to test for endogeneity?

A

Hausman-Wu test:

Save the residuals of the first stage regression, put them in the equation of all IV and exogenous variables and test if its coefficient is significantly different from zero (standard t or z test).

If significant: reject the null of exogeneity, hence in favour of endogeneity.

It is important to test for this because we would use IV while it is not necessary (not endogenous)over the more efficeint OLS.

16
Q

When the explanatory variables are exogenous the 2SLS estimator is … efficient than the OLS because…

A

…less

…the 2SLS has large standard errors.

17
Q

Consider the estimation of the following linear regression model.
Y = b0 + b1 X + u, where Y and X are variables, u is an error term, and b0 and b1 are parameters. X is an endogenous explanatory variable.

What is true?
Ordinary Least Squares (OLS) is..

a. an inefficient estimator.
b. an inconsistent estimator.
c. an unbiased estimator.
d. a consistent estimator.

A

an inconsistent estimator

18
Q

Consider the estimation of the following linear regression model.
Y = b0 + b1 X + u, where Y and X are variables, u is an error term, and b0 and b1 are parameters. X is an endogenous explanatory variable.

Suppose a variable Z is used as an instrument for X. What is true? Z is a valid instrument if..

a. it is correlated with X and correlated with u.
b. it is uncorrelated with X and uncorrelated with u.
c. it is correlated with X and uncorrelated with u.
d. it is uncorrelated with X and correlated with u.

A

it is correlated with X and uncorrelated with u.

19
Q

Consider the estimation of the following linear regression model.
Y = b0 + b1 X + u, where Y and X are variables, u is an error term, and b0 and b1 are parameters. X is an endogenous explanatory variable.

Suppose we use a proxy variable W to deal with a suspected omitted variable that causes
X to be endogenous. What is true? This solves the endogeneity problem if..

a. the proxy W is correlated with u and once controlled for W in the model, the (new) error term is uncorrelated with X.
b. the proxy W is perfectly correlated with X.
c. the proxy W is correlated with u and once controlled for W, the new error term is
uncorrelated with X and uncorrelated with W.
d. the proxy W is correlated with X but not with u.

A

the proxy W is correlated with u and once controlled for W, the new error term is
uncorrelated with X and uncorrelated with W.

20
Q

Consider the estimation of the following linear regression model.
Y = b0 + b1 X + u, where Y and X are variables, u is an error term, and b0 and b1 are parameters. X is an endogenous explanatory variable.

A variable Z is used as an instrument for X. What is true? The standard error of the slope parameter is inversely related to..

a. the correlation between X and Y (in absolute terms).
b. the correlation between X and u (in absolute terms).
c. the correlation between X and Z (in absolute terms).
d. the correlation between Z and Y (in absolute terms).

A

the correlation between X and Z

21
Q

Consider the estimation of the following linear regression model.
Y = b0 + b1 X + u, where Y and X are variables, u is an error term, and b0 and b1 are parameters. X is an endogenous explanatory variable.

A variable Z is used as an instrument for X. What is true? Exogeneity of Z..

a. can be tested with the first stage regression results.
b. can be tested with the Over-Identification test (Sargan test).
c. can be tested with the Hausman-Wu test.
d. cannot be tested.

A

can be tested with the Hausman-Wu test.

22
Q

Consider the estimation of the following linear regression model.
Y = b0 + b1 X + u, where Y and X are variables, u is an error term, and b0 and b1 are parameters. X is an endogenous explanatory variable.

In the presence of a high and significant correlation between X and Z, it can be concluded that..
a. Z is exogenous.
b. Z is a relevant instrument and can be used to instrument X.
c. Z is a relevant instrument and since it measures the same construct as X, it can be used
as a proxy for X.
d. Z is a relevant instrument and can be used to instrument X, under the assumption Z is exogenous.

A

Z is a relevant instrument and can be used to instrument X, under the assumption Z is exogenous.

23
Q

How do you check for the relevance of the instrumental variable?

A

We check for relevance by examining the value of the F - statistic of the instrumental variable in the regression of the endogenous variable on the IV.

We want F-statistic > 10

24
Q

Please write the statistical steps to justify the relevance of the excluded instruments. What do you conclude?

A

Ho: b1=b2=0
H1: Ho is not true (or at least one of them is different from zero).

If the F stat > F critical value, then reject Ho.

If we reject Ho IVs are jointly significant.

As a result, we can conclude that the instruments seem to be relevant, which may increase the unbiasedness of the 2SLS estimator and affect the power of the significant test.

25
Q

A Hausman-Wu test the corresponding t-test statistic (i.e. corresponding to the first-stage residual coefficient) equals 1.50. What can be concluded and what is recommendable concerning the estimation of the model?

A

The Hausman-Wu test statistic can be compared with the critical value of 1.96 => Do not reject the null of exogeneity. Hence, there is no empirical evidence that Innovation is an endogenous explanatory variable.

It is recommendable that the equation of interest is re- estimated with OLS, as it is more efficient than IV/2SLS.

26
Q

There is one endogenous variable, instrumented with three variables, hence 2 degrees of freedom for this test, the critical value is 5.99 with a 5% level of significance.

An over-identification (Sargan) test is carried out and the corresponding Chi-Squared test statistic is equal to 25.7. What can be concluded?

A

This means the null of no over-identification is rejected, therefore, the results are invalid.

27
Q

IF the instruments seem to be relevant we conclude that …

A

…instruments may increase the unbiasedness of the 2SLs estimator and affect the power of the significant test.