Topic 10: Specification and Data Issues Flashcards
Why would you need a proxy variable?
A proxy variable can stand in for an unobserved variable when you suspect that leaving that variable out would cause omitted variable bias.
What is a proxy variable?
An observed variable that is closely correlated with, but not identical to, an unobserved explanatory variable.
What is functional form misspecification?
Bias caused by omitting variables that are functions of other variables in the model (such as an excluded x^2 term).
How would you test for functional form misspecification?
By using the Ramsey RESET test.
What kind of bias occurs when a model does not properly account for the relationship between the dependent and independent variables because the correct explanatory variable is not observed?
Omitted variable bias.
How does classical measurement error in the dependent variable affect the bias and variance of an OLS estimator?
It does not cause bias, but does increase the variance of the OLS estimator.
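A small simulation can make this concrete. The sketch below is an illustration under assumed values (a true slope of 2, standard normal x and u, and a measurement error with standard deviation 2 that is independent of both); none of these numbers come from the cards.

```python
import random
import statistics


def ols_slope(x, y):
    """Slope of a simple regression of y on x: cov(x, y) / var(x)."""
    mean_x, mean_y = statistics.fmean(x), statistics.fmean(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    return sxy / sxx


def simulate_slopes(error_sd, reps=500, n=200, beta1=2.0, seed=0):
    """Mean and standard deviation of the OLS slope across simulated samples."""
    rng = random.Random(seed)
    slopes = []
    for _ in range(reps):
        x = [rng.gauss(0, 1) for _ in range(n)]
        # True y* = 1 + beta1*x + u; we observe y = y* + e,
        # with e independent of x and u (assumed classical error).
        y = [1.0 + beta1 * xi + rng.gauss(0, 1) + rng.gauss(0, error_sd)
             for xi in x]
        slopes.append(ols_slope(x, y))
    return statistics.fmean(slopes), statistics.stdev(slopes)


mean_clean, sd_clean = simulate_slopes(error_sd=0.0)
mean_noisy, sd_noisy = simulate_slopes(error_sd=2.0)
print(f"no measurement error: mean slope {mean_clean:.3f}, sd {sd_clean:.3f}")
print(f"error in y:           mean slope {mean_noisy:.3f}, sd {sd_noisy:.3f}")
```

In both cases the average slope should sit near the assumed true value, while the spread of the estimates is noticeably larger when y is measured with error.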
What is classical measurement error?
When the reported value differs from the true value by an error e that is pure noise: e carries no information about the individual, so cov(e, x*) = 0 and cov(e, u) = 0, where x* is the true value, which is not observed.
What is attenuation bias?
When an estimator is biased toward zero, so that its expected value is smaller in absolute value than the parameter. It arises from classical measurement error in an independent variable.
What is the form of inconsistency in the OLS estimator under the assumption of classical measurement error in the explanatory variable?
The probability limit of beta1^ equals beta1 times one minus the ratio of the variance of e to the sum of the variances of x* and e: plim(beta1^) = beta1 * [1 - Var(e) / (Var(x*) + Var(e))] = beta1 * Var(x*) / (Var(x*) + Var(e)).
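The attenuation factor can be checked numerically. This is a minimal sketch with assumed values (beta1 = 2, Var(x*) = 1, Var(e) = 0.5), so the formula predicts a probability limit of 2 * 1 / 1.5, or about 1.33.

```python
import random
import statistics


def ols_slope(x, y):
    """Slope of a simple regression of y on x: cov(x, y) / var(x)."""
    mean_x, mean_y = statistics.fmean(x), statistics.fmean(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    return sxy / sxx


rng = random.Random(42)
n = 100_000
beta1 = 2.0                   # assumed true slope
var_xstar, var_e = 1.0, 0.5   # assumed variances of x* and e

x_star = [rng.gauss(0, var_xstar ** 0.5) for _ in range(n)]
y = [1.0 + beta1 * xs + rng.gauss(0, 1) for xs in x_star]
# Observed regressor: true value plus classical measurement error.
x_obs = [xs + rng.gauss(0, var_e ** 0.5) for xs in x_star]

slope_true = ols_slope(x_star, y)   # regression on the true x*
slope_obs = ols_slope(x_obs, y)     # regression on the mismeasured x
predicted_plim = beta1 * var_xstar / (var_xstar + var_e)

print(f"slope using x*:        {slope_true:.3f}")
print(f"slope using x:         {slope_obs:.3f}")
print(f"formula's prediction:  {predicted_plim:.3f}")
```

With a sample this large, the slope from the mismeasured regressor should land close to the formula's prediction rather than to beta1.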
What are the assumptions needed to have a good proxy variable?
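Where x3 is the proxy variable for unobserved x3*, and v is the error in the relationship between x3* and x3:
Cov(x, v) = 0 or very close to it, where x stands for the included explanatory variables.
x3 and x3* need to be correlated; the strength of that relationship is represented by delta1.
Cov(u, x3) = 0 (but not necessary)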
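It may help to see why these conditions matter by writing out the substitution. The notation below is assumed for illustration: a model with two observed regressors x1 and x2, coefficients beta0 to beta3, and an intercept delta0 in the proxy relationship; only x3, x3*, u, v and delta1 appear in the cards.

```latex
\begin{align*}
y     &= \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_3^* + u \\
x_3^* &= \delta_0 + \delta_1 x_3 + v \\
\Rightarrow\quad
y     &= (\beta_0 + \beta_3 \delta_0) + \beta_1 x_1 + \beta_2 x_2
         + \beta_3 \delta_1 x_3 + (u + \beta_3 v)
\end{align*}
```

The new error term is u + beta3 * v, so OLS on the observed variables recovers beta1 and beta2 only if both u and v are uncorrelated with x1, x2 and x3. The coefficient on the proxy estimates beta3 * delta1, not beta3 itself.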
How do you run a Ramsey RESET test?
Run the original regression, predict the fitted values (predict varname, xb), generate the square and cube of the fitted values, rerun the original regression including these squared and cubed terms, then run a joint F test on the squared and cubed terms.
What is the null hypothesis in the Ramsey RESET test?
The null hypothesis is that the model is correctly specified, that is, delta1 = 0 and delta2 = 0 (the coefficients on the squared and cubed fitted values). A small p-value is strong evidence against the current specification; a large p-value means we fail to reject it.
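The RESET steps can also be sketched outside Stata. The Python below is an illustrative version that uses only the standard library; the helper names (`ols_fit`, `reset_f_stat`) and the simulated data are assumptions made for the example. It reports the F statistic only, since the standard library has no F distribution; as a rough guide, the 5% critical value for 2 numerator degrees of freedom and a large sample is about 3.0.

```python
import random


def ols_fit(X, y):
    """OLS via the normal equations; returns (fitted values, sum of squared residuals)."""
    k = len(X[0])
    # Augmented matrix [X'X | X'y].
    A = [
        [sum(row[i] * row[j] for row in X) for j in range(k)]
        + [sum(row[i] * yi for row, yi in zip(X, y))]
        for i in range(k)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(k):
            if r != col:
                factor = A[r][col] / A[col][col]
                A[r] = [a - factor * b for a, b in zip(A[r], A[col])]
    beta = [A[i][k] / A[i][i] for i in range(k)]
    fitted = [sum(b * v for b, v in zip(beta, row)) for row in X]
    ssr = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    return fitted, ssr


def reset_f_stat(x, y):
    """F statistic for the squared and cubed fitted values in a regression of y on x."""
    n = len(y)
    # Step 1: original regression and its fitted values.
    fitted, ssr_restricted = ols_fit([[1.0, xi] for xi in x], y)
    # Step 2: rerun with fitted^2 and fitted^3 added.
    X_aug = [[1.0, xi, f ** 2, f ** 3] for xi, f in zip(x, fitted)]
    _, ssr_unrestricted = ols_fit(X_aug, y)
    # Step 3: joint F test of the two added terms (q = 2, df = n - 4).
    return ((ssr_restricted - ssr_unrestricted) / 2) / (ssr_unrestricted / (n - 4))


rng = random.Random(1)
x = [rng.gauss(0, 1) for _ in range(500)]
y_linear = [1.0 + 2.0 * xi + rng.gauss(0, 1) for xi in x]
y_quadratic = [1.0 + 2.0 * xi + xi ** 2 + rng.gauss(0, 1) for xi in x]

f_linear = reset_f_stat(x, y_linear)        # linear model is correct here
f_quadratic = reset_f_stat(x, y_quadratic)  # linear model omits x^2 here
print(f"F, correctly specified model: {f_linear:.2f}")
print(f"F, omitted x^2 term:          {f_quadratic:.2f}")
```

When the true relationship contains x^2 but the fitted model is linear, the F statistic should be very large, which corresponds to a small p-value and rejection of the specification.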
When a model does not properly account for the relationship between the dependent and explanatory variables because the correct explanatory variable is not observed, what kind of bias occurs?
Omitted variable bias.
Can the Ramsey RESET test be used to test for general omitted variable bias?
No. It tests only for functional form misspecification.
If the inclusion of one or more proxy variables for an unobserved omitted variable causes the coefficient estimate of the key variable to be lower, what would our conclusion be?
That the true return to the key variable is lower than the estimate without the proxies suggested, because that estimate was inflated by omitted variable bias.
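A simulated wage example illustrates this pattern. Everything in the sketch is hypothetical: the variable names (`educ`, `iq`, `ability`), the coefficients, and the data-generating process, which is built so that the proxy assumptions hold exactly (education is related to ability only through the proxy). The assumed true return to education is 0.08.

```python
import random


def ols_coefs(X, y):
    """OLS coefficients via the normal equations (Gauss-Jordan with partial pivoting)."""
    k = len(X[0])
    A = [
        [sum(row[i] * row[j] for row in X) for j in range(k)]
        + [sum(row[i] * yi for row, yi in zip(X, y))]
        for i in range(k)
    ]
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(k):
            if r != col:
                factor = A[r][col] / A[col][col]
                A[r] = [a - factor * b for a, b in zip(A[r], A[col])]
    return [A[i][k] / A[i][i] for i in range(k)]


rng = random.Random(7)
n = 20_000
true_return = 0.08  # assumed true coefficient on education

iq = [rng.gauss(0, 1) for _ in range(n)]                 # observed proxy (standardized)
ability = [0.8 * q + rng.gauss(0, 0.6) for q in iq]      # unobserved; v is independent of educ
educ = [12 + 2 * q + rng.gauss(0, 2) for q in iq]        # key variable, related to iq
log_wage = [1.0 + true_return * e + 0.5 * a + rng.gauss(0, 0.5)
            for e, a in zip(educ, ability)]

# Ability omitted, no proxy: the education coefficient absorbs part of ability's effect.
b_short = ols_coefs([[1.0, e] for e in educ], log_wage)[1]
# Ability omitted, proxy included.
b_proxy = ols_coefs([[1.0, e, q] for e, q in zip(educ, iq)], log_wage)[1]

print(f"education coefficient without proxy: {b_short:.3f}")
print(f"education coefficient with proxy:    {b_proxy:.3f}")
```

Under these assumptions the coefficient without the proxy should come out well above 0.08, and adding the proxy should bring it back down close to the assumed true return.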