Regression Part-9 Model Specification Flashcards

1
Q

What is model specification bias?

A

CLRM assumption that model must be correctly specified

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the attributes of a good model?

A
  1. Parsimony (few variables good)
  2. Identifiability (estimated parameters have unique values)
  3. Goodness of Fit (using evaluation criteria)
  4. Theoretically sound
  5. Exogenous regressors (X uncorrelated with error)
  6. Data coherency (residuals must be white noise)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are types of model specification errors?

A
  1. Omission of a relevant variable(s).
  2. Inclusion of unnecessary variable(s).
  3. Implementing a wrong functional form.
  4. Errors of measurement
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is underfitting?

A

If we omit a variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is overfiiting?

A

If we include irrelevant variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the difference between model specification and model misspecification?

A

Model specification error - we have a true model in mind
Model misspecification - we don’t know the true model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What leads to errors in measurement?

A

Using proxies in the implemented model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the consequences of Omitted Variable bias?

A
  1. If X3 is correlated with X2 then missepcified model coeffs are inconsistent and biased; The alpha2 overestimates the beta2
  2. Var(alpha2) > var(beta2) (overestimated)
  3. The intercept may be underestimated
  4. Standard errors increase; R squared decrease
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the consequences of Irrelevant Variable inclusion bias?

A
  1. the ols estimators are unbiased and consistent; LUE not Best
  2. Var of new model more than true model;
  3. var(alpha2)/var(beta2) = 1/(1-r23squared) or the VIF
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the consequence of the two bias?

A

Underfitting
Overfitting
Coefficients of the variables -
- Biased and inconsistent
- Unbiased and consistent
Error variance
- Incorrectly estimated
- Correctly estimated
Hypothesis testing proc.
- Might be invalid
- Still valid
Variances of coefficients
-Inefficient
-Inefficient (larger

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the consequence of incorrect functional form?

A

incorrect or illogical values of estimated coefficients

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the reasons of errors in measurement bias ?

A

nonresponse errors, reporting errors, and computing errors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the consequence of errors in measurement bias on Y?

A
  1. The OLS estimators are unbiased.
  2. The variances are unbiased.
  3. The estimated variances of the estimators are larger than true model
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the consequence of errors in measurement bias on X?

A
  1. The OLS estimators are biased.
  2. They are also inconsistent; that is, they remain biased even if the sample size increases indefinitely.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is remedy of error in measurement bias?

A
  1. use instrumental or proxy variables.
  2. the data are measured as accurately as possible; avoid errors of recording, rounding, or omission
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How to detect Unnecessary variables specification errors?

A

We can use the 𝐹 and 𝑑 tests to assess the statistical significance of variables or we can also use the Partial 𝐹 test for assessing the significance of a subset of variables (anova of (new model, old model)

17
Q

How to detect Omitted variables specification errors?

A

Informal - residual plot - if pattern then omission
Formal - Ramsey’s RESET (Regression Specification Error Test)

18
Q

What are the steps in Ramseys RESET?

A
  1. Obtain predicted values of Y
  2. Rerun model with predY squared, and cubed
  3. Find out F value
  4. If F significant then misspecified model

It is a test of misspecification only says if there is but not where

19
Q

What is the Mckinnon White Davidson test

A

This test is used to choose between the linear and log-linear regression models.
To see if one specification is better than the other, we can use the MWD test.
𝐻0: linear model π‘Œ is a linear function of 𝑋s
𝐻1: Log-Linear model: ln π‘Œ is a linear function of 𝑋s or a log of the 𝑋

20
Q

What Steps in MWD test?

A
  1. Estimate the linear model and obtain the estimated π‘Œ values
  2. Estimate the log-linear model and obtain the estimated ln π‘Œ values
  3. Obtain 𝑍1𝑖 = ln π‘Œπ‘– βˆ’ lnπ‘Œπ‘–
  4. Regress π‘Œ on the 𝑋s and 𝑍1𝑖.
    Reject 𝐻0 if the coefficient of 𝑍1𝑖 is statistically significant by the usual 𝑑-test.
  5. Obtain 𝑍2𝑖 = exp(lnYi) - Yi
  6. Regress ln π‘Œ on the 𝑋s or logs of 𝑋s and 𝑍2𝑖
    Reject 𝐻1 if the coefficient of 𝑍2 in the preceding equation is statistically significant.
    The idea behind the MWD test is that if the linear model is in fact the correct model, the constructed variable 𝑍1𝑖 should not be significant. The same comment applies to the alternative hypothesis 𝐻1
21
Q

What is the order of harsher penalties imposed by model selection criterias?

A

r squared< adj r squared< AIC< SIC