week 2 part 3 Flashcards
When can the desirable properties of OLS estimators be negatively affected?
When one or more model assumptions are violated.
What is commonly used to investigate the assumptions?
The residuals
What are the 6 assumptions?
- Assumption of linearity
- Assumption of no perfect multicollinearity
- Assumption of constant variability
- Assumption of no autocorrelation
- Assumption of no endogeneity
- Assumption that the error term is normally distributed.
What does the Assumption of linearity claim?
That the regression model is linear, and the parameters are correctly specified.
What do we do if there is no linearity?
Adapt the model to the non-linearity by making simple transformations of the dependent variable and/or the explanatory variables.
What can we use to identify non-linear patterns?
We can use residual plots. Linearity is confirmed if the residuals are randomly distributed over the observations of an explanatory variable. A clear trend in the residuals indicates a non-linear pattern.
What does the Assumption of no perfect multicollinearity claim?
That there is no exact linear relationship between the explanatory variables.
When does Perfect multicollinearity occur?
When two or more explanatory variables have an exact linear relationship. Perfect multicollinearity is easy to detect because the model cannot be estimated.
What is the problem with Multicollinearity?
Its presence results in uncertain estimates of the regression coefficients. Thus, multicollinearity makes it difficult to separate the various influences of the explanatory variables on the dependent variable. If the multicollinearity is severe, we can see that some important explanatory variables become insignificant, and some coefficient estimates may even have incorrect signs.
How do we detect multicollinearity?
With informal methods.
- A high R2 and adjusted R2 combined with individually insignificant explanatory variables, indicate multicollinearity.
- Another guideline is that if the correlation coefficient between two explanatory variables is more than 0.8 or less than -0.8, it suggests serious multicollinearity.
- Unexpected signs in the estimated regression coefficients also indicate this.
What should we do if we found strong multicollinearity?
- Remove one of the collinear variables (those that essentially measure the same thing). Or,
- Obtain more data.
- In some cases, the best option is to do nothing at all if there is justification for keeping all variables.