MULTICOLLINEARITY Flashcards
What is multicollinearity?
Multicollinearity is when there is an exact linear relation between regressors
How do we detect multicollinearity? Five ways.
- High R2 but many stat insignificant t-stats.
- High pairwise correlation of explanatory variables (>=0.8)
- High partial correlation between coefficients
- Auxiliary regression producers sig F stat
- High variance inflation factor or low tolerance factor.
What is VIF?
VIF is a measure of the degrees to which the variance of the OLS estimator is inflated due to collinearity.
What are the four consequences of multicollinearity?
1) OLS estimators are still BLUE
2) Large standard errors and small t-stats which make it more difficult to reject the null when we should
3) Can’t conduct reliable hypothesis-testing due to high R2 but insignificant p-values.
4) Coefficients are sensitive to small changes in data
Stata output
High R2 but few significant variables. While the adjusted R2 isn’t too high it is stat significant at the 1% level. There is also a number of insignificant variables that would expect to be significant such as education.
There appears to be high collinearity, even the VIF mean is 2.73.