Dummy Variables + Heteroscedasticity Flashcards
How can we incorporate qualitative info in our regression model?
Use dummy variables
- sometimes known as binary variables
How does dummy variable work?
Dummy variable = 1 if particular observation has certain characteristic
Dummy variable = 0 of particular observation doesn’t have characteristic
Golden rule when using dummy variables?
When using dummy variables one category must always be omitted
What is dummy variable trap?
Occurs when all categories are included in regression
- results on perfect Collinearity
- MLR 3 violated
- model cannot be estimated
How to interpret dummy variable?
Interpretation of dummy variables is relevant to omitted category- also knows as the base category
What are the consequences of heteroscedasticity?
- OLS still unbiased and consistent as long as MLR 1-4 hold
- Intwepretation of R-Squared and Adj R-Squared unchanged, this is because these are estimates of the population and depend on unconditional variances
- OLS no longer BLUE - min variance property no longer holds (MLR 5)
- Ftests, t-tests and CI no longer valid-this is because the estimated variances of our coefficients are no longer unbiased estimators
- true standard errors often larger than OLS reports
In the B-P test what is evidence against the null? I.e against homoscedasticity?
- a high R-squared from the regression of the squared residuals on all the explanatory variables
What is downfall of B-P test?
Imposes a linear relationship between variance and explanatory variables
- relationship may not be linear
Why is the white test better than the B-P test?
White test detects more general deviations from homoscedasticity than B-P test
- capture more than just linear relationships by including squares and interaction terms
Downfall of white test?
Including all squares and interactions leads to a large number of estimated parameters
What must hold for B-P and White test to hold?
MLR1-4:
- linear in parameters
- random sampling
- no perfect Collinearity
- zero conditional mean
What is main concern when carrying out B-P and Whote test?
- misspecificatipn of model
- if wrong FF is used I.e we use level-level instead of correct log-log form can lead to wrong conclusions