SU6 - Heteroskedasticity, Binary Variables and Introduction to Maximum Likelihood Estimation Flashcards
What is homoskedastic? SU6CH1
When Var(e|Y) = Ο2, i.e the variance of e is constant no matter the number of Y
What is heteroskedasticity? SU6CH1
When the conditional variance of the error term e varies systematically with the regressors.
What are the two consequences of heteroskedasticity? SU6CH1
1) if OLS estimators are computed under the assumption of homoskedasticity, the estimators of the variances will be biased and incorrect, which leads to hypothesis testing to be incorrect
2) if Var(e|X) is not constant, OLS is no longer BLUE
What does heteroskedasticity not affect? SU6CH1
1) does not cause OLS estimators to be biased or inconsistent
2) R squared is unaffected
What is Heteroskedastic Robust Standard Errors? SU6CH1
To find the variance under the assumption of heteroskedasticity.
However, this formula is applicable to both hetero and homo
What are the tests used to test for heteroskedasticity? SU6CH1
Breusch-Pagan test and the white test (more efficient)
The test is πΈ(π2|π1,π2,β¦,ππ)is constant (homoskedasticity), or depends on the regressors (heteroskedasticity)
Limitation of the Breusch-Pagan test? SU6CH1
Can only detect if the regressors have linear effects on the error variance. Nonlinear effects cannot be detected
Limitation of the White test? SU6CH1
Can detect nonlinear effects but when the regression model has many regressors, this approach becomes quickly infeasible because there will be squares and cross-products
What is a binary variable also called?
dummy variable
What is the dummy variable trap?
Perfect collinearity arises from adding all the dummy variables for each quality
How to avoid the dummy variable trap?
1) drop one dummy variable
2) if both dummies are included, drop the intercept
What is the multiple linear regression model called when the dependent variable is a binary variable?
Linear probability model
Whatβs the issue with the linear probability model?
The model is unlikely to be linearly related to the independent variable for all their possible values.
eg. going from zero to four young children reduces the probability of working by > 1, which is impossible
Is the linear probability model heteroskedastic or homoskedastic?
heteroskedastic, therefore, do use robust standard errors when conducting hypothesis testing
What does including a dummy variable do to a regression model?
it shifts the intercept, thus each group indicated by the binary variable may have a different intercept