Week 5 Flashcards

1
Q

What is heteroscedasticity?

A

It states the variance of residuals in a regression model changes across values of X. So it violates the key assumption of OLS which is constant variance of errors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is homoscedasticity?

A

The assumption that the error has the constant variance for any values of X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is a graphical trait of heteroscedasticity?

A

When there is a clear pattern or shape in the plot of residuals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What test can be used for heteroscedasticity?

A

The Breusch-pagan test, which measures potential heteroscedasticity. The null hypothesis here is that the variance of the error term is constant. This means the null hypothesis is that there is no heteroscedasticity and the residuals are homoscedastic.
So when the p-value is below 0.05, we can say there is heteroscedasticity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is collinearity?

A

a situation in which 2 or more IVs are closesly related to one anotherWh

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is the issue with collinearity?

A

The issue is that it reduces the accuracy of the estimates of the regression coefficients as it increases the standard error. this then leads to an increase in p-value, resulting in failure to find coefficients are significant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How can we assess collinearity?

A

By using the variation inflation factor (VIF).
VIF of 1 means absence of collinearity
VIF above 5 or 10 means a problematic amount of collinearity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are solutions for collinearity?

A
  1. Omit one of the problematic IVs. Issue is it distorts results and gives wrong picture so should only be used when extremely high collinearity or when IV does not yield any loss of info
  2. Use more observations
  3. Combine 2 IVs into 1. Issue is it makes it hard to interpret the variable because the 2 variables are used.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly