Bias Variance Tradeoff & Linear Models Flashcards

1
Q

What is bias within the context of modeling?

A

Basically it describes inflexibility in a model- a model’s inability to capture the effects of other random variables that have been omitted from the model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does low bias lead to?

A

Low bias leads typically leads to overfitting, which leads to high variance (out of sample performance) but low error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does high bias lead to?

A

It leads to underfitting, and consequently low variance and high error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why is the violation of the strict exogeneity assumption problematic ?

A

A violation means there is endogeneity, which means there is omitted variable bias (model is mis-specified)

This means that the error term is capturing the effects of omitted variables, and the residuals will be correlated with other Independent variables, which induced multicollinearity -> large sampling variability and incorrect standard errors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How can the impact of multicollinearity be measured?

A

VIF- multicollinearity Very Intensely Fucks standard errors (use Variance Inflation Factor)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why do we sometimes assume the error term is normally distributed?

A

It is a known property from probability theory that a random variable that is a linear transformation of a normal random variable is also normally distributed! So if the errors are normally distributed, we can then assume the Betas are normally distributed and conduct the proper inference assuming normality.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly