Model Selection in Multiple Regression Flashcards

1
Q

What is an underfitted model?

A

A model that is too simple

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the danger with underfitted and overfitted models?

A

Poor predictive abilities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What type of variables do we want to include in our model?

A
  • Variables that have a genuine relationship with the response
  • Variables that offer a sufficient amount of new information about the response
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What happens when collinear variables are fitted together in a model?

A
  • The resulting model unstable

- Often obtain inflated standard errors for these estimates

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What can be used to detect Collinearity?

A

Variance Inflation Factors (VIFs)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Write the equation for VIFs

A

1/(1-R^2p)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What R command gives the p values for all predictors assuming that the term is the last in the model?

A

Anova

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What can nested and non-nested models be compared with?

A

Information based fit criteria

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Give examples of information based fit criteria

A

AIC or BIC

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Describe Occam’s Razor in this context

A

When comparing models of equal explanatory power, one should choose the simplest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the AIC statistic?

A

Measure of fit which is penalized for the number of parameters estimated in a model?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does a smaller AIC value signal?

A

A better model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When would you use AICc over AIC?

A

When the sample size is not a great deal larger than the number of parameters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Give the formula for AICc

A

AICc = AIC + 2P(P+1)/N-P-1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Why does the BIC score differ from the AIC?

A

BIC employs a penalty that changes with the sample size (N)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What should you do if you include an interaction term in a model?

A

Also include the main effects associated with it

17
Q

Where should interactions come in the sequence of predictors?

A

Last