10. Model Comparisons Flashcards

Question 1

Q

What is used to test if multiple variables are significant/add to the model? (Name only)

Answer

A

F-Test for overall model significance

Incremental F-Test for overall model significance

Question 2

Q

How does an F-Test work?

Answer

A

Contains F-ratio (the ratio of explained variance to unexplained variance)

F-ratio is is found through the mean sum of squares

F-ratio is then evaluated against an F-distribution with df model, df residual and pre-defined alpha

Question 3

Q

How do we know through the F-test if our model is significant or not?

Answer

A

Bigger the F-ratio = Better model as model variance is larger than the residual variance

F-ratio = Closest to 1 if null hypothesis is true

F > 1 = Increased model variance

Question 4

Q

What is an incremental F-test?

Answer

A

The incremental F-test evaluates the statistical significance of the improvement in variance explained in an outcome with the addition of further predictor(s)

Question 5

Q

How does the incremental F-test work?

Answer

A

Based on the difference in F-values between models

A null or empty model is a linear model with only the intercept, predicted value of the outcome is the mean of the outcome (“the least wrong estimate”), beta = 0

For every predictor added, DF increases

Question 6

Q

How do you interpret an incremental F-test from anova?

Answer

A

Compare the p-value for the F-test to your significance level. If the p-value is less than the significance level, your sample data provide sufficient evidence to conclude that your regression model fits the data better than the model with no independent variables.

Question 7

Q

What is a nested model?

Answer

A

Predictors in one model are a subset in another

Question 8

Q

What is a non-nested model?

Answer

A

Unique variables in both models

Question 9

Q

What can you use to compare models if they are based on the same data set and are nested?

Answer

A

Incremental F-Test

AIC

BIC

Question 10

Q

What can you use to compare models if they are based on the same data set and are non-nested?

Question 11

Q

Can we compare models that are not from the same data set?

Question 12

Q

What are parsimony corrections?

Answer

A

Penalises models for being complex = Helps avoid overfitting (adding predictors arbitrarily to make it fit)

Question 13

Q

What are AIC and BIC parsimony corrections?

Answer

A

BIC have a harsher penalty than AIC for typical sample sizes

Severe parsimony penalty lm(n) > 2

BIC = Difference of 10 shows that one model is better than another

No cut offs for AIC in establishing substantial difference

Question 14

Q