HW 5 Flashcards
It is good practice to perform variable selection based on the statistical significance of the regression coefficients
False, a coefficient's significance depends on which other variables are in the model, so p-values are not a reliable basis for selection
The training risk is an unbiased estimator of the prediction risk
False
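Why false: the same data are used both to fit the model and to score it, so the training risk is optimistic. In standard squared-error notation (notation assumed, not from the card):

$$\hat{R}_{\text{train}} = \frac{1}{n}\sum_{i=1}^{n}\bigl(y_i - \hat{f}(x_i)\bigr)^2$$

On average this underestimates the prediction risk $\mathbb{E}\bigl[(Y_{\text{new}} - \hat{f}(X_{\text{new}}))^2\bigr]$, which is evaluated on data the model has not seen.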
When the number of predicting variables is large, both backward and forward stepwise regressions will always select the same set of variables
False
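A minimal R sketch of the two search directions, assuming a data frame df with response y (all names are placeholders). Because each procedure is greedy, the two paths can end at different models:

```r
full <- lm(y ~ ., data = df)   # largest candidate model
null <- lm(y ~ 1, data = df)   # intercept-only model

# Backward: start full, greedily drop the variable that most improves AIC
back <- step(full, direction = "backward", trace = FALSE)

# Forward: start empty, greedily add the variable that most improves AIC
fwd <- step(null, scope = formula(full), direction = "forward", trace = FALSE)

formula(back)  # the two selected formulas need not agree
formula(fwd)
```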
It is not required to standardize or rescale the predicting variables when performing regularized regression
False
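Why rescaling matters: the penalty treats every slope alike, so a predictor measured in tiny units carries a huge coefficient and is punished disproportionately. A sketch using the glmnet package (an assumed choice; x and y are a placeholder predictor matrix and response):

```r
library(glmnet)

# glmnet standardizes the columns of x internally by default
fit <- glmnet(x, y, alpha = 1, standardize = TRUE)

# Equivalent manual route: rescale the predictors first, then disable it
fit2 <- glmnet(scale(x), y, alpha = 1, standardize = FALSE)
```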
Complex models with many predictors are often extremely biased, but have low variance
False
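Why false: the relationship runs the other way, as the standard decomposition of expected squared error at a point x shows (standard notation, assumed):

$$\mathbb{E}\bigl[(Y - \hat{f}(x))^2\bigr] = \mathrm{Bias}\bigl(\hat{f}(x)\bigr)^2 + \mathrm{Var}\bigl(\hat{f}(x)\bigr) + \sigma^2$$

Adding predictors typically shrinks the bias term and inflates the variance term; $\sigma^2$ is irreducible noise.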
Variable selection is a simple and solved statistical problem since we can implement it using the R statistical software
False
Backward stepwise regression is preferable to forward stepwise regression because it starts with larger models
False
Stepwise regression is a greedy algorithm searching through all possible combinations of the predicting variables to find the model with the best score
False; it is greedy precisely because it does not search all possible combinations, only one add/drop move at a time
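A quick count of the models each procedure fits makes the gap concrete (a standard counting argument, not from the card):

$$\text{all subsets: } 2^p \text{ models}, \qquad \text{forward stepwise: } \sum_{k=1}^{p}(p-k+1) = \frac{p(p+1)}{2} \text{ models}$$

For p = 30 that is roughly 10^9 subsets versus 465 greedy fits.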
Akaike Information Criterion (AIC) is an estimate for the prediction risk
True
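One common form, for a model with p estimated parameters and maximized likelihood $\hat{L}$:

$$\mathrm{AIC} = -2\log\hat{L} + 2p$$

Up to additive constants this estimates the out-of-sample prediction risk, so lower is better; for Gaussian linear regression it reduces to $n\log(\mathrm{RSS}/n) + 2p$.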
Mallows’ Cp statistic penalizes model complexity more than leave-one-out CV and BIC
False; BIC penalizes complexity more than the other approaches
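In one common normalization for linear regression (ISLR-style; p fitted coefficients, noise estimate $\hat{\sigma}^2$):

$$C_p = \frac{1}{n}\bigl(\mathrm{RSS} + 2p\,\hat{\sigma}^2\bigr), \qquad \mathrm{BIC} = \frac{1}{n}\bigl(\mathrm{RSS} + \log(n)\,p\,\hat{\sigma}^2\bigr)$$

Since $\log(n) > 2$ whenever $n \ge 8$, BIC charges more per parameter and so tends to select smaller models.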
Ridge regression is a regularized approach that can be used for variable selection
False
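Why false: the L2 penalty shrinks slopes toward zero but essentially never to exactly zero, so no variable is actually dropped. A glmnet sketch (the package and the lambda value 0.1 are assumptions; x and y are placeholders):

```r
library(glmnet)

ridge <- glmnet(x, y, alpha = 0)    # alpha = 0 selects the ridge penalty

# Count exact zeros among the slope coefficients at an arbitrary lambda
sum(coef(ridge, s = 0.1)[-1] == 0)  # typically 0: every predictor stays in
```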
Lasso regression requires a numerical algorithm to minimize the penalized sum of squares
True
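Why true: ridge has a closed-form solution (intercept handled separately; see the last card), but the lasso's absolute-value penalty is not differentiable at zero, so iterative methods are needed; glmnet, for instance, uses cyclical coordinate descent:

$$\hat{\beta}^{\text{ridge}} = (X^{\top}X + \lambda I)^{-1}X^{\top}y, \qquad \min_{\beta}\ \lVert y - X\beta\rVert_2^2 + \lambda\lVert\beta\rVert_1 \ \text{ has no closed form in general}$$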
The L1 penalty measures the sparsity of a vector and forces regression coefficients to be zero
True
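The mechanism behind the exact zeros: in the orthonormal-design special case (and up to the scaling convention used in the objective), the lasso solution is soft thresholding of the least-squares estimate:

$$\hat{\beta}_j^{\text{lasso}} = \mathrm{sign}\bigl(\hat{\beta}_j^{\text{OLS}}\bigr)\bigl(\lvert\hat{\beta}_j^{\text{OLS}}\rvert - \lambda\bigr)_{+}$$

Any coefficient with $\lvert\hat{\beta}_j^{\text{OLS}}\rvert \le \lambda$ is clipped to exactly zero.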
Elastic net regression uses the penalties of both ridge and lasso regression and hence combines the benefits of both
True
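A glmnet sketch (assumed package; the mixing weight 0.5 is an arbitrary illustration). The alpha argument blends the two penalties, so the fit inherits lasso's sparsity and ridge's stability with correlated predictors:

```r
library(glmnet)

# alpha = 1 is pure lasso (L1), alpha = 0 is pure ridge (L2)
enet <- cv.glmnet(x, y, alpha = 0.5)  # lambda chosen by cross-validation
coef(enet, s = "lambda.min")          # some slopes exactly zero, rest shrunk
```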
In regularized regression, the penalization is generally applied to all p + 1 regression coefficients, where p is the number of predictors
False, the shrinkage penalty is not applied to the intercept
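The point is visible in the objective itself: the penalty sum starts at j = 1, leaving the intercept $\beta_0$ unpenalized (lasso shown; ridge replaces $\lvert\beta_j\rvert$ with $\beta_j^2$):

$$\min_{\beta_0,\,\beta}\ \sum_{i=1}^{n}\Bigl(y_i - \beta_0 - \sum_{j=1}^{p}x_{ij}\beta_j\Bigr)^2 + \lambda\sum_{j=1}^{p}\lvert\beta_j\rvert$$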