Chapter 10 Flashcards

1
Q

What is cross-validation?

A

In an attempt to estimate the true generalization error of an model on a set of data, the data-set is split up in different sets of trai9ning and test data, and paramter estimation and training and test error is calculated.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the generalization error?

A

A measure of how well our model performs on average assuming that we have infinite test sets.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Hold-out cross validation:

A

Just 1 iteration where test and training are split up and the generalization error is assumed to be equal to the test error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

K-fold-CV:

A

Dataset split up K times into training and test data with N/K observations in each.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is forward selection?

A

Start by having 0 attributes. Then add attribute that minimizes generalization error the most (cross-validation). Then add another variable and do cross validation again - stop when generalization error does not decrease anymore.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is backwards selection?

A

start with full model and remove stuff.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly