model fit, overfit, and evaluation Flashcards

1
Q

overfitting

A

the tendency to tailor models to the training data, at the expense of generalization to previously unseen data

2
Q

hold out validation and holdout accuracy

A

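holdout validation sets aside part of the labeled data (the holdout data, also called test data) and does not use it to build the model.
holdout accuracy is the accuracy measured on that data; it is an estimate of generalization accuracy.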
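
A minimal sketch of holdout validation in plain Python. The toy data, the 30% test fraction, the threshold "model", and the helper names `holdout_split` and `accuracy` are all assumptions made for illustration; they do not come from the cards.

```python
import random

def holdout_split(rows, test_fraction=0.3, seed=0):
    """Shuffle the rows and set aside a fraction as holdout (test) data."""
    rng = random.Random(seed)
    shuffled = rows[:]
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]  # (train, holdout)

def accuracy(predict, rows):
    """Fraction of (x, label) rows for which predict(x) equals the label."""
    return sum(predict(x) == y for x, y in rows) / len(rows)

# Hypothetical toy data: the label is 1 when x >= 50.
data = [(x, int(x >= 50)) for x in range(100)]
train, holdout = holdout_split(data)

# Fit a threshold using the training rows only: the midpoint between
# the largest class-0 value and the smallest class-1 value seen in training.
lo = max(x for x, y in train if y == 0)
hi = min(x for x, y in train if y == 1)
threshold = (lo + hi) / 2

def model(x):
    return int(x >= threshold)

print("training accuracy:", accuracy(model, train))
print("holdout accuracy: ", accuracy(model, holdout))
```

The holdout rows play no part in choosing the threshold, which is why the second number can stand in for accuracy on data the model has not seen.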

3
Q

why is overfitting bad

A

a more complex model can pick up spurious correlations in the training data that are not representative of the wider population
these produce incorrect generalizations
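
An extreme case can make this concrete: a model that simply memorizes every training row. The sketch below is an illustration under stated assumptions; the ids, labels, and the name `table_model` are hypothetical, and the id is chosen to carry no real signal.

```python
from collections import Counter

# Hypothetical toy data: (customer_id, label) pairs.
train = list(zip(range(10), [1, 0, 1, 1, 0, 1, 0, 1, 1, 0]))
holdout = list(zip(range(10, 20), [0, 1, 0, 0, 1, 0, 1, 0, 0, 1]))

# "Table model": memorize each training row exactly.
table = dict(train)
# Fallback for ids never seen in training: the majority training label.
majority = Counter(y for _, y in train).most_common(1)[0][0]

def table_model(x):
    return table.get(x, majority)

def accuracy(predict, rows):
    """Fraction of (x, label) rows for which predict(x) equals the label."""
    return sum(predict(x) == y for x, y in rows) / len(rows)

train_acc = accuracy(table_model, train)
holdout_acc = accuracy(table_model, holdout)
print(train_acc, holdout_acc)  # prints: 1.0 0.4
```

The model is perfect on the data it was tailored to and worse than a coin flip on the holdout rows, because the id-to-label pairs it learned are not representative of anything beyond the training set.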

4
Q

how to avoid overfitting in tree induction

A

  1. stop growing the tree before it gets too complex
  2. grow the tree fully, then prune it back to reduce its complexity
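
The first strategy (stopping early) can be sketched with a tiny tree learner over one numeric feature. Everything here is an assumption for illustration: the toy data with two deliberately noisy labels, the misclassification-count split criterion, and the names `grow`, `max_depth`, and `min_leaf`. Pruning an already grown tree is not shown.

```python
from collections import Counter

def majority(rows):
    """Most common label among (x, label) rows."""
    return Counter(y for _, y in rows).most_common(1)[0][0]

def misclassified(rows):
    """Rows that a single majority-label leaf would get wrong."""
    counts = Counter(y for _, y in rows)
    return len(rows) - max(counts.values())

def grow(rows, depth=0, max_depth=None, min_leaf=1):
    """Return a leaf label, or a (threshold, left_subtree, right_subtree) tuple."""
    if len({y for _, y in rows}) == 1:
        return rows[0][1]  # pure node: nothing left to split
    if max_depth is not None and depth >= max_depth:
        return majority(rows)  # stopping rule: tree is deep enough
    best = None
    xs = sorted({x for x, _ in rows})
    for a, b in zip(xs, xs[1:]):
        t = (a + b) / 2  # candidate threshold between neighboring values
        left = [r for r in rows if r[0] < t]
        right = [r for r in rows if r[0] >= t]
        if len(left) < min_leaf or len(right) < min_leaf:
            continue  # stopping rule: a child would be too small
        score = misclassified(left) + misclassified(right)
        if best is None or score < best[0]:
            best = (score, t, left, right)
    if best is None:
        return majority(rows)
    _, t, left, right = best
    return (t,
            grow(left, depth + 1, max_depth, min_leaf),
            grow(right, depth + 1, max_depth, min_leaf))

def predict(tree, x):
    while isinstance(tree, tuple):
        t, left, right = tree
        tree = left if x < t else right
    return tree

def count_leaves(tree):
    if not isinstance(tree, tuple):
        return 1
    return count_leaves(tree[1]) + count_leaves(tree[2])

# Hypothetical toy data: label is 1 when x >= 10, with noisy labels at x=3 and x=15.
noisy = {3: 1, 15: 0}
train = [(x, noisy.get(x, int(x >= 10))) for x in range(20)]

full = grow(train)                 # grown until every leaf is pure
small = grow(train, max_depth=1)   # stopped after a single split

print("leaves:", count_leaves(full), "vs", count_leaves(small))
print("prediction at x=3:", predict(full, 3), "vs", predict(small, 3))
```

The fully grown tree reproduces the two noisy labels, while the depth-limited tree keeps only the single split near x = 10 and ignores them.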