model fit, overfit, and evaluation Flashcards
1
Q
overfitting
A
the tendency to tailor models to the training data
2
Q
hold out validation and holdout accuracy
A
holdout accuracy is an estimate of generalization accuracy.
holdout data = test data
3
Q
why is overfitting bad
A
more complex -> spurious correlations (not representative)
product incorrect generalizations
4
Q
avoid overfitting for tree induction
A
- stop growing the tree before it gets too complex
- prune back a tree