Session 4.2 Flashcards
Higher number of folds means…
having to run more models, having larger train sets and smaller test sets
10 folds are the most common option. 5 folds are also frequently used.
Having larger training sets leads to…
better performance in each model
Having smaller test sets leads to
higher variance across models
The Cumulative Response curve
plots the true positive rate as a function of the percentage of test instances targeted
Underfitting
A model that is too simple does not fit the data well (high bias)
e.g., fitting a quadractic function with a linear model
Overfitting
A model that is too complex fits the data too well (high variance)
e.g., fitting a quadractic function with a 3rd degree function
Bias
a model that underfits is wrong on average (high bias) but is not highly affected by slightly different training data
Variance
a model that overfits is right on average, but is highly sensitive to specific training data
Ensemble methods
use multiple algorithms to obtain better predictive performance than could be obtained from any of the algorithms by itself
Using multiple algorithms usually increases model performance by:
reducing variance: models are less dependent on the specific training data
Examples:
- Bagging (or bootstrap aggregation)
- Random Forest
- Boosting