5. Sampling methods Flashcards
1
Q
Sampling techniques
A
- Validation (Stratified (partition) sampling)
—- Cross validation
—- Validation Set Approach (Random partitioning)
—- K-fold Cross validation / Leave-1-out Cross validation - Bootstrap (Sampling with replacement)
- Regularization (to prevent overfitting)
—- Best subset selection
—- Stepwise Selection ( Forward and Backward )
2
Q
Purpose of sampling
A
- Choosing the right model
- Obtain information about the test error
- Data reduction
( Quantitavily MSE, Qualitatively (AUROC or misclassification rate) )
3
Q
Test error vs. Training error
A
Test error: hard to estimate
Training error: easy to calculate