Underfitting and Overfitting Flashcards

1
Q

4 Reasons of Overfitting

A

Limited Training Data
Noisy Data
Model complexity
Low Regularisation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

2 Sign of Overfitting

A

Training error is low
Generalisation error is high

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

3 Overfitting Remedy

A

Get more training data
Clean up the training data (noise).
Simplify or regularise the model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

3 Reason of Underfitting

A

The model is too simple to capture the underlying patterns
The model is constrained
Irrelevant data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

3 Underfitting Remedy

A

Feed better features to the learning algorithm
Reduce the constraints on the model
Select a more powerful model, with more parameters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

2 Error Estimate for Model Selection

A

Optimistic Error Estimate - Not consider complexity
Pessimistic Error Estimate - Consider complexity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Model Evaluation

A

To estimate performance of classifier on previously unseen data (test set)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Holdout

A

Reserve k% for training and (100-k)% for testing.
Random subsampling - Repeated holdout

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Cross validation

A

Partition data into k disjoint subsets
K-fold - Train on k-1 partitions, test on the remaining one

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

4 Step of Using Validation Set

A

Train multiple models with various hyper-parameters training set
Select model that performs best on validation set
Train the best model on the full training set
Evaluate final model on test set (estimate generalization error)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

2 Type of Fine Tune Model

A

Grid Search CV - Evaluate all combinations through a specified hyperparameter space.
Randomised Search CV - Evaluate a given number of random combinations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly