Applying ML Flashcards
Exam2
Should you use the training, validation, or test set of data to choose the final model?
You should use the validation set of data because it provides an independent evaluation of the final model’s performance
Should you use the training, validation, or test set of data to report the accuracy of your model?
You should use the test set of data because it provides a more reliable assessment of how well your model performs on unseen data
What does the training and testing error look like in a high bias model?
Both the training and test errors are HIGH
What does the training and testing error look like in a high variance model?
The training error is LOW, and the testing error is HIGH
What does the training and testing error look like in a model with both high bias and high variance
The training error is HIGH, and testing error is GREATER than the training error
High Variance
Overfit
High Bias
Underfit
Can you use regularization in a neural network? What does it do?
YES, regularization can be used to prevent overfitting by using lambda
Name 3 things to try to improve a model with high bias
- Try getting additional features
- Try adding polynomial features
- Try decreasing lambda
Name 3 things to try to improve a model with high variance
- Get more training examples
- Try smaller sets of features
- Try increasing lambda
Adding data to a model with high bias won’t help. Why?
This won’t help because high bias indicates the model is too simplistic and unable to capture the underlying patterns in the data, regardless of the amount of data available.
Should you increase or decrease lambda used in regularization for a model with high variance?
Increase lambda to penalize overfitting