Practical Issues of ML Flashcards
Notes from the Practical Issues lecture that might help in the exam.
How would you determine the size of a validation set?
The validation set needs to be large enough to detect the performance difference between two or more models, but not necessarily much larger.
What are some ways to improve the Bias Error?
Improve feature engineering e.g. outlier removal
Improve model architecture or try another method
Reduce regularisation
Increase the model size
What are some ways to improve the Variance Error?
Add Regularisation or decrease the model size
Improve feature selection e.g. reducing dimensions, picking subsets, etc…
Add more training data
What are some ways to improve the Mismatch Error?
Understand the difference between training and testing sets
Add more training data that is similar to the test cases
What is the best way to evaluate a model?
Understand the key aim of the task, and choose the most appropriate single measure for the given task.
If multiple metrics are needed, order their priority