Practical Issues of ML Flashcards

Question 1

Q

How would you determine the size of a validation set?

Answer

A

The validation set needs to be large enough to detect the performance difference between two or more models, but not necessarily much larger.

Question 2

Q

What are some ways to improve the Bias Error?

Answer

A

Improve feature engineering e.g. outlier removal

Improve model architecture or try another method

Reduce regularisation

Increase the model size

Question 3

Q

What are some ways to improve the Variance Error?

Answer

A

Add Regularisation or decrease the model size

Improve feature selection e.g. reducing dimensions, picking subsets, etc…

Add more training data

Question 4

Q

What are some ways to improve the Mismatch Error?

Answer

A

Understand the difference between training and testing sets

Add more training data that is similar to the test cases

Question 5

Q

What is the best way to evaluate a model?

Answer

A

Understand the key aim of the task, and choose the most appropriate single measure for the given task.

If multiple metrics are needed, order their priority

Notes from the Practical Issues lecture that might help in the exam. (5 cards)