Practical Issues of ML Flashcards

Notes from the Practical Issues lecture that might help in the exam.

1
Q

How would you determine the size of a validation set?

A

The validation set needs to be large enough to detect the performance difference between two or more models, but not necessarily much larger.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are some ways to improve the Bias Error?

A

Improve feature engineering e.g. outlier removal

Improve model architecture or try another method

Reduce regularisation

Increase the model size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are some ways to improve the Variance Error?

A

Add Regularisation or decrease the model size

Improve feature selection e.g. reducing dimensions, picking subsets, etc…

Add more training data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are some ways to improve the Mismatch Error?

A

Understand the difference between training and testing sets

Add more training data that is similar to the test cases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the best way to evaluate a model?

A

Understand the key aim of the task, and choose the most appropriate single measure for the given task.

If multiple metrics are needed, order their priority

How well did you know this?
1
Not at all
2
3
4
5
Perfectly