Lecture 7: Overfitting and the bias/variance trade-off (flashcards)
review: what is the main aim of regression?
given feature(s) x, we want to predict the target y (note: x can be 1-D or multi-dimensional, y is 1-D)
what is the number one rule for train and test sets
they should never overlap; the test set should always be unseen data
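In practice the split is usually done with a library helper. A minimal sketch, assuming scikit-learn and a synthetic sin(3x) dataset (the variable names and data are illustrative, not from the lecture):

```python
# Hold out unseen test data; the train and test sets never overlap.
import numpy as np
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(100, 1))                 # features (toy data)
y = np.sin(3 * X[:, 0]) + 0.1 * rng.normal(size=100)  # noisy targets

# 20% of the samples are kept aside and only touched for final evaluation
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)
```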
what is overfitting
the fit is very good on the training set but very bad on the test set; this usually means the model is too complex
what is underfitting
the fit is very bad on both the training and test sets; this usually means the model is too simple
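Both failure modes show up when comparing train and test error across model complexities. A hedged sketch using toy sin(3x) data and numpy polynomial fits (none of this is from the lecture):

```python
# Compare a too-simple, a reasonable, and a too-complex polynomial model.
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(-1, 1, 60)
y = np.sin(3 * x) + 0.1 * rng.normal(size=60)
x_train, y_train, x_test, y_test = x[:40], y[:40], x[40:], y[40:]

for deg in (1, 3, 15):
    coeffs = np.polyfit(x_train, y_train, deg)
    train_mse = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
    test_mse = np.mean((np.polyval(coeffs, x_test) - y_test) ** 2)
    print(f"degree {deg:2d}: train MSE {train_mse:.3f}, test MSE {test_mse:.3f}")
# Typically: degree 1 underfits (both errors high), degree 15 overfits
# (low train error, noticeably higher test error), degree 3 sits in between.
```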
what are the reasons for overfitting
the model is too complex, there are too many features, or there are not enough training samples
what are the solutions for overfitting
use a simpler model (e.g. a lower-order polynomial) or use regularisation
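Both fixes can be tried on the same toy data; a sketch assuming scikit-learn, where Ridge's alpha plays the role of 𝜆 from the lecture:

```python
# Fix 1: a simpler model (degree 3). Fix 2: keep the complex model (degree 15)
# but add an L2 penalty via ridge regression.
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression, Ridge

rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, (60, 1))
y = np.sin(3 * X[:, 0]) + 0.1 * rng.normal(size=60)
X_train, y_train, X_test, y_test = X[:40], y[:40], X[40:], y[40:]

models = {
    "degree 15, unregularised": make_pipeline(PolynomialFeatures(15), LinearRegression()),
    "degree 3,  unregularised": make_pipeline(PolynomialFeatures(3), LinearRegression()),
    "degree 15, ridge (L2)":    make_pipeline(PolynomialFeatures(15), Ridge(alpha=0.1)),
}
for name, model in models.items():
    model.fit(X_train, y_train)
    test_mse = np.mean((model.predict(X_test) - y_test) ** 2)
    print(f"{name}: test MSE {test_mse:.3f}")
```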
what are the reasons for underfitting
the model is too simple or the features are not informative enough
what is regularisation
it is an umbrella term that includes methods that force learning algorithms to build less complex models
recall 𝜆 from the previous lecture: what does adding the regularisation term 𝜆·reg(w) to the loss do?
it encourages w to be small; this is called weight decay (L2 regularisation) and penalises more complex models
visually, it makes complex models flatter
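As a sketch in the lecture's 𝜆·reg(w) notation (assuming a squared-error data loss and the L2 penalty), the regularised objective and the gradient-descent update that gives weight decay its name:

```latex
% Regularised least-squares objective: data loss plus lambda times the L2 penalty.
\[
  L(\mathbf{w}) \;=\; \underbrace{\sum_{i=1}^{N}\bigl(y_i - \mathbf{w}^{\top}\mathbf{x}_i\bigr)^2}_{\text{data loss}}
  \;+\; \lambda\,\mathrm{reg}(\mathbf{w}),
  \qquad \mathrm{reg}(\mathbf{w}) = \lVert \mathbf{w}\rVert_2^{2}.
\]
% Gradient-descent update with learning rate eta: the penalty term shrinks
% ("decays") every weight by a factor (1 - 2*eta*lambda) at each step.
\[
  \mathbf{w} \;\leftarrow\; \mathbf{w} - \eta\,\nabla_{\mathbf{w}} L(\mathbf{w})
  \;=\; (1 - 2\eta\lambda)\,\mathbf{w} \;-\; \eta\,\nabla_{\mathbf{w}}\,(\text{data loss}).
\]
```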
what does 𝜆 signify?
it controls the trade-off between the data loss and the regularisation term
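The trade-off can be seen numerically with the closed-form ridge solution w = (XᵀX + 𝜆I)⁻¹Xᵀy; a sketch on assumed toy polynomial features (not the lecture's data):

```python
# Sweep lambda: larger values accept a higher data loss in exchange for
# smaller weights, i.e. a less complex (flatter) model.
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(-1, 1, 40)
y = np.sin(3 * x) + 0.1 * rng.normal(size=40)
X = np.vander(x, 10)                        # degree-9 polynomial features

for lam in (0.0, 0.01, 0.1, 1.0, 10.0):
    w = np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ y)
    data_loss = np.sum((X @ w - y) ** 2)
    print(f"lambda={lam:5.2f}: data loss {data_loss:7.3f}, ||w|| {np.linalg.norm(w):9.2f}")
```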
what is the difference between bias and variance
low bias means the predictions are close to the target on average, while low variance means the spread of the predictions is small
in general, very simple models exhibit what bias and variance
high bias and low variance
in general, very complex models exhibit what bias and variance
low bias and high variance
according to the bias-variance trade-off theorem, the expected MSE on a new test sample x is given by
test error = bias² + variance + irreducible noise:
E[(y − f̂(x))²] = Bias(f̂(x))² + Var(f̂(x)) + σ²
Bias(f̂(x)) = f_avg(x) − f(x), where f_avg(x) = E[f̂(x)] is the prediction averaged over training sets and f(x) is the true function
Var(f̂(x)) = E[(f̂(x) − f_avg(x))²]
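The decomposition can be checked empirically by refitting the same model on many freshly drawn training sets and measuring bias² and variance at one test point. A sketch with an assumed true function sin(3x), noise level σ = 0.1, and a high-variance degree-9 polynomial (none of these choices come from the lecture):

```python
# Estimate bias^2 and variance of f_hat at a fixed point x0 over many training sets.
import numpy as np

rng = np.random.default_rng(0)
f = lambda x: np.sin(3 * x)          # assumed true function
sigma, x0, degree = 0.1, 0.5, 9

preds = []
for _ in range(500):
    x = rng.uniform(-1, 1, 30)                 # fresh training set each round
    y = f(x) + sigma * rng.normal(size=30)
    coeffs = np.polyfit(x, y, degree)
    preds.append(np.polyval(coeffs, x0))
preds = np.array(preds)

f_avg = preds.mean()                           # f_avg(x0): average prediction
bias_sq = (f_avg - f(x0)) ** 2                 # Bias(f_hat(x0))^2
variance = np.mean((preds - f_avg) ** 2)       # Var(f_hat(x0))
print(f"bias^2 = {bias_sq:.4f}, variance = {variance:.4f}, noise = {sigma**2:.4f}")
print(f"predicted test MSE at x0 ≈ {bias_sq + variance + sigma**2:.4f}")
```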