L5 - Regression - Bias-Variance Trade Off Flashcards
Define bias…
Bias refers to the error a model makes because it is too simple to capture the underlying pattern. High bias means high model error due to underfitting.
Define variance…
Variance refers to how much a model's predictions change depending on the particular training data it saw. High variance means the model has fitted noise in the training data (overfitting) and will not perform accurately on unseen data.
What is the ideal bias-variance trade off for a regression model?
Low bias and low variance; in practice, the balance between the two that minimises total error.
What are the 3 causes of model error? Define each…
Bias - Error from a model that is too simple to capture the true pattern (underfitting).
Variance - Error from a model that is too sensitive to the particular training data it learned from (overfitting).
Irreducible Error - The unavoidable noise in real-world data that no model can remove.
These three sources add up: expected squared error = bias² + variance + irreducible error, as the simulation below estimates.
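A minimal sketch (the data, polynomial degree, and noise level are all assumed for illustration) that estimates the decomposition by refitting the same model on many fresh training samples:

```python
import numpy as np

rng = np.random.default_rng(0)
true_f = lambda x: np.sin(2 * np.pi * x)  # assumed ground-truth function
noise_sd = 0.2                            # source of the irreducible error
x_test = 0.5                              # point at which we decompose the error

preds = []
for _ in range(500):
    # A fresh noisy training sample each round
    x = rng.uniform(0, 1, 30)
    y = true_f(x) + rng.normal(0, noise_sd, 30)
    coeffs = np.polyfit(x, y, deg=3)      # degree controls model complexity
    preds.append(np.polyval(coeffs, x_test))

preds = np.array(preds)
bias_sq = (preds.mean() - true_f(x_test)) ** 2   # systematic error, squared
variance = preds.var()                           # spread across training sets
print(f"bias^2={bias_sq:.4f}  variance={variance:.4f}  irreducible={noise_sd**2:.4f}")
```

Raising `deg` shrinks the bias term and inflates the variance term, which is the trade off in action.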
How can we tune bias and variance?
We tune them by adjusting model complexity: decreasing complexity increases bias and reduces variance, while increasing complexity reduces bias but increases variance.
What is another way of wording the bias-variance trade off of a model?
The bias-variance trade off can also be described as a complexity trade off.
If we achieve the optimal bias-variance ratio, what model complexity does this give us?
Optimal model complexity.
What technique can we use to prevent our model from overfitting, and to encourage generalisation?
Linear Model Regularisation (Shrinkage)
This shrinks the magnitude of the coefficients in the regression model (for example, the high-degree terms of a polynomial regression), as the sketch below demonstrates.
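A minimal sketch (synthetic data and a degree-8 polynomial are assumed) comparing ordinary least squares with a shrunk ridge fit; `alpha` is scikit-learn's name for the tuning parameter:

```python
import numpy as np
from sklearn.linear_model import LinearRegression, Ridge
from sklearn.preprocessing import PolynomialFeatures

rng = np.random.default_rng(1)
x = rng.uniform(-1, 1, (40, 1))
y = np.sin(3 * x).ravel() + rng.normal(0, 0.1, 40)
X = PolynomialFeatures(degree=8, include_bias=False).fit_transform(x)

ols = LinearRegression().fit(X, y)      # no penalty
ridge = Ridge(alpha=1.0).fit(X, y)      # L2 penalty shrinks the coefficients
print("OLS   total |coef|:", np.abs(ols.coef_).sum())
print("Ridge total |coef|:", np.abs(ridge.coef_).sum())
```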
What is the objective of Linear Model Regularisation?
Establish a trade off between bias and variance, resulting in optimal model complexity.
How does Linear Model Regularisation work?
- Introduces a penalty term into the model's loss function.
- The strength of the penalty can be tuned to decrease or increase model complexity.
- The penalty pushes all coefficients towards 0 (see the sketch after this list).
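As a minimal sketch of the idea (the function names are illustrative, not a library API), the regularised loss is the usual squared-error term plus a penalty scaled by the tuning parameter:

```python
import numpy as np

def ridge_loss(beta, X, y, lam):
    """Squared error plus an L2 penalty; larger lam shrinks beta harder."""
    residuals = y - X @ beta
    return residuals @ residuals + lam * (beta @ beta)

def lasso_loss(beta, X, y, lam):
    """Squared error plus an L1 penalty; can drive entries of beta to exactly 0."""
    residuals = y - X @ beta
    return residuals @ residuals + lam * np.abs(beta).sum()
```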
What happens if the tuning parameter of the regression model's cost function is increased?
- Results in more regularisation.
- Decreased model complexity, which leads to underfitting ( increases bias ).
What happens if the tuning parameter of the regression model's cost function is decreased?
- Results in less regularisation.
- Increased model complexity, which leads to overfitting ( increases variance ).
The sweep below illustrates both directions.
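A minimal sketch (assumed synthetic data) sweeping scikit-learn's `alpha` from almost none to very strong: small values overfit (high train score, low test score) and large values underfit (both scores low):

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import PolynomialFeatures

rng = np.random.default_rng(2)
x = rng.uniform(-1, 1, (60, 1))
y = np.sin(3 * x).ravel() + rng.normal(0, 0.2, 60)
X = PolynomialFeatures(degree=10, include_bias=False).fit_transform(x)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

for alpha in [1e-4, 1e-2, 1.0, 100.0]:
    model = Ridge(alpha=alpha).fit(X_tr, y_tr)
    print(f"alpha={alpha:<8} train R^2={model.score(X_tr, y_tr):.3f} "
          f"test R^2={model.score(X_te, y_te):.3f}")
```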
What is L1 regularisation?
- L1 ( LASSO ) applies a penalty that is proportional to the sum of the absolute coefficient values.
- Prevents overfitting and performs feature selection, because the L1 penalty can shrink coefficients exactly to zero (demonstrated below).
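A minimal sketch (the data is assumed: only 2 of 10 features are informative) showing LASSO zeroing out the uninformative coefficients:

```python
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(3)
X = rng.normal(size=(100, 10))
y = 3 * X[:, 0] - 2 * X[:, 1] + rng.normal(0, 0.5, 100)  # features 0 and 1 matter

lasso = Lasso(alpha=0.1).fit(X, y)
# The informative features should survive; the rest are driven to exactly 0
print("non-zero coefficients at indices:", np.flatnonzero(lasso.coef_))
```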
What is L2 regularisation?
- Ridge Regression.
- A penalty is applied that is proportional to the sum of the squared coefficient values.
- Prevents overfitting and improves model stability.
- The penalty imposes some bias, and thus can be used to control the bias-variance trade off (see the sketch below).
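For contrast with LASSO, a minimal sketch (same assumed data pattern as above) showing that the L2 penalty shrinks every coefficient but leaves all of them non-zero:

```python
import numpy as np
from sklearn.linear_model import Ridge

rng = np.random.default_rng(3)
X = rng.normal(size=(100, 10))
y = 3 * X[:, 0] - 2 * X[:, 1] + rng.normal(0, 0.5, 100)

ridge = Ridge(alpha=10.0).fit(X, y)
# Unlike LASSO, ridge shrinks coefficients towards 0 without zeroing them out
print("all coefficients non-zero:", bool(np.all(ridge.coef_ != 0)))
print("coefficients:", np.round(ridge.coef_, 3))
```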
How do we establish the penalty value?
- Through cross-validation: fit the model for a range of lambda values and choose the lambda with the lowest cross-validation error (see the sketch below).
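A minimal sketch (assumed data) using scikit-learn's RidgeCV, which scores each candidate `alpha` by cross-validation and keeps the best one:

```python
import numpy as np
from sklearn.linear_model import RidgeCV

rng = np.random.default_rng(4)
X = rng.normal(size=(100, 10))
y = 3 * X[:, 0] - 2 * X[:, 1] + rng.normal(0, 0.5, 100)

# Candidate lambdas spanning several orders of magnitude, scored by 5-fold CV
model = RidgeCV(alphas=np.logspace(-3, 3, 13), cv=5).fit(X, y)
print("lambda chosen by cross-validation:", model.alpha_)
```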