Bias Variance Trade off and Inductive Bias Flashcards

1
Q

Reducing bias when having a large variance:

A

When you have a large complex model and training data S is small, small changes to S lead to large changes in the model. This is a risk of overfitting.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Reduce variance having a large bias:

A

When you have a restricted, simple model, and the training data is small, small changes to it don’t lead to large changes in the model. This leads to a risk of underfitting (strong generalisation without sufficiently fitting the model)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What do machine learning algorithms need to be able to generalise to new data?

A

It needs bias (inductive bias)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the two sources of bias?

A

Representation bias

Search bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is inductive bias?

A

It is a criterion (preference or assumption) different from consistency with the data, used to favour a hypothesis over another.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What must inductive algorithms have?

A

Inductive bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the approximation error?

A

How much error we have because we restrict ourselves to a specific class of models (i.e., how much inductive bias we have)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is estimation error?

A

The training error is only an estimation of the true error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does every classification algorithm have?

A

Inductive bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Is bias suitable for all data?

A

Yes and no. Every bias is suitable for some data sets and unsuitable for others.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the best way to find the best result?

A

Trying several different classification algorithms and using the best result (called the Toolbox Approach)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly