Statistical Decision Theory Flashcards

1
Q

What is the Expected Prediciton Error of a function?

A

EPE(f) = E[ (y - f(x))² ]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Using MSE, what is the optimal prediction function?

A

f* = E_Y|X ( Y|X=x )

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How does k-NN approximate the optimal solution?

A

f_kNN(x) = Ave( yi|xi in Nk(x) )
1. The expectation is approximated by averaging over sample data.
2. Conditioning at a point is relaxed to conditionning on its neighbourhood.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How does linear regression approximate the optimal solution?

A

f_LR(x) = x.T * w
LR replaces expectation by average over the training data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the inductive bias of an algorithm and what are those of k-NN and LR?

A

The inductive bias of an algorithm is the set of assumptions made by using it.
The inductive bias of k-NN is that it assumes the target function is well approximated by a locally constant funciton.
The inductive bias of LR is that it assumes the target function is well approximated by a globally linear funciton.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the “curse of dimensionality”?

A

To cover a given percentage of an output space, the more dimensions it has, the wider the range you’ll need to cover.
For example, to cover 25% of a 2D square, you’ll need to consider a square which has 50% its width and 50% its height. To cover the same percentage of a 3D cube, you’ll need to cover ~63% of each of its dimension.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the 0/1 loss function?

A

For a classification problem, it is the function which is 0 for correct classifications and 1 for misclassifications.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the Bayes Classifier?

A

f_BC(x) = argmax[ P(g|X=x) ] for g in G (the set of all possible classes)
It is the optimal solution for a classification problem with 0/1 loss function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the Bayes Error Rate?

A

It is the error rate of the Bayes classifier. It is the lowest possible error rate a classifier can achieve.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly