Part 2. K Nearest neigbors Flashcards

1
Q

Name examples for using K nearest neigbors

A

Abnormalities
Email spam/not spam
Classifying credit card

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a training set?

A

In a training set each record contains a set of attributes, one of the attributes is the class.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a validation set?

A

Used to determine the accuracy of the model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is usually the case with the training and validation set?

A

Training set is used to build the model and validation set used to validate it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does a Nearest-Neighbor Classifier require?

A

Set of stored records
Distance metric to compute distance between records
value of K.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How is a unkown record classified?

A

Compute distance to other training records.
Identify K nearest neighbors
Use class labels of nearest neighbors to determine the class labels of uknown record.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does a small distance imply?

A

Discriminating attributes are equal and that also

implies that they probably are in the same class.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Choosing the value of K

A

If K is too small, it is sensitive to noise points.

If K is too large, the neighborhood may include points from other classes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How to find the weights?

A

With gradient decent.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How to K?

A

Try many different values of K and look for optimum.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is curse of dimensionality?

A

In high dimensions (many attributes) everything is far. There are no points nearby. Unless a large number of data is available.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly