Module 2 Flashcards
Nearest neighbour classifier
- assign an instance the class label of its nearest training instance
- non-parametric model
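A minimal sketch of 1-NN classification, assuming instances are NumPy feature vectors and Euclidean distance (both assumptions, not fixed by the card):

```python
import numpy as np

def nn_classify(x, X_train, y_train):
    # Distance from the query point to every training instance (Euclidean).
    distances = np.linalg.norm(X_train - x, axis=1)
    # Return the class label of the single nearest training instance.
    return y_train[np.argmin(distances)]
```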
One nearest neighbour cons
- sensitive to noise
- overfit training data
Increasing k will make the classifier
- have a smoother decision boundary (higher bias)
- less sensitive to training data (lower variance)
Weighted k-NN
- assign a weight to each neighbour (based on how close they are)
- sum the weights per class in the neighbourhood (assign to the class with the largest sum)
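A sketch of the weighted vote, assuming Euclidean distance and inverse-distance weights (one common weighting choice; the card does not fix it):

```python
import numpy as np
from collections import defaultdict

def weighted_knn_classify(x, X_train, y_train, k=5, eps=1e-12):
    distances = np.linalg.norm(X_train - x, axis=1)
    nearest = np.argsort(distances)[:k]        # indices of the k closest neighbours
    votes = defaultdict(float)
    for i in nearest:
        votes[y_train[i]] += 1.0 / (distances[i] + eps)   # weight = inverse distance
    # Assign to the class with the largest summed weight in the neighbourhood.
    return max(votes, key=votes.get)
```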
k-NN pros
- robust to noisy data
k-NN cons
- slow for large datasets
k-NN regression
Compute the mean value across k nearest neighbours
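A sketch under the same assumptions (NumPy arrays, Euclidean distance):

```python
import numpy as np

def knn_regress(x, X_train, y_train, k=5):
    distances = np.linalg.norm(X_train - x, axis=1)
    nearest = np.argsort(distances)[:k]
    # Predict the mean target value of the k nearest neighbours.
    return y_train[nearest].mean()
```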
Locally weighted regression
- distance-weighted k-NN for regression
- compute the weighted mean value across k nearest neighbours
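The distance-weighted variant, again assuming inverse-distance weights:

```python
import numpy as np

def weighted_knn_regress(x, X_train, y_train, k=5, eps=1e-12):
    distances = np.linalg.norm(X_train - x, axis=1)
    nearest = np.argsort(distances)[:k]
    weights = 1.0 / (distances[nearest] + eps)
    # Weighted mean: closer neighbours contribute more to the prediction.
    return np.dot(weights, y_train[nearest]) / weights.sum()
```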
Decision Tree learning
- search for an “optimal” splitting rule
- split your dataset
- repeat the first two steps on each newly created subset (see the sketch below)
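A minimal ID3-style sketch of this loop for categorical features, assuming the data is a list of feature dicts with a parallel list of labels (the representation and helper names are illustrative); it scores splits with entropy-based information gain, defined in the next cards:

```python
import math
from collections import Counter

def entropy(labels):
    counts = Counter(labels)
    return -sum(c / len(labels) * math.log2(c / len(labels)) for c in counts.values())

def information_gain(rows, labels, feature):
    # Weighted average entropy of the subsets produced by splitting on `feature`.
    subsets = {}
    for row, y in zip(rows, labels):
        subsets.setdefault(row[feature], []).append(y)
    avg = sum(len(s) / len(labels) * entropy(s) for s in subsets.values())
    return entropy(labels) - avg

def build_tree(rows, labels, features):
    # Leaf: the node is pure or no features remain; predict the majority class.
    if len(set(labels)) == 1 or not features:
        return Counter(labels).most_common(1)[0][0]
    # Step 1: search for the "optimal" splitting rule (largest information gain).
    best = max(features, key=lambda f: information_gain(rows, labels, f))
    # Step 2: split the dataset, one branch per value of the chosen feature.
    branches = {}
    for row, y in zip(rows, labels):
        branches.setdefault(row[best], ([], []))
        branches[row[best]][0].append(row)
        branches[row[best]][1].append(y)
    # Step 3: repeat steps 1 & 2 on each new subset.
    rest = [f for f in features if f != best]
    return {best: {v: build_tree(r, ys, rest) for v, (r, ys) in branches.items()}}
```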
Entropy
A measure of the uncertainty of a random variable
Information Gain
Difference between the initial entropy and the (weighted) average entropy of the produced subsets
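A small worked example with made-up counts: a parent node with 10 positives and 6 negatives, split into two subsets of 8 examples each:

```python
import math

def H(p):
    # Entropy of a class distribution given as a list of probabilities.
    return -sum(pi * math.log2(pi) for pi in p if pi > 0)

parent   = H([10/16, 6/16])                                  # initial entropy, ~0.95
children = (8/16) * H([7/8, 1/8]) + (8/16) * H([3/8, 5/8])   # weighted average entropy, ~0.75
gain     = parent - children                                  # information gain of this split, ~0.20
```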
Ordered values
- for each feature sort its values
- consider only split points that are between two examples with different class labels
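A sketch of generating candidate thresholds for one numeric feature: sort the examples by value and keep only midpoints between consecutive examples whose class labels differ (the data layout is an assumption):

```python
def candidate_thresholds(values, labels):
    # Sort examples by feature value, keeping labels aligned.
    pairs = sorted(zip(values, labels))
    thresholds = []
    for (v1, y1), (v2, y2) in zip(pairs, pairs[1:]):
        # Only boundaries between differently-labelled examples can improve purity.
        if y1 != y2 and v1 != v2:
            thresholds.append((v1 + v2) / 2)
    return thresholds
```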
Categorical/Symbolic values
- find the most informative feature
- create as many branches as there are different values for this feature
Pruning
- consider every node whose children are all leaf nodes
- turn each into a leaf node, keeping the change only if it does not hurt accuracy (e.g. on a validation set)
- repeat until all such nodes have been tested
Random forests
- use many decision trees
- each tree is grown on a random sample of the training set & considers a random subset of features
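A sketch using scikit-learn's implementation, which follows the same recipe: each tree is fit on a bootstrap sample of the training set and considers a random subset of features at each split (the parameter values below are illustrative):

```python
from sklearn.ensemble import RandomForestClassifier

forest = RandomForestClassifier(
    n_estimators=100,      # number of decision trees in the ensemble
    max_features="sqrt",   # random subset of features considered at each split
    bootstrap=True,        # each tree sees a random sample (with replacement) of the training set
)
# forest.fit(X_train, y_train); forest.predict(X_test)
```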