5.2 Interpretation Flashcards

Question 1

Q

Consider the skin cancer screening test which is a binary classification problem. We record two binary features from 1000 patients, and we build some models with the following approaches. Which method gives the model with the largest size?

Answer

A

The model of Nearest Neighbour Classifiers is just the data set itself! For this problem with only two binary features and 1000 instance this model is the largest.

For 0-R the model is simply the label of the most frequent class (O(1)). The model in Naive Bayes is a set of prior probabilities and conditional probabilities (O(2+2*2)). In SVM model is the maximum-margin hyperplane, defined by w and b (O(2+1)).

Question 2

Q

Which of the following are hyperparameters of the associated approach (select all hyperparameters)?

Answer

A

Refer to the lecture note. Note that the

k for Nearest Neighbour,
the kernel in SVM
the stopping criterion for decision tree

act as hyperparameters, which are tuneable “knobs”.

Question 3

Q

Which statements are incorrect? (select all that are incorrect)

Answer

A

The model on 0-R is simply the label of the most frequent class which does not deliver any information about the features.

The weight of a zero feature does not play any role in the classification by SVM.

Question 4

Q

Which statements are incorrect? (select all that are incorrect)

Answer

A

PCA is an unsupervised approach.

Question 5

Q

What is a nearest prototype classifier?

Answer

A

Calculate the centroid of each class, and classify each test instance according to the class of the centroid it is nearest to

5.2 Interpretation Flashcards

(5 cards)