Classification Flashcards

1
Q

Describe the steps of image classification

A

1) Feature extraction
2) Feature description
3) Classification
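
A minimal sketch of the three steps using scikit-learn (the digits dataset, scaling as the description step, and an SVM classifier are illustrative choices, not part of the card):

```python
from sklearn.datasets import load_digits
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# 1) Feature extraction: here simply the raw 8x8 pixel intensities
X, y = load_digits(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# 2) Feature description: scale the features; 3) Classification: an SVM
clf = make_pipeline(StandardScaler(), SVC())
clf.fit(X_train, y_train)
print(clf.score(X_test, y_test))
```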

2
Q

How can we classify images without labels?

A

Use unsupervised clustering techniques
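
For example, K-means (a sketch with scikit-learn; the cluster count and dataset are assumptions):

```python
from sklearn.cluster import KMeans
from sklearn.datasets import load_digits

X, _ = load_digits(return_X_y=True)   # pretend the labels are unavailable

# Group the images into 10 clusters; cluster ids act as surrogate classes
kmeans = KMeans(n_clusters=10, n_init=10, random_state=0).fit(X)
print(kmeans.labels_[:20])
```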

3
Q

What do we mean by semi-supervised learning?

A

We have labels for only part of the dataset

4
Q

What is MNIST?

A

A dataset of 70,000 28x28-pixel grayscale images of handwritten digits (60,000 for training, 10,000 for testing)
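
A quick way to load it (using scikit-learn's OpenML mirror; the loader choice is an assumption, not part of the card):

```python
from sklearn.datasets import fetch_openml

# Downloads MNIST on first call: 70,000 samples with 784 = 28*28 pixel features
X, y = fetch_openml("mnist_784", version=1, return_X_y=True, as_frame=False)
print(X.shape, y.shape)   # (70000, 784) (70000,)
```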

5
Q

Describe the typical pre-processing for digit recognition in general images

A
  1. Detect the digits in the large image
  2. Normalize the size of the digit, for example to 28x28 pixels
  3. Normalize the location, e.g. by placing the mass center in the middle (see the sketch below)
  4. Correct the slant so the orientation is canonical
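
A sketch of step 3 (mass-center normalization) using SciPy; the function name and the use of scipy.ndimage are illustrative assumptions:

```python
import numpy as np
from scipy import ndimage

def center_digit(img):
    """Shift a grayscale digit image so its mass center sits at the image center."""
    cy, cx = ndimage.center_of_mass(img)
    h, w = img.shape
    return ndimage.shift(img, (h / 2 - cy, w / 2 - cx))
```
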
6
Q

Describe the K-Nearest Neighbour algorithm

A

Classify by taking a majority vote among the K nearest neighbours
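
A minimal NumPy sketch of the vote (L2 distance and k=5 are arbitrary choices):

```python
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x, k=5):
    """Classify x by majority vote among its k nearest training points (L2 distance)."""
    dists = np.linalg.norm(X_train - x, axis=1)
    nearest = np.argsort(dists)[:k]
    return Counter(y_train[nearest]).most_common(1)[0][0]
```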

7
Q

Name some distance measurements

A

L2 (Euclidean), L1 (Manhattan), …
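
Both distances in a few lines of NumPy (the example vectors are made up):

```python
import numpy as np

a, b = np.array([1.0, 2.0]), np.array([4.0, 6.0])
l2 = np.linalg.norm(a - b)   # Euclidean: sqrt of summed squared differences -> 5.0
l1 = np.abs(a - b).sum()     # Manhattan: sum of absolute differences -> 7.0
```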

8
Q

Name some advantages and disadvantages of K-Nearest Neighbour

A
  Advantages:
  1. It works reasonably well
  2. No training required
  3. Nonlinear decision boundaries
  4. Naturally handles multi-class problems
  Disadvantages:
  1. All training data must be stored in memory
  2. Long evaluation time

9
Q

Why shouldn’t we use the test set to tune hyperparameters? What should we do instead?

A

Tuning hyperparameters on the test set leaks information about it, so the model overfits the test set and the reported performance is too optimistic. Use a dedicated validation set or cross-validation instead.
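
A sketch of the proper workflow with scikit-learn (dataset, classifier, and the range of k are illustrative assumptions):

```python
from sklearn.datasets import load_digits
from sklearn.model_selection import cross_val_score, train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = load_digits(return_X_y=True)
X_trainval, X_test, y_trainval, y_test = train_test_split(X, y, random_state=0)

# Tune k with cross-validation on the training data only...
best_k = max(
    range(1, 10),
    key=lambda k: cross_val_score(
        KNeighborsClassifier(n_neighbors=k), X_trainval, y_trainval, cv=5
    ).mean(),
)

# ...and touch the test set exactly once, at the very end
final = KNeighborsClassifier(n_neighbors=best_k).fit(X_trainval, y_trainval)
print(best_k, final.score(X_test, y_test))
```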

10
Q

What’s the advantage and disadvantage of using cross-validation vs. a dedicated validation set?

A

Cross-validation makes more data available for training but is computationally more expensive.

11
Q

What’s the formula of a general linear classifier?

A

y = w · x + b, where w is the weight vector, x the feature vector, and b the bias

12
Q
How can we use the formula of a linear classifier,
w · x + b = c, to assign a class to the data?
A

In class (+1) if c > 0, otherwise not in class (-1)
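
The decision rule in a few lines of NumPy (the weights and the test point are made-up numbers):

```python
import numpy as np

def linear_classify(w, b, x):
    """+1 if x lies on the positive side of the hyperplane w . x + b = 0, else -1."""
    c = np.dot(w, x) + b
    return 1 if c > 0 else -1

print(linear_classify(np.array([2.0, -1.0]), 0.5, np.array([1.0, 1.0])))  # c = 1.5 -> +1
```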

13
Q

What is considered the best hyperplane for SVMs?

A

The hyperplane that maximizes the margin, i.e. the combined distance from the closest points of both classes to the hyperplane.

14
Q

What points do we need to determine the hyperplane in an SVM?

A

Only the points closest to the hyperplane (the support vectors)
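
scikit-learn exposes exactly these points; a small sketch (the blob data is an arbitrary stand-in):

```python
from sklearn.datasets import make_blobs
from sklearn.svm import SVC

X, y = make_blobs(n_samples=100, centers=2, random_state=0)
svm = SVC(kernel="linear").fit(X, y)

# Only these points determine the separating hyperplane
print(svm.support_vectors_.shape)
```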

15
Q

Derive the formula for making the margin equal to 2 in an SVM.

A

Let x1 be a point on the negative margin boundary and n = w/||w|| the unit normal of the hyperplane, so that x1 + m·n lies on the positive boundary:

w · x1 + b = -1
w · (x1 + m·n) + b = 1

Subtracting the first equation from the second gives m (w · n) = 2. Since w · n = w · w/||w|| = ||w||, we get m ||w|| = 2, i.e. the margin is m = 2/||w||.

16
Q

How can we deal with outliers in SVMs?

A

We can introduce slack variables that allow some points to violate the margin or be misclassified. This lets the margin grow wider.
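
In scikit-learn's SVC this trade-off is exposed via the C parameter (a sketch; the concrete values are arbitrary):

```python
from sklearn.svm import SVC

# Smaller C -> weaker penalty on slack -> more violations tolerated, wider margin
soft = SVC(kernel="linear", C=0.1)
hard = SVC(kernel="linear", C=1000.0)   # large C approximates a hard margin
```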

17
Q

Why can we always solve the minimization problem of the SVM, assuming that the classes are linearly separable?

A

The loss is convex, so any local minimum is the global minimum

18
Q

How can we solve the minimization problem of the SVM loss in practice?

A
  1. Gradient descent using the subgradient (see the sketch below)
  2. Lagrangian duality
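
A minimal sketch of option 1, subgradient descent on the regularized hinge loss (step size, regularization strength, and epoch count are arbitrary assumptions):

```python
import numpy as np

def svm_subgradient_descent(X, y, lam=0.01, lr=0.01, epochs=100):
    """Minimize (lam/2)*||w||^2 + hinge loss by subgradient steps; y in {-1, +1}."""
    w, b = np.zeros(X.shape[1]), 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            if yi * (w @ xi + b) < 1:      # margin violated: hinge term is active
                w -= lr * (lam * w - yi * xi)
                b += lr * yi
            else:                          # only the regularizer contributes
                w -= lr * lam * w
    return w, b
```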

19
Q

Why do we use a kernel in SVMs?

A

Transforming the data to a higher-dimensional space might make it linearly separable; the kernel computes inner products in that space without forming the transformation explicitly (the kernel trick).

20
Q

Name some SVM kernels

A

Linear, polynomial, Gaussian (RBF).
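
In scikit-learn these correspond directly to the kernel argument of SVC (a sketch; the hyperparameter values are arbitrary):

```python
from sklearn.svm import SVC

linear = SVC(kernel="linear")
poly   = SVC(kernel="poly", degree=3)        # polynomial kernel
rbf    = SVC(kernel="rbf", gamma="scale")    # Gaussian (RBF) kernel
```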

21
Q

Name some strategies for multi-class classification

A

One-vs-rest

One-vs-one with majority voting
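
Both strategies have ready-made wrappers in scikit-learn (a sketch; the base classifier and dataset are illustrative assumptions):

```python
from sklearn.datasets import load_digits
from sklearn.multiclass import OneVsOneClassifier, OneVsRestClassifier
from sklearn.svm import LinearSVC

X, y = load_digits(return_X_y=True)
ovr = OneVsRestClassifier(LinearSVC(max_iter=10000)).fit(X, y)  # one binary classifier per class
ovo = OneVsOneClassifier(LinearSVC(max_iter=10000)).fit(X, y)   # one per class pair, majority vote
```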

22
Q

How large is the slack variable in an SVM when a point is on the wrong side of the separating hyperplane?

A

Slack > 1 (a point exactly on the hyperplane has slack = 1)