SVM Flashcards

Question 1

Q

What is the margin in the SVM algorithm

Answer

A

The margin is the distance from the line to the nearest point, and the SVM algorithm tries to maximize it. This also maximizes robustness of the algorithm.

SVM first considers classification errors and then maximizes the margin.

Question 2

Q

What is the Kernel Trick

Answer

A

The kernel trick consist in mapping features from a low dimensional input space where they are not linear separable, to a high dimenssional space. And then, having a better linear separation solution in the new high-dimenssional space, the boundary is mapped again to the original feature space.

Question 3

Q

What are the hiperparameters in SVM

Answer

A

The kernel, C and Gamma.

Question 4

Q

What is the impact of ‘gamma’ parameter in a SVM with linnear kernel?

Answer

A

The ‘gamma’ parameter actually has no effect on the ‘linear’ kernel for SVMs.

Question 5

Q

What does the ‘C’ parameter do in a SVM?

Answer

A

C parameter control the tradeoff between smooth decision boundary and classifying training points correctly

Question 6

Q

Does a large ‘C’ mean you expect a smooth boundary or that you will get more training points correct.

Answer

A

Having a bigger ‘C’ means haing more training points correct, and could lead to an overfitting in training data.

Question 7

Q

What does ‘gamma’ parameter do?

Answer

A

Gamma defines the influence of a point based on the distance from this point to de boundary.

Low values mean each point has a far reach and all are important to define the boundary.

High values mean that the boundary will only depend on the closest points.

Question 8

Q

With a high value of ‘gamma’ parameter in SVM do you expect the decision boundary to be smother or more jagged?

Answer

A

The decision boundary with a high value of ‘gamma’ would be more jagged, as the closest points to the boundary would have big weights and would be important to classify them correctly.

Question 9

Q

When is good to use SVMs and when is not? Why?

Answer

A

It’s good to use in complex domains where there is a clear margin of sepparation.

They don’t perform so well in large datasets because training time is cubic in the size of the datase, nor in datasets with lots of noise (very overlapping classes).

SVM Flashcards

(9 cards)