SVM Flashcards

1
Q

What is the margin in the SVM algorithm

A

The margin is the distance from the line to the nearest point, and the SVM algorithm tries to maximize it. This also maximizes robustness of the algorithm.

SVM first considers classification errors and then maximizes the margin.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the Kernel Trick

A

The kernel trick consist in mapping features from a low dimensional input space where they are not linear separable, to a high dimenssional space. And then, having a better linear separation solution in the new high-dimenssional space, the boundary is mapped again to the original feature space.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the hiperparameters in SVM

A

The kernel, C and Gamma.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the impact of ‘gamma’ parameter in a SVM with linnear kernel?

A

The ‘gamma’ parameter actually has no effect on the ‘linear’ kernel for SVMs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does the ‘C’ parameter do in a SVM?

A

C parameter control the tradeoff between smooth decision boundary and classifying training points correctly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Does a large ‘C’ mean you expect a smooth boundary or that you will get more training points correct.

A

Having a bigger ‘C’ means haing more training points correct, and could lead to an overfitting in training data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does ‘gamma’ parameter do?

A

Gamma defines the influence of a point based on the distance from this point to de boundary.

Low values mean each point has a far reach and all are important to define the boundary.

High values mean that the boundary will only depend on the closest points.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

With a high value of ‘gamma’ parameter in SVM do you expect the decision boundary to be smother or more jagged?

A

The decision boundary with a high value of ‘gamma’ would be more jagged, as the closest points to the boundary would have big weights and would be important to classify them correctly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

When is good to use SVMs and when is not? Why?

A

It’s good to use in complex domains where there is a clear margin of sepparation.

They don’t perform so well in large datasets because training time is cubic in the size of the datase, nor in datasets with lots of noise (very overlapping classes).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly