SVM Flashcards
What is the general idea of SVM? What does the distance from the support vector mean?
To create a support vector to act as a barrier, separating the data into +1 or -1 regions. The distance from the support vector corresponds to the confidence of the prediction.
What is the definition of the margin in SVM?
The distance to the closest data point from the decision barrier.
How can a dataset that’s not linearly separable be handled with SVM?
We can introduce penalty to the weight function we wish to minimize.
How do we maximize the margin in SVM?
We minimize the weights so min(1/2)(W^2) or min(1/2)(W^2+C) if penalty.
How do we handle multi-class classification with SVM?
We can do this in 2 ways:
1) Learn 3 classifiers + vs. {o, -} - vs. {o, +} o vs. {+, -}
2) Learn 3 sets of weights simultaneously