Support Vector Machines Flashcards
What is the length of the projection of x onto w if w is a unit vector?
wTx
What is the margin?
Distance between the decision boundry (hyperplane) and the closest training point
How is the modulus of a vector w written?
|| w ||
What is || w ||?
sqrt(wTw)
If the hyperplane is defined as wx + w0 = 0, what is the distance from the origin to the hyperplane?
b = - w0 / ||w||
What is the perpendicular distance from a point x to the hyperplane wTx + wo = 0?
(1 / ||w||) |wTx + w0|
What is the value of the margin under the constraint mini | wTxi + w0 | = 1
1 / ||w||
What is maximizing 1 / ||w|| the same as?
minimizing ||w||2
What is the SVN optimization problem?
min<strong>w</strong> ||w||2
such that yi(wTx + w0) >= 1 for all i
What are the 2 good properties of the optimal weight parameters (for SVM’s)?
- They are linear function of the input and class labels
- Solution is sparse (optimal hyperplane determined by just a few examples)
What are support vectors?
The few training examples that determine the hyperplane
What is the problem if the data is not linearly seperable for SVM’s?
The optimization problem has no solution
What can we add to solve the problem if the data is not linearly seperable (for SVM’s)?
Slack variables
What is the SVM optimization problem (with slack variables)?
Minimize:
What is k (power of slack variable) usually set to (SVM)?
1