Support Vector Machines Flashcards
Define a linear classifier margin
the width by which the boundary could be increased before touching a datapoint
What is the margin of the Linear SVM
The maximum margin: the linear SVM chooses the separating hyperplane with the largest possible margin
What are support vectors
The datapoints that the margin pushes up against
Advantages of Maximum Margin
- if the boundary is slightly misplaced, this gives us the least chance of misclassification
- Empirically, this works very well
- the model is immune to the removal of any non-support-vector data points
The hyperplane w ⋅ x + b = 0 is fully determined by ?
(w, b)
w = Weight Vector, b = bias term
w is perpendicular to the plus and minus planes; how does this help us calculate the margin width?
A point x- on the minus plane reaches the closest point x+ on the plus plane by moving along w, i.e. by some multiple λ of w:
w ⋅ x+ + b = +1
w ⋅ x- + b = -1
x+ = x- + λw
Subtracting the two plane equations gives w ⋅ (x+ - x-) = 2, so λ||w||² = 2 and λ = 2 / ||w||²
The margin is the distance between the planes:
M = |x+ - x-| = λ||w|| = 2 / ||w||
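As a quick numerical check of M = 2 / ||w||, here is a minimal scikit-learn sketch; the toy data and the very large C (used to approximate a hard margin) are assumptions for illustration.

```python
import numpy as np
from sklearn.svm import SVC

# Two small, linearly separable clusters (toy data for illustration)
X = np.array([[1.0, 1.0], [2.0, 1.5], [1.5, 2.0],
              [4.0, 4.0], [5.0, 4.5], [4.5, 5.0]])
y = np.array([-1, -1, -1, 1, 1, 1])

# Hard margin approximated by a linear SVM with a very large C
clf = SVC(kernel="linear", C=1e6).fit(X, y)

w = clf.coef_[0]           # weight vector w
b = clf.intercept_[0]      # bias term b
print("margin width M =", 2.0 / np.linalg.norm(w))
print("support vectors:\n", clf.support_vectors_)
```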
Problems with Maximum Margin
- The solution can change drastically if there is an outlier
- no solution if the classes are not linearly separable
What is the general idea of the Soft Margin SVM
- “Relax” the formulation to allow points to be on the “wrong” side.
- Penalize points according to how far they are on the wrong side
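In scikit-learn the soft-margin penalty is the C parameter of SVC: a small C tolerates points on the wrong side, a large C punishes them heavily. A minimal sketch, with the toy data (including one mislabelled outlier) assumed for illustration:

```python
import numpy as np
from sklearn.svm import SVC

# Two clusters plus one +1 point sitting inside the -1 cluster (an outlier)
X = np.array([[1.0, 1.0], [2.0, 1.0], [1.0, 2.0],
              [4.0, 4.0], [5.0, 4.0], [4.0, 5.0],
              [1.5, 1.5]])
y = np.array([-1, -1, -1, 1, 1, 1, 1])

for C in (0.1, 1000.0):
    clf = SVC(kernel="linear", C=C).fit(X, y)
    margin = 2.0 / np.linalg.norm(clf.coef_[0])
    print(f"C={C}: margin width {margin:.3f}, {len(clf.support_)} support vectors")
```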
How well does Soft Margin SVM do on unseen data?
- Depends on the training error and the number of support vectors
- When the number of support vectors is small, the generalization error is unlikely to be much higher than the training error
What is a vector
an object that has both a magnitude and a direction
what is a vector’s norm
the magnitude, or length, of a vector.
Denoted ||x||
What is a vector norm and how is it computed
It is the magnitude, or length, of a vector.
Calculated using the Euclidean norm formula:
||x|| = √(x1² + x2² + … + xn²), the square root of the sum of the squared components
Give the vector that denotes the direction of a vector
For a 2-D vector x, its direction is x / ||x|| = (cos(θ), cos(α)), where θ and α are the angles x makes with the two coordinate axes
Dot product of n-dimensional vectors (formula)
x ⋅ y = SUM(i=1, n) xi yi
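A tiny numpy illustration of the norm, direction, and dot-product definitions above (the example vectors are arbitrary):

```python
import numpy as np

x = np.array([3.0, 4.0])
y = np.array([1.0, 2.0])

# Euclidean norm: square root of the sum of the squared components
print(np.linalg.norm(x))        # 5.0, since sqrt(3^2 + 4^2) = 5

# Direction of x: the unit vector of direction cosines
print(x / np.linalg.norm(x))    # [0.6, 0.8]

# Dot product: sum over i of x_i * y_i
print(np.dot(x, y))             # 3*1 + 4*2 = 11
```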
What separates data in a) one dimension b) two dimensions c) three dimensions
a) a point
b) a line
c) a plane
Describe the kernel trick used in SVM and what is the mathematical reasoning that makes it work?
Implicitly applying a transformation φ to all data points before running the algorithm, so that non-linear patterns become linearly separable; the algorithm only ever needs dot products φ(xi) ⋅ φ(xj), which the kernel computes directly
A function K(⋅, ⋅) is a kernel if there exists a function φ(⋅) s.t.
K(xi, xj) = φ(xi) ⋅ φ(xj)
Why is the kernel trick important?
We get the advantage of using a large set of non-linear features without paying the computational price of computing them explicitly
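A concrete way to see this: for the degree-2 polynomial kernel K(x, y) = (x ⋅ y)², an explicit feature map φ exists (the 3-D map sketched below is a standard textbook choice, used here only for illustration), and the kernel gives the same value without ever forming φ.

```python
import numpy as np

def phi(v):
    """Explicit degree-2 feature map for a 2-D vector (no bias term)."""
    return np.array([v[0] ** 2, np.sqrt(2) * v[0] * v[1], v[1] ** 2])

def poly_kernel(a, b):
    """Degree-2 polynomial kernel K(a, b) = (a . b)^2."""
    return np.dot(a, b) ** 2

x = np.array([1.0, 2.0])
y = np.array([3.0, 4.0])

print(poly_kernel(x, y))        # (1*3 + 2*4)^2 = 121.0, computed in 2-D
print(np.dot(phi(x), phi(y)))   # same 121.0, but via the explicit 3-D features
```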
What do SVMs learn and what approach is this called
SVMs learn the discrimination boundary
This is called a discriminative approach
What are the 3 key ideas of SVMs
- Use optimisation to find a solution with few errors
- Seek a large-margin separator to improve generalisation
- Use the kernel trick to make large feature spaces computationally tractable
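Putting the three ideas together, a minimal scikit-learn sketch (the synthetic circles data and the hyperparameter values are assumptions for illustration): a soft-margin SVM with an RBF kernel separates classes that no linear boundary could.

```python
from sklearn.datasets import make_circles
from sklearn.svm import SVC

# Concentric circles: not linearly separable in the original 2-D space
X, y = make_circles(n_samples=200, factor=0.3, noise=0.05, random_state=0)

# Soft margin (C) + kernel trick (RBF) in one estimator
clf = SVC(kernel="rbf", C=1.0, gamma="scale").fit(X, y)
print("training accuracy:", clf.score(X, y))
print("number of support vectors:", len(clf.support_))
```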