Support Vector Machines Flashcards

Question 1

Q

What is the length of the projection of x onto w if w is a unit vector?

Question 2

Q

What is the margin?

Answer

A

Distance between the decision boundry (hyperplane) and the closest training point

Question 3

Q

How is the modulus of a vector w written?

Question 4

Q

What is || w ||?

Answer

A

sqrt(w^Tw)

Question 5

Q

If the hyperplane is defined as wx + w₀ = 0, what is the distance from the origin to the hyperplane?

Answer

A

b = - w₀ / ||w||

Question 6

Q

What is the perpendicular distance from a point x to the hyperplane w^Tx + w_o = 0?

Answer

A

(1 / ||w||) |w^Tx + w₀|

Question 7

Q

What is the value of the margin under the constraint min_i | w^Tx_i + w₀ | = 1

Answer

A

1 / ||w||

Question 8

Q

What is maximizing 1 / ||w|| the same as?

Answer

A

minimizing ||w||²

Question 9

Q

What is the SVN optimization problem?

Answer

A

min_{<strong>w</strong>} ||w||²

such that y_i(w^Tx + w₀) >= 1 for all i

Question 10

Q

What are the 2 good properties of the optimal weight parameters (for SVM’s)?

Answer

A

They are linear function of the input and class labels
Solution is sparse (optimal hyperplane determined by just a few examples)

Question 11

Q

What are support vectors?

Answer

A

The few training examples that determine the hyperplane

Question 12

Q

What is the problem if the data is not linearly seperable for SVM’s?

Answer

A

The optimization problem has no solution

Question 13

Q

What can we add to solve the problem if the data is not linearly seperable (for SVM’s)?

Answer

A

Slack variables

Question 14

Q

What is the SVM optimization problem (with slack variables)?

Answer

A

Minimize:

Question 15

Q

What is k (power of slack variable) usually set to (SVM)?

Question 16

Q

What is C in SVM optimization problem with slack variables?

Answer

Study These Flashcards

A

Trade-off parameter, how important are the slack variables

Question 17

Q

What does this measure (SVM’s)?

Answer

Study These Flashcards

A

How well we fit the data

Question 18

Q

Why does adding slack variables increase the number of support vectors?

Answer

Study These Flashcards

A

Every non-zero slack variable adds a support vector (because slack puts the point on the margin)

Question 19

Q

What is a kernal function?

Answer

Study These Flashcards

A

k(x_i, x_j) = phi(x_i)^Tphi(x_j)

Takes the feature vectors, transforms them into new feature space and takes the dot product

Question 20

Q

Whats special about kernal functions (compared to basis expansion)?

Answer

Study These Flashcards

A

Can work with infinite dimensions

Question 21

Q

What is the form of a polynomial kernal function?

Answer

Study These Flashcards

A

k(x_i, x_j) = (x_ix_j)^d

Question 22

Q

What is the form a gaussian radial basis function kernal function?

Answer

Study These Flashcards

A

k(x_i, x_j) = exp(-||x_i - x_j||² / c)

Question 23

Q

What does || w || mean?

Answer

Study These Flashcards

A

The magnitude of vector w

Question 24

Q

What do slack variables do (SVM)?

Answer

Study These Flashcards

A

Move misclassified points to the margin

Support Vector Machines Flashcards

(24 cards)