Lecture 6 Flashcards

1
Q

What causes irregular boundaries?

A

Irregular distribution, imbalanced training sizes, and outliers

2
Q

What causes misclassifications?

A

Unoptimized decision boundaries

3
Q

Support Vectors

A

The subset of training vectors that support, i.e. determine, the decision boundary

4
Q

What is the goal of support vector machines?

A

to learn a boundary that leads to the largest margin (buffer) from points on both sides

5
Q

What points of the data set have an influence on the decision boundary (when using an SVM)?

A

Only the support vectors. Any point that isn’t a support vector has no influence and can be moved or removed without any effect on the decision boundary.

6
Q

What data points do SVMs use to compute predictions?

A

Only the support vectors, not the whole training set

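As the cards above note, only the support vectors matter for both the boundary and predictions. A minimal scikit-learn sketch (toy data assumed, not from the lecture) makes this inspectable:

```python
# Sketch (assumed toy data): fit a linear SVM with scikit-learn and inspect
# the support vectors -- the only points that determine the boundary.
import numpy as np
from sklearn.svm import SVC

X = np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 0.0],
              [3.0, 3.0], [3.0, 4.0], [4.0, 3.0]])
y = np.array([0, 0, 0, 1, 1, 1])

clf = SVC(kernel="linear", C=1e6)  # very large C approximates a hard margin
clf.fit(X, y)

print(clf.support_vectors_)       # the subset of X that fixes the boundary
print(len(clf.support_vectors_))  # fewer than len(X): the rest have no influence
```

Deleting any non-support point and refitting would leave `clf.coef_` and `clf.intercept_` unchanged.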
7
Q

What dimension is a decision boundary for a dataset with 2 features?

A

The decision function over the 2 features forms a 2D plane in a third dimension; the decision boundary itself is the 1D line where that plane crosses zero

8
Q

Where is the decision function equal to zero?

A

On the decision boundary

9
Q

When are inputs labeled as ‘undefined’?

A

When they lie between the margins, i.e. where the decision function is between -1 and 1

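The two cards above can be checked numerically. A sketch with an assumed 1-D toy set, fit with scikit-learn:

```python
# Sketch (assumed 1-D toy data): the decision function is ~0 on the boundary
# and ~+/-1 on the margins, where the support vectors sit.
import numpy as np
from sklearn.svm import SVC

X = np.array([[-2.0], [-1.0], [1.0], [2.0]])
y = np.array([0, 0, 1, 1])

clf = SVC(kernel="linear", C=1e6).fit(X, y)  # large C ~ hard margin

print(clf.decision_function([[0.0]]))          # ~0: on the decision boundary
print(clf.decision_function([[-1.0], [1.0]]))  # ~-1 and ~+1: on the margins
```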
10
Q

What is the goal of decision functions for SVMs?

A

to maximize the margin between the data points and the hyperplane

11
Q

How can optimal values of w and b be found?

A

Through optimization, e.g. projected gradient descent on the constrained objective

12
Q

What do your graph and results look like when parameters w and b are optimized?

A

The algorithm correctly classifies the training examples and the margin is maximized

13
Q

In a margin-based classifier, what happens to the margin when the weight vector w gets smaller?

A

The margin gets larger

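The inverse relationship in the card above comes from the width of the margin: for margin planes at w^T x + b = +1 and -1, the width (a standard result, written out here for reference) is

```latex
\text{margin width} = \frac{2}{\lVert w \rVert}
```

so shrinking ‖w‖ widens the margin.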
14
Q

In hard-margin SVMs, do we minimize ‘1/2 ‖w‖^2’ or ‘‖w‖’, and why?

A

1/2 ‖w‖^2, because ‖w‖ is not differentiable at w = 0 (and both objectives have the same minimizer)

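Written out (the standard formulation, not quoted from the lecture), the hard-margin problem the card refers to is:

```latex
\min_{w,\;b} \; \tfrac{1}{2}\lVert w \rVert^2
\quad \text{subject to} \quad
y_i \,\bigl(w^\top x_i + b\bigr) \ge 1 \;\; \text{for all } i
```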
15
Q

What do you do when the data is not linearly separable?

A

Introduce slack variables and allow “error” in classification

16
Q

What does the data have to look like for you to not be able to use a hard margin SVM?

A

When a boundary can’t cleanly split the data without leaving, for example, a blue point on the red side. In other words, when blue points separate a red point from the other red points.

17
Q

What are the two conflicting elements of soft margin SVMs?

A
  1. making the slack variables as small as possible to reduce the margin violations and
  2. making w^T * w as small as possible to increase the margin
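
In scikit-learn this trade-off is exposed as the `C` parameter. A sketch with assumed overlapping toy data (not from the lecture):

```python
# Sketch (assumed overlapping toy data): C balances slack (margin violations)
# against margin width. Small C -> larger margin (smaller ||w||); large C ->
# fewer violations but a narrower margin (larger ||w||).
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 1, (20, 2)), rng.normal(2, 1, (20, 2))])
y = np.array([0] * 20 + [1] * 20)  # overlapping classes

loose = SVC(kernel="linear", C=0.01).fit(X, y)   # tolerates violations
tight = SVC(kernel="linear", C=100.0).fit(X, y)  # penalizes slack heavily

# margin width is 2 / ||w||, so the smaller norm means the larger margin
print(np.linalg.norm(loose.coef_), np.linalg.norm(tight.coef_))
```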
18
Q

What does a kernel function map?

A

It maps low-dimensional data into a higher-dimensional space, where the classes may become linearly separable

19
Q

What type of function is the kernel function?

A

a similarity function

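For the RBF case, the similarity reading is easy to verify from the standard formula k(x, z) = exp(-gamma * ‖x - z‖²); a sketch:

```python
# Sketch: the RBF kernel as a similarity score -- 1 for identical points,
# decaying toward 0 as points move apart.
import numpy as np

def rbf_kernel(x, z, gamma=1.0):
    return np.exp(-gamma * np.sum((x - z) ** 2))

a = np.array([0.0, 0.0])
b = np.array([0.1, 0.0])
c = np.array([5.0, 5.0])

print(rbf_kernel(a, a))  # 1.0: identical points, maximal similarity
print(rbf_kernel(a, b))  # close to 1: nearby points
print(rbf_kernel(a, c))  # near 0: distant points
```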
20
Q

What’s the relationship between the weight vector w and the margin?

A

The smaller the weight vector w, the larger the margin

21
Q

When using the RBF kernel in an SVM, what does a high gamma value signify?

A

The model would consider only the points close to the hyperplane for modeling.

22
Q

Gamma parameter

A

The gamma parameter in SVM tuning controls the influence of points either near or far away from the decision boundary. With a low gamma, even far-away points have influence, so the model is too constrained and smooth and fails to capture the shape of the data. With a higher gamma, influence is local and the model captures the shape of the dataset more closely; too high a gamma overfits the training points.

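A sketch of that effect on assumed non-linearly-separable toy data, using scikit-learn's RBF `SVC` (training accuracy only, so the high-gamma "good fit" here is really overfitting):

```python
# Sketch (assumed XOR-like toy data): low gamma -> smooth, constrained model
# that misses the shape; high gamma -> very local influence that fits the
# training points almost individually.
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(1)
X = rng.uniform(-1, 1, (60, 2))
y = (X[:, 0] * X[:, 1] > 0).astype(int)  # XOR-like, not linearly separable

scores = {}
for gamma in (0.01, 1.0, 100.0):
    clf = SVC(kernel="rbf", gamma=gamma).fit(X, y)
    scores[gamma] = clf.score(X, y)  # training accuracy

print(scores)  # lowest at the smallest gamma on this data
```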
23
Q

Disadvantages of SVMs

A

It is not suited to large datasets, as training time with SVMs can be high

It is less effective on noisier datasets with overlapping classes

It was originally designed as a 2-class classifier