Lecture 6 Flashcards
What causes irregular boundries?
Irregular distribution, imbalanced training sizes, and outliers
What causes misclassifications?
unoptimized decision boundries
Support Vectors
a subset of vectors that support or determine the boundry
What is the goal of support vector machines?
to learn a boundary that leads to the largest margin (buffer) from points on both sides
What points of the data set have an influence on the decision boundry (when using an svm)?
Only the support vectors. any point that isn’t a support vector has no influence and can be messed with while having no affect on the decision boundary.
What data points do SVMs use to compute predictions?
Only the support vectors, not the whole training set
What dimension is a decision boundary for a dataset with 2 features?
It is the third dimension and creates a 2D plane
Where is the decision function equal to zero?
On the decision boundary
When are inputs labeled as ‘undefined’?
When they lie between the margins i.e. between -1 and 1
What is the goal of decision functions for SVMs?
to maximize the margin between the data points and the hyperplane
How can optimal values of w and b be found?
through optimization via projective gradient descent
What do your graph and results look like when parameters w and b are optimized?
The algorithm correctly classifies the training examples and the margin is maximized
In a margin-based classifier, what happens to the margin when the weight vector w gets smaller?
The margin gets larger
In hard margin SVMs, do we minimize ‘1/2 abs(w)^2’ or ‘abs(w)’ and why?
1/2 abs(w)^2 because abs(w) is not differentiable at w = 0
What do you do when the data is not linearly separable?
Introduce slack variables and allow “error” in classification