LEARNING Flashcards
provides a computer with data, rather than explicit instructions. Using these data, the computer learns to recognize patterns and becomes able to execute tasks on its own.
machine learning
a task where a computer learns a function that maps inputs to outputs based on a dataset of input-output pairs.
supervised learning
This is a task where the function maps an input to a discrete output.
classification
algorithm that, given an input, chooses the
class of the nearest data point to that input
nearest-neighbor classification
algorithm that, given an input, chooses the
most common class out of the k nearest
data points to that input
k-nearest-neighbor classification
drawback of k-nearest
using a naive approach, the algorithm will have to measure the distance of every single point to the point in question, which is computationally expensive
how to optimize k-nearest
by using data structures that enable finding neighbors more quickly or by pruning irrelevant observations.
drawback of perceptron learning
data are messy, and it is rare that one can draw a line and neatly divide the classes into two observations without any mistakes
perceptron learning
goal of the algorithm is to find the best weight vector, when the algorithm encounters new data it updates the current weights.
perceptron learning rule
for each data point, we adjust the weights to make our function more accurate.
sequences of numbers
vector
The weights and values in Perceptron Learning are represented using?
vectors
unable to express uncertainty, since it can only be equal to 0 or to 1.
hard treshold
uses a logistic function which is able to yield a real number between 0 and 1, expressing confidence in the estimate
soft treshold
they are designed to find the maximum margin separator
Support Vector Machines
A boundary that maximizes the distance between any of the data points
maximum margin separator
Benefit of Support Vector Machines
they can represent decision boundaries with more than two dimensions, as well as non-linear decision boundaries
this function gains value when the prediction isn’t correct and doesn’t gain value when it is correct
loss function
supervised learning task of learning a function mapping an input point to a continuous value
regression
functions that can be used when predicting a continuous value
L1 and L2 loss functions
L2 Loss Function Formula
(actual - predicted)^2
L1 Loss Function Formula
|actual - predicted|
L1 vs L2
L₂ penalizes outliers more harshly than L₁ because it squares the the difference