Test 1 W1-4 Flashcards
Tuesday, October 2
What is Deep Learning?
A set of machine learning algorithms that model high-level abstractions in data by using model architectures (often neural networks)
What is Machine Learning?
Let machines learn from data instead of writing programs from hand.
Provide examples that specify correct outputs for a given input and a machine learning algorithm takes the data and produces a program/ model that does the job (if done right, the program works for new cases as well, can change the program by training it on new data)
Given examples of inputs and corresponding desired outputs, predict outputs on future inputs
Supervised Learning
Given only inputs, automatically discover representations, features, structures, etc
Unsupervised Learning
Given sequences of inputs, actions from a fixed set, and scalar rewards/ punishments, learn to select sequences in a way that maximizes expected reward
Reinforcement Learning
Outputs are categorical; inputs are anything. Goal is to select correct class for new inputs.
Classification (Supervised Learning)
Outputs are continuous; inputs are anything (but usually continuous values). Goal is to predict outputs accurately for new inputs.
Regression (Supervised Learning)
-?- is used to scale the range of a feature
Feature Normalization
(need to normalize features so that the distance between two points is not governed by one particular feature with a broad range of value)
Many classifiers calculate the distance between two points by -?-
The Euclidean distance
-?- is often used as the method to find optimal solution, which converges much faster with feature scaling than without it
gradient descent
What are the three phases of classification tasks.
Data is split accordingly to be
- Training set
- Validation (aha development or held-out) data set (used for model selection/ averaging)
- Test Set
Ration ofter 60:20:20
What is a solution for when data is limited?
n-fold cross validation
What is supervised learning?
Given examples of inputs and corresponding desired outputs, the system predicts outputs on future inputs.
List 3 examples of supervised learning.
Classification, Regression, Time Series Prediction
What is unsupervised learning?
Given only inputs, automatically discover representations, features, structure, etc.
What is classification?
Outputs are categorical, inputs are anything. Goal is to select correct class for new inputs.
What is Regression?
Outputs are continuous, inputs are anything (but usually continuous. Goal is to predict outputs accurately for new inputs.
What is an example of unsupervised learning?
Clustering.
What type of learning is KNN?
Supervised learning.
In KNN what if k is too small?
Over fitting occurs. A learning algorithm corresponds too closely to a particular set of data, and may therefore fail to fit additional data or predict future observations reliably. (Overfit to noise)
In KNN what if k is too big?
Underfitting occurs.
Think about if k is equal to the total number of training data points, KNN does not learn much from the data it just becomes a majority classifier
What is leave-one-out validation?
When data scarce, may be appropriate to consider making the number of folds (cross validation) to be the number of data points
What is the K-Nearest Neighbot (KNN) ‘algorithm’?
A generative nonparametric classification model
Training: store all data in some way
Given a test point x, find a sphere around x enclosing k points
Classify x according to the majority of the k neighbors
What is the nearest neighbor classifier?
in KNN of K=1