Basic Machine Learning Flashcards
Algorithm
Step-by-step procedure designed to carry out a task.
Change detection
Identifying when a significant change has taken place in a process.
Classification
The separation of data into two or more categories, or (a point’s
classification) the category a data point is put into.
Classifier
A boundary that separates the data into two or more categories. Also
(more generally) an algorithm that performs classification.
Cluster
A group of points identified as near/similar to each other.
Cluster center
In some clustering algorithms (like 𝑘𝑘-means clustering), the central
point (often the centroid) of a cluster of data points.
Clustering
Separation of data points into groups (“clusters”) based on
nearness/similarity to each other. A common form of unsupervised
learning
CUSUM
Change detection method that compares observed distribution mean
with a threshold level of change. Short for “cumulative sum”.
Deep learning
Neural network-type model with many hidden layers.
Dimension
A feature of the data points (for example, height or credit score). (Note that there is also a mathematical definition for this word.)
EM algorithm
Expectation-maximization algorithm.
Expectation-maximization
algorithm (EM algorithm)
General description of an algorithm with two steps (often iterated), one that finds the function for the expected likelihood of getting the response given current parameters, and one that finds new parameter
values to maximize that probability.
Heuristic
Algorithm that is not guaranteed to find the absolute best (optimal) solution.
𝑘-means algorithm
Clustering algorithm that defines 𝑘 clusters of data points, each
corresponding to one of 𝑘 cluster centers selected by the algorithm.
𝑘-Nearest-Neighbor (KNN)
Classification algorithm that defines a data point’s category as a function of the nearest 𝑘 data points to it.