Basic Machine Learning Flashcards

1
Q

Algorithm

A

Step-by-step procedure designed to carry out a task.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Change detection

A

Identifying when a significant change has taken place in a process.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Classification

A

The separation of data into two or more categories, or (a point’s
classification) the category a data point is put into.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Classifier

A

A boundary that separates the data into two or more categories. Also
(more generally) an algorithm that performs classification.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Cluster

A

A group of points identified as near/similar to each other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Cluster center

A

In some clustering algorithms (like 𝑘𝑘-means clustering), the central
point (often the centroid) of a cluster of data points.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Clustering

A

Separation of data points into groups (“clusters”) based on
nearness/similarity to each other. A common form of unsupervised
learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

CUSUM

A

Change detection method that compares observed distribution mean
with a threshold level of change. Short for “cumulative sum”.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Deep learning

A

Neural network-type model with many hidden layers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Dimension

A

A feature of the data points (for example, height or credit score). (Note that there is also a mathematical definition for this word.)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

EM algorithm

A

Expectation-maximization algorithm.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Expectation-maximization

algorithm (EM algorithm)

A

General description of an algorithm with two steps (often iterated), one that finds the function for the expected likelihood of getting the response given current parameters, and one that finds new parameter
values to maximize that probability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Heuristic

A

Algorithm that is not guaranteed to find the absolute best (optimal) solution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

𝑘-means algorithm

A

Clustering algorithm that defines 𝑘 clusters of data points, each
corresponding to one of 𝑘 cluster centers selected by the algorithm.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

𝑘-Nearest-Neighbor (KNN)

A

Classification algorithm that defines a data point’s category as a function of the nearest 𝑘 data points to it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Kernel

A

A type of function that computes the similarity between two inputs; thanks to what’s (really!) sometimes known as the “kernel trick”, nonlinear classifiers can be found almost as easily as linear ones.

17
Q

Learning

A

Finding/discovering patterns (or rules) in data, often that can be applied to new data.

18
Q

Machine

A

Apparatus that can do something; in “machine learning”, it often refers to both an algorithm and the computer it’s run on. (Fun fact: before
computers were developed, the term “computers” referred to people who did calculations quickly in their heads or on paper!)

19
Q

Margin

A

For a single point, the distance between the point and the classification boundary; for a set of points, the minimum distance between a point in the set and the classification boundary. Also called the separation.

20
Q

Machine learning

A

Use of computer algorithms to learn and discover patterns or structure in data, without being programmed specifically for them.

21
Q

Misclassified

A

Put into the wrong category by a classifier.

22
Q

Neural network

A

A machine learning model that itself is modeled after the workings of neurons in the brain.

23
Q

Supervised learning

A

Machine learning where the “correct” answer is known for each data point in the training set.

24
Q

Support vector

A

In SVM models, the closest point to the classifier, among those in a category. (Note that there is a more-technical mathematical definition too.)

25
Q

Support vector machine (SVM)

A

Classification algorithm that uses a boundary to separate the data into two or more categories (“classes”).

26
Q

SVM

A

Support vector machine.

27
Q

Unsupervised learning

A

Machine learning where the “correct” answer is not known for the data points in the training set.

28
Q

Voronoi diagram

A

Graphical representation of splitting a plane with two or more special points into regions with one special point each, where each region’s points are closer to the region’s special point than to any other special
point.