Basic Concepts in Machine Learning Flashcards

1
Q

Learning

A

Learning is the acquisition of new information or knowledge, or the process of acquiring knowledge or a skill by systematic study or by trial and error

2
Q

What is Machine Learning?

A

Machine learning is "the field of study that gives computers the ability to learn without being explicitly programmed" (Arthur Samuel)

3
Q

A machine learning system consists of the following four components:

A
  • Dataset S: a set of samples generated by some system or process; the
    samples can be single data points or pairs of input and output values
  • Model M: an adjustable and compact representation of a certain class of
    input/output relationships that is hypothesized to be capable of modeling the
    system or process that generates S
  • Objective Function L: a function that encodes the current performance of M
    (e.g. loss or reward)
  • Algorithm A: the learning algorithm that adjusts M based on S and L
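The four components can be illustrated with a minimal sketch; the 1-D linear model, the learning rate of 0.05, and the toy data below are illustrative assumptions, not part of the card:

```python
# Dataset S: input/output pairs generated by the process y = 2x
S = [(x, 2.0 * x) for x in [0.0, 1.0, 2.0, 3.0]]

# Model M: a single adjustable parameter w, predicting y = w * x
w = 0.0

# Objective function L: mean squared error of M on S
def loss(w, S):
    return sum((w * x - y) ** 2 for x, y in S) / len(S)

# Algorithm A: gradient descent adjusts M based on S and L
for _ in range(100):
    grad = sum(2 * (w * x - y) * x for x, y in S) / len(S)
    w -= 0.05 * grad

print(round(w, 3))  # w approaches 2.0
```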
4
Q

Machine learning is an important prerequisite for the implementation of a broad range of cognitive functions in artificial cognitive systems:

A
  • Learning and Development: modeling and implementation of biological
    learning mechanisms (operant conditioning, implicit learning, explicit
    learning, perception, etc.)
  • Memory, Knowledge, and Internal Simulation: modeling and implementation
    of the encoding, storage, and retrieval of facts, experiences, and actions
    (e.g. associative memory)
  • Perception: learning basic features to detect and categorize perceptual stimuli
    (e.g. unsupervised learning of visual features)
  • Autonomy: dynamic adaptation to changes in the environment (e.g. continuous
    online learning from a live data stream)
5
Q

Examples of Practical Applications of Machine Learning

A
  • Image classification
  • Speech recognition
  • Autonomous driving
  • Recommendation systems
  • Threat protection
  • Control systems
6
Q

Definition of the Machine Learning Task

A

Train a model M in a hypothesis space H using a learning algorithm A so that
M minimizes the loss L

7
Q

Types of Machine Learning

A

Unsupervised Learning
* Solely unlabeled data
* Discovery of structural features in the data set

Reinforcement Learning
* Interaction with the environment
* Reward signal encodes feedback for the policy

Semi-Supervised Learning
* Labeled and unlabeled training samples
* A priori assumptions on input data required

Supervised Learning
* All training samples are labeled
* Desired output is specified exactly

8
Q

Combining Hypotheses to Ensembles

A

Ensemble methods in machine learning are a simple way to extend hypothesis
spaces by combining a set of hypotheses h1, h2, ..., hn ∈ H into a new hypothesis
h* ∈ Hn.
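A minimal sketch of combining hypotheses into an ensemble by averaging; the three toy hypotheses and the averaging rule are illustrative assumptions (other combination rules, e.g. voting, are equally valid):

```python
# Three simple hypotheses h1..h3 on the same input (toy examples)
def h1(x): return x + 1.0
def h2(x): return 2.0 * x
def h3(x): return x - 0.5

# Ensemble hypothesis h*: the average of the individual predictions
def h_star(x, hypotheses=(h1, h2, h3)):
    return sum(h(x) for h in hypotheses) / len(hypotheses)

print(h_star(2.0))  # (3.0 + 4.0 + 1.5) / 3
```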

9
Q

Boosting

A

Boosting algorithms compute a strong learner by incrementally constructing an
ensemble of hypotheses:
* Every training sample si ∈ S is assigned a weight wi; initially, all weights are
set to the same value
* Weights of incorrectly learned samples are increased
* The training of new hypotheses focuses on samples with high weights
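The weight update described above can be sketched in an AdaBoost-style form; the card does not name a specific algorithm, so the factor alpha and the toy data are assumptions:

```python
import math

weights = [0.25, 0.25, 0.25, 0.25]   # initially all weights are equal
correct = [True, True, False, True]  # hypothesis h got sample 3 wrong

# Weighted error of h and the resulting update factor (AdaBoost-style)
eps = sum(w for w, c in zip(weights, correct) if not c)
alpha = 0.5 * math.log((1 - eps) / eps)

# Increase weights of incorrectly learned samples, decrease the others
weights = [w * math.exp(alpha if not c else -alpha)
           for w, c in zip(weights, correct)]
total = sum(weights)
weights = [w / total for w in weights]  # renormalize to sum to 1
```

After the update, the misclassified sample carries half of the total weight, so the next hypothesis focuses on it.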

10
Q

Underfitting vs. Overfitting

A
  • Underfitting: h fits the training data poorly and does not model the underlying process because H is not expressive enough
  • Overfitting: h fits the training data very well but does not model the underlying process because it does not generalize well
11
Q

Generalization

A

The predictive performance of h on data that were not considered
during the training phase

12
Q

Occam's Razor

A

Of two competing theories, the simpler explanation of an
entity is to be preferred

13
Q

Generative and Discriminative Models

A
  • Discriminative models are based on the posterior probabilities P(y|x)
  • Generative models are based on the class-conditional probabilities P(x|y) and the priors P(y); predictions can be computed by applying Bayes' theorem: P(y|x) = P(x|y)P(y)/P(x).
    Generative models are compact representations of the training data that have considerably fewer parameters than the dataset S
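Bayes' theorem P(y|x) = P(x|y)P(y)/P(x) can be checked with a small numeric example; the prior and likelihood values below are made up for illustration:

```python
# Two classes y in {0, 1} with assumed prior and likelihood values
prior = {0: 0.6, 1: 0.4}          # P(y)
likelihood = {0: 0.2, 1: 0.7}     # P(x|y) for one observed input x

# Evidence P(x), obtained by marginalizing over y
p_x = sum(likelihood[y] * prior[y] for y in prior)

# Posterior P(y|x) for each class via Bayes' theorem
posterior = {y: likelihood[y] * prior[y] / p_x for y in prior}
print(posterior)  # the posteriors sum to 1
```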
14
Q

Training, Validation, and Test

A
  • Training set: the samples used in the training phase by the learning algorithm to search for a hypothesis h in the hypothesis space H
  • Validation set: a set of samples that are used to assess the performance of a
    hypothesis h that was computed in the training phase; based on the performance of h, the parameters of the training phase can be adjusted
  • Test set: a set of samples (or real-world data) that is used to assess the performance of the final model
15
Q

Cross-Validation

A
  • The dataset is partitioned into k subsets, and training is performed in k iterations
  • In every iteration, a different subset is selected as the validation set; the remaining k − 1 subsets form the training set
  • The overall performance corresponds to the averaged performance of the k
    iterations
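The procedure can be sketched as follows; the interleaved partitioning and the placeholder performance measure are illustrative assumptions:

```python
# Minimal k-fold cross-validation sketch
def k_fold(dataset, k):
    folds = [dataset[i::k] for i in range(k)]  # partition into k subsets
    for i in range(k):
        validation = folds[i]                  # one subset for validation
        training = [s for j, f in enumerate(folds) if j != i for s in f]
        yield training, validation

data = list(range(10))
scores = []
for train, val in k_fold(data, 5):
    # placeholder "performance": fraction of the data used for training
    scores.append(len(train) / len(data))

print(sum(scores) / len(scores))  # averaged performance over the k folds
```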
16
Q

Techniques for the Avoidance of Overfitting

A

Regularization
Overfitting is only possible with hypotheses h that are complex enough to capture statistical features that do not explain the data (e.g. noise). A regularization term in the objective function L guides the learning process
towards simpler solutions by penalizing complexity.

More Training Data
Overfitting can be reduced by increasing the size of the
dataset.

Dataset Augmentation
If not enough training data is available, the size of the dataset can be increased
by applying transformations to the training samples (e.g. adding noise, applying shifts,
rotations, etc.).
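The regularization idea can be sketched as an L2 penalty added to the data loss; the penalty form and the trade-off factor lambda_ are illustrative assumptions, since the card does not prescribe a specific regularizer:

```python
# L2-regularized objective: the penalty term punishes large weights,
# guiding the learning process towards simpler solutions
def regularized_loss(weights, data_loss, lambda_=0.1):
    penalty = lambda_ * sum(w * w for w in weights)
    return data_loss + penalty

print(regularized_loss([3.0, -2.0], data_loss=1.0))  # 1.0 + 0.1 * 13 = 2.3
```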

17
Q

The Perceptron

A

The perceptron is a linear classifier that is based on a single neuron with a hard threshold function

18
Q

Perceptron Learning Rule Properties

A
  • If a solution exists, i.e. if the data set is linearly separable, then the perceptron learning algorithm finds a solution within a finite number of steps (perceptron convergence theorem)
  • The solution computed by the algorithm depends on the initialization of the
    parameters and on the order of presentation of the training samples
  • The algorithm does not converge for data sets that are not linearly separable
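The learning rule whose properties are listed above can be sketched as follows; the AND data set, the ±1 labels, and the update w ← w + y·x are a common textbook form assumed here, not taken from the card:

```python
# Perceptron with a hard threshold activation on ±1 labels
def predict(w, b, x):
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) + b > 0 else -1

# Linearly separable toy data: the AND function
samples = [((0, 0), -1), ((0, 1), -1), ((1, 0), -1), ((1, 1), 1)]
w, b = [0.0, 0.0], 0.0

for _ in range(100):  # a finite number of passes suffices (convergence theorem)
    errors = 0
    for x, y in samples:
        if predict(w, b, x) != y:
            # learning rule: w <- w + y*x, b <- b + y
            w = [wi + y * xi for wi, xi in zip(w, x)]
            b += y
            errors += 1
    if errors == 0:  # all samples classified correctly: a solution was found
        break
```

Presenting the samples in a different order (or starting from different weights) yields a different, equally valid separating hyperplane.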
19
Q

Interpolation vs. Regression

A

The interpolation function f(·) must
be consistent with S: f(xi) = yi for all i.

The hypothesis h(·) should minimize L
and generalize well to new samples.

20
Q

Classifying Multiple Classes Problems

A

One-Versus-the-Rest Classifier
* Separation of K classes with K − 1 binary discriminant functions
* Every discriminant function separates one class from all others

One-Versus-One Classifier
* Pairwise separation of K classes with K(K − 1)/2 binary
discriminant functions
* The class of a data sample is assigned by a majority vote

In both cases, some regions are classified ambiguously!
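The number of binary discriminant functions required by the two schemes can be expressed directly:

```python
# One-versus-the-rest: K - 1 binary discriminant functions for K classes
def one_vs_rest(K):
    return K - 1

# One-versus-one: one discriminant per pair of classes, K(K - 1)/2 in total
def one_vs_one(K):
    return K * (K - 1) // 2

print(one_vs_rest(4), one_vs_one(4))  # 3 6
```

For growing K, the one-versus-one scheme requires quadratically many discriminant functions, while one-versus-the-rest stays linear.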
