Supervised Learning Flashcards
Bias error
Model does not / cannot correctly represent the concept (underfitting)
Variance error
Model over-specializes to the training set (overfitting)
Regularization (favoring smoother functions, whose output varies slowly with the input) helps mitigate the variance error
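A minimal sketch of the idea, assuming scikit-learn and an illustrative toy dataset (not from the card): ridge (L2) regularization shrinks the weights, favoring a smoother function and reducing variance.

```python
# Sketch: L2 (ridge) regularization to reduce variance on a small noisy dataset.
# The dataset and alpha=1.0 are illustrative assumptions, not values from the card.
import numpy as np
from sklearn.linear_model import LinearRegression, Ridge

rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(30, 10))      # few examples, many features: high variance risk
y = X[:, 0] + 0.1 * rng.normal(size=30)    # only the first feature actually matters

plain = LinearRegression().fit(X, y)
ridge = Ridge(alpha=1.0).fit(X, y)         # penalizes large weights -> smoother function

print("unregularized weights:", np.round(plain.coef_, 2))
print("ridge weights:        ", np.round(ridge.coef_, 2))
```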
Multilinear Regression assumes
- The relation between each xi and y is linear
- All variables (x) have normal distributions
- Variables are independent and the residual / error variance is constant
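A minimal multiple linear regression sketch, assuming NumPy and illustrative data; the residuals can be inspected to check the assumptions above.

```python
# Sketch: fit y = b0 + b1*x1 + b2*x2 by ordinary least squares with NumPy.
# The true coefficients (3.0, 2.0, -1.0) and noise level are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(100, 2))
y = 3.0 + 2.0 * X[:, 0] - 1.0 * X[:, 1] + 0.1 * rng.normal(size=100)

Xb = np.column_stack([np.ones(len(X)), X])     # add intercept column
coef, *_ = np.linalg.lstsq(Xb, y, rcond=None)  # least-squares coefficients
residuals = y - Xb @ coef                      # residuals should look constant / uncorrelated

print("intercept and slopes:", np.round(coef, 2))
print("residual std:", round(residuals.std(), 3))
```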
The input of an artificial neuron
Comes from all neurons of the previous layer, or is an external input
The output of an artificial neuron
Is sent to all neurons of the next layer, or is (part of) the network output
Backpropagation
- Present each example (x(i), d(i))
- Calculate the network response to x(i): f(x(i))
- Propagate the error backwards (iteratively building the error derivative at each layer)
- Save the partial derivatives
- After all examples are processed, update the weights
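A minimal sketch of the loop above for a single hidden layer with sigmoid units and squared error; the shapes, activation, and learning rate are assumptions, not part of the card.

```python
# Sketch of batch backpropagation: forward pass, backward error propagation,
# accumulation of partial derivatives, and a weight update after all examples.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def backprop_epoch(X, D, W1, W2, lr=0.1):
    """X: (n, n_in) inputs, D: (n, n_out) targets, W1/W2: weight matrices (biases omitted for brevity)."""
    dW1 = np.zeros_like(W1)
    dW2 = np.zeros_like(W2)
    for x, d in zip(X, D):                           # present each example (x(i), d(i))
        h = sigmoid(x @ W1)                          # hidden activations
        y = sigmoid(h @ W2)                          # network response f(x(i))
        delta_out = (y - d) * y * (1 - y)            # error derivative at the output layer
        delta_hid = (delta_out @ W2.T) * h * (1 - h) # propagate the error backwards
        dW2 += np.outer(h, delta_out)                # save (accumulate) partial derivatives
        dW1 += np.outer(x, delta_hid)
    W1 -= lr * dW1                                   # after all examples: update weights
    W2 -= lr * dW2
    return W1, W2
```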
Artificial Neural Networks are
- Robust to noise and approximations
- Based on a simplified model of a neuron
- Support incremental training
- Compress the information of many examples into a small model
Deep Learning
Alternates prediction layers with feature-detection and decorrelation layers
Deep Learning - network structure
- Convolutional layers: apply convolutions to get the feature maps
- Pooling (sub-sampling) layers: reduce feature maps’ dimensions (combine features and/or decorrelate)
- Dense layers: similar to the “hidden” layers in a classical neural network
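A minimal sketch of this convolution → pooling → dense structure, assuming Keras; the input shape and layer sizes are illustrative.

```python
# Sketch: convolutional + pooling + dense layers, mirroring the structure above.
# Input shape (28, 28, 1) and layer sizes are illustrative assumptions.
from tensorflow import keras
from tensorflow.keras import layers

model = keras.Sequential([
    keras.Input(shape=(28, 28, 1)),
    layers.Conv2D(16, kernel_size=3, activation="relu"),  # convolutional layer -> feature maps
    layers.MaxPooling2D(pool_size=2),                     # pooling layer -> smaller feature maps
    layers.Conv2D(32, kernel_size=3, activation="relu"),
    layers.MaxPooling2D(pool_size=2),
    layers.Flatten(),
    layers.Dense(64, activation="relu"),                  # dense ("hidden") layer
    layers.Dense(10, activation="softmax"),               # output layer
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")
```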
kNN problems
- Define distance
- Define class selection
- Non-linear problems
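A minimal sketch, assuming NumPy, that makes the two design choices explicit; Euclidean distance, majority-vote class selection, and k=3 are illustrative choices.

```python
# Sketch: k-nearest-neighbours classification with an explicit distance (Euclidean)
# and class-selection rule (majority vote). k=3 is an illustrative assumption.
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x, k=3):
    """X_train: (n, d) array, y_train: (n,) array of labels, x: (d,) query point."""
    dists = np.linalg.norm(X_train - x, axis=1)   # distance definition
    nearest = np.argsort(dists)[:k]               # indices of the k closest examples
    votes = Counter(y_train[nearest].tolist())    # class selection: majority vote
    return votes.most_common(1)[0][0]
```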
A set has the largest entropy if
each of its elements belongs to a
different class
PlayTennis(no/yes) - entropy(S)
− ( P(no) x log2 (P(no)) + P(yes) x log2 (P(yes)) )
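A worked evaluation of the formula, assuming the textbook PlayTennis split of 9 "yes" and 5 "no" examples:

```python
# Sketch: evaluating the two-class entropy formula above for the textbook PlayTennis
# split of 9 "yes" and 5 "no" examples (the counts are the usual textbook example).
import math

p_yes, p_no = 9 / 14, 5 / 14
entropy_S = -(p_no * math.log2(p_no) + p_yes * math.log2(p_yes))
print(round(entropy_S, 3))   # ~0.940 bits
```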
Decision Tree - the best split
The best split is the split that results in the largest entropy reduction, that is, the largest information
gain (IG)
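A minimal sketch of information gain using the same entropy definition; the Outlook split counts are the textbook PlayTennis example.

```python
# Sketch: information gain of a split = entropy(S) minus the weighted entropy of the
# subsets the split produces; the attribute with the largest IG gives the best split.
# The Outlook split counts below are the textbook PlayTennis example.
import math
from collections import Counter

def entropy(labels):
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(labels, subsets):
    n = len(labels)
    return entropy(labels) - sum(len(s) / n * entropy(s) for s in subsets)

S = ["yes"] * 9 + ["no"] * 5
outlook_subsets = [
    ["yes"] * 2 + ["no"] * 3,   # sunny
    ["yes"] * 4,                # overcast
    ["yes"] * 3 + ["no"] * 2,   # rain
]
print(round(information_gain(S, outlook_subsets), 3))   # ~0.247
```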
Decision Tree - C4.5 / C5.0
Similar to ID3, but...
* Support for continuous attributes - discretizes continuous attributes
* Allows missing values - examples not used when calculating entropy
* Allows different costs for attributes
* Pruning
Learning ensembles
Boosting (Kearns 88)
* Can a set of weak learners create a single strong learner?
* Classification combines the results of all the subtrees
* Misclassified examples are given more weight in the error at each iteration
* New trees are trained to fit the residual error
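A minimal sketch, assuming scikit-learn and toy data: AdaBoost re-weights misclassified examples, while gradient boosting fits new trees to the residual error.

```python
# Sketch: two boosting flavours mentioned above, via scikit-learn.
# The toy data and hyperparameters are illustrative assumptions.
import numpy as np
from sklearn.ensemble import AdaBoostClassifier, GradientBoostingRegressor

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
y_class = (X[:, 0] + X[:, 1] > 0).astype(int)       # toy classification target
y_reg = X[:, 0] ** 2 + 0.1 * rng.normal(size=200)   # toy regression target

clf = AdaBoostClassifier(n_estimators=50).fit(X, y_class)        # re-weights misclassified examples
reg = GradientBoostingRegressor(n_estimators=100).fit(X, y_reg)  # fits trees to residual errors
print(clf.score(X, y_class), reg.score(X, y_reg))
```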
Bagging - Bootstrap aggregating (Breiman 96)
* Randomly selects the subsets (bootstrap samples, drawn with replacement)
* Trains one learner per subset
* Classification by voting, regression by averaging
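A minimal sketch, assuming scikit-learn and toy data; BaggingClassifier draws bootstrap subsets, trains one learner per subset, and classifies by voting.

```python
# Sketch: bootstrap aggregating with scikit-learn's BaggingClassifier
# (its default base learner is a decision tree). Toy data is an illustrative assumption.
import numpy as np
from sklearn.ensemble import BaggingClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
y = (X[:, 0] - X[:, 2] > 0).astype(int)

bag = BaggingClassifier(n_estimators=25, bootstrap=True).fit(X, y)  # 25 learners, bootstrap subsets
print(bag.score(X, y))                                              # classification by voting
```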