Chapter 10: Ensemble Models Flashcards

1
Q

what is the principle of committee decision?

A

individual predictions, combined appropriately, should on average have better overall accuracy than any individual committee member

2
Q

what three methods can be used to combine decisions?

A

averaging, voting, probabilistic
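
A minimal sketch of the three methods, using made-up predictions from a hypothetical three-member committee (the numbers are illustrative, not from the chapter):

```python
import numpy as np

# Averaging: for regression, take the mean of the members' outputs.
regression_preds = np.array([2.9, 3.1, 3.3])
avg = regression_preds.mean()                 # 3.1

# Voting: for classification, take the majority class label.
class_preds = np.array([1, 0, 1])
vote = np.bincount(class_preds).argmax()      # class 1

# Probabilistic: average each member's class-probability estimates,
# then pick the class with the highest combined probability.
probs = np.array([[0.6, 0.4],   # member 1: P(class 0), P(class 1)
                  [0.3, 0.7],   # member 2
                  [0.4, 0.6]])  # member 3
prob_combined = probs.mean(axis=0).argmax()   # class 1
```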

3
Q

what is the wisdom of crowds?

A

different models make errors in different ways; combining them cancels out the errors of the individuals

4
Q

what is bagging?

A

generate different training sets by sampling with replacement, then train one model per training set to build a committee of models
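
A minimal bagging sketch on toy data, assuming scikit-learn decision trees as the base learner and majority vote as the combiner (neither is prescribed by the card):

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))                  # toy features
y = (X[:, 0] + X[:, 1] > 0).astype(int)        # toy labels

n = len(X)
committee = []
for _ in range(25):
    idx = rng.integers(0, n, size=n)           # n indices drawn WITH replacement
    committee.append(DecisionTreeClassifier().fit(X[idx], y[idx]))

# Combine by majority vote across the committee's predictions.
votes = np.stack([m.predict(X) for m in committee])
y_hat = (votes.mean(axis=0) > 0.5).astype(int)
```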

5
Q

is bagging a sequential or parallel method?

A

parallel

6
Q

what are random forests?

A

an extension of bagging applied to decision trees.

7
Q

what are the two methods of generating randomisation for random forests?

A

bootstrap, as in bagging

random selection of features at each split point
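
A sketch of how both sources of randomisation surface in scikit-learn's RandomForestClassifier; the parameter names bootstrap and max_features are sklearn's, not the chapter's:

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

forest = RandomForestClassifier(
    n_estimators=100,
    bootstrap=True,        # 1: bootstrap sample per tree, as in bagging
    max_features="sqrt",   # 2: random subset of features at each split point
)

# Toy data just to make the sketch runnable.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 8))
y = (X[:, 0] > 0).astype(int)
forest.fit(X, y)
```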

8
Q

why do we force a random selection of features at each split point in random forests?

A

to ensure a tree doesn't choose the single best feature at every split, which would make the trees too similar to each other.

9
Q

what does the lack of pruning of random forest trees ensure?

A

that the models aren’t too simple and therefore too similar to each other.

10
Q

is adaboost parallel or sequential?

A

sequential

11
Q

describe the adaboost algorithm

A

a new model is trained at each round; at the end of each round, the misclassified examples are identified and have their emphasis (weight) increased for the next round
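
A minimal AdaBoost-style training loop on toy data, using the update scheme from card 15; decision stumps as the base learner and the early-stopping check are assumptions, not from the card:

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
y = (X[:, 0] + X[:, 2] > 0).astype(int)

n = len(X)
D = np.full(n, 1.0 / n)               # start with a uniform distribution
models = []
for _ in range(10):                   # 10 sequential rounds
    stump = DecisionTreeClassifier(max_depth=1)
    stump.fit(X, y, sample_weight=D)  # emphasis passed as sample weights
    wrong = stump.predict(X) != y
    e = D[wrong].sum()                # weighted error of this round's model
    models.append(stump)
    if e == 0 or e >= 0.5:            # perfect or too-weak model: stop early
        break
    D[wrong] *= 1.0 / (2.0 * e)           # increase emphasis on errors
    D[~wrong] *= 1.0 / (2.0 * (1.0 - e))  # decrease emphasis on the rest
```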

12
Q

what is the key difference between bagging and boosting?

A

at each round, bagging uses a uniform distribution over the training examples, while boosting adapts a non-uniform distribution.

bagging is parallel, boosting is sequential.

13
Q

what is the main boosting algorithm?

A

adaboost

14
Q

what is the update of the distribution function in adaboost after each round?

A

the misclassified examples are re-weighted so that they make up 50% of the weight in the next round's training distribution

15
Q

give the distribution update scheme for adaboost, i.e. what do we multiply D_j(i) by?

A

multiply D_j(i) by 1 / (2e_j) if example i was classified incorrectly

multiply D_j(i) by 1 / (2(1 - e_j)) if example i was classified correctly

(e_j is the weighted error of round j's model)
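
A quick worked check with made-up numbers, showing that this scheme leaves the misclassified examples with 50% of the total weight (card 14):

```python
import numpy as np

# 10 examples with uniform weights; 2 are misclassified, so e_j = 0.2.
D = np.full(10, 0.1)
wrong = np.zeros(10, dtype=bool)
wrong[:2] = True
e = D[wrong].sum()                        # 0.2

D[wrong] *= 1 / (2 * e)                   # 0.1 * 2.5   = 0.25 each
D[~wrong] *= 1 / (2 * (1 - e))            # 0.1 * 0.625 = 0.0625 each

print(D[wrong].sum(), D[~wrong].sum())    # 0.5 0.5 (up to float rounding):
                                          # errors now carry half the weight
```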

16
Q

Generally, an ensemble method works better if the individual models have…

A

Less correlation among predictions

17
Q

Boosting is a family of algorithms, where

A

Each base model is trained on a dataset drawn from a weighted distribution, with weights proportional to the errors made by the previous model

18
Q

Bagging is an ensemble algorithm, where … is used to ensure predictors are …

A

Bootstrapping is used to ensure predictors are less correlated in their errors

19
Q

where does the name bagging come from?

A

bootstrap aggregating

20
Q

list common ML models in order of stability (most to least stable)

A

svm
knn
logistic regression/perceptrons
neural networks
decision trees

21
Q

what kinds of models are completely unaffected by bootstrapping?

A

stable models