12. Ensemble Methods Flashcards
decision trees
while easy to implement, train, use, and interpret, a single decision tree tends to be inaccurate
core idea of ensemble method
when weak models are carefully trained and combined, they should be able to produce more accurate and robust prediction results
what are the ensemble methods
- bagging (bootstrap aggregation)
  - create diverse sample sets by sampling the data with replacement
  - train a model on each sample -> learn B weak learners
  - aggregate the predictions from all learners
  - achieves good results by reducing variance
- boosting
  - train a sequence of weak models, each compensating for the weaknesses of its predecessors
  - each new model corrects the prediction errors made so far
  - achieves good results by reducing bias
- stacking
  - train multiple models on the same training dataset and use another model to aggregate their predictions (all three are sketched in code after this list)
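Below is a minimal scikit-learn sketch of all three families on a synthetic dataset. The exact keyword names are an assumption about the library version: in scikit-learn 1.2+ the weak-learner argument is `estimator` (older releases call it `base_estimator`).

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier, BaggingClassifier, StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=1000, n_features=20, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

stump = DecisionTreeClassifier(max_depth=2)  # the weak learner

models = {
    # bagging: B bootstrap samples, one tree per sample, majority vote
    "bagging": BaggingClassifier(estimator=stump, n_estimators=50, random_state=0),
    # boosting: trees trained sequentially, each reweighting hard samples
    "boosting": AdaBoostClassifier(estimator=stump, n_estimators=50, random_state=0),
    # stacking: diverse base trees + a logistic-regression meta-learner
    "stacking": StackingClassifier(
        estimators=[("shallow", DecisionTreeClassifier(max_depth=2)),
                    ("deeper", DecisionTreeClassifier(max_depth=5))],
        final_estimator=LogisticRegression()),
}

for name, model in models.items():
    model.fit(X_tr, y_tr)
    print(f"{name}: {accuracy_score(y_te, model.predict(X_te)):.3f}")
```

Sharing one `stump` instance across models is safe here because scikit-learn clones estimators at fit time.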
random forest
improve the variance reduction of bagging by reducing the correlation between base models, without increasing the variance of the individual trees too much
variance of the average of B trees, each with variance $\sigma^2$ and pairwise correlation $\rho$: $\rho\sigma^2 + \frac{1-\rho}{B}\sigma^2$ (as B grows, only the $\rho\sigma^2$ term remains, so lowering $\rho$ lowers the floor)
how does random forest work?
- draw B bootstrap samples and grow one decision tree per sample (as in bagging)
- at each split, consider only a random subset of the features (e.g., sqrt(p) of the p features for classification), which decorrelates the trees
- aggregate by majority vote (classification) or averaging (regression); see the sketch below
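A short sketch under the same assumptions: first a numeric check of the variance formula above, then scikit-learn's RandomForestClassifier, where `max_features` controls the per-split feature subsampling that drives the correlation down.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Variance of an average of B correlated trees: rho*sigma^2 + (1 - rho)/B * sigma^2.
# Lowering rho lowers the floor that adding more trees alone cannot get past.
sigma2, B = 1.0, 100
for rho in (0.9, 0.5, 0.1):
    print(f"rho={rho}: ensemble variance = {rho * sigma2 + (1 - rho) / B * sigma2:.3f}")

X, y = make_classification(n_samples=1000, n_features=20, random_state=0)
# max_features="sqrt" means each split only considers ~sqrt(p) random features,
# decorrelating trees that bagging alone would leave very similar.
rf = RandomForestClassifier(n_estimators=100, max_features="sqrt", random_state=0)
rf.fit(X, y)
```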
AdaBoost
put more weight on difficult instances and less on those already learned
- initialize the weights of all samples (uniformly)
- bootstrap a sample based on the weights
- fit a weak learner G_k on the sampled data
- evaluate G_k on the dataset and calculate the weighted error e_k
- compute the impact factor alpha_k from e_k
- update the sample weights based on e_k (misclassified samples gain weight)
- update the ensemble G by adding alpha_k * G_k
- repeat until termination (see the sketch below)
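A from-scratch sketch of those steps for labels in {-1, +1}. It reweights samples directly instead of bootstrapping by weight (both variants exist); the names `adaboost_fit`/`adaboost_predict` are our own.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def adaboost_fit(X, y, n_rounds=20):
    """Minimal AdaBoost sketch for labels y in {-1, +1}."""
    n = len(y)
    w = np.full(n, 1.0 / n)                          # uniform initial sample weights
    learners, alphas = [], []
    for _ in range(n_rounds):
        g = DecisionTreeClassifier(max_depth=1)      # decision stump as weak learner
        g.fit(X, y, sample_weight=w)                 # fit G_k on the weighted data
        pred = g.predict(X)
        e = np.clip(w @ (pred != y), 1e-10, 1 - 1e-10)  # weighted error e_k
        alpha = 0.5 * np.log((1 - e) / e)            # impact factor alpha_k
        w *= np.exp(-alpha * y * pred)               # misclassified samples gain weight
        w /= w.sum()                                 # renormalize the weights
        learners.append(g)
        alphas.append(alpha)
    return learners, alphas

def adaboost_predict(X, learners, alphas):
    """Final ensemble: G(x) = sign(sum_k alpha_k * G_k(x))."""
    return np.sign(sum(a * g.predict(X) for a, g in zip(alphas, learners)))
```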
Gradient Boosting
learn the errors (pseudo-residuals) made by the predecessors: each new weak learner is fit to the negative gradient of the loss at the current ensemble's predictions (see the sketch below)
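A minimal regression sketch with squared loss, where the pseudo-residuals are simply y - F(x). The function names and the learning rate of 0.1 are our choices, not from the source.

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

def gradient_boost_fit(X, y, n_rounds=100, lr=0.1):
    """Minimal gradient-boosting sketch for squared loss."""
    f0 = y.mean()                      # start from the best constant model
    F = np.full(len(y), f0)            # current ensemble predictions
    trees = []
    for _ in range(n_rounds):
        residuals = y - F              # pseudo-residuals = negative gradient of (y - F)^2 / 2
        t = DecisionTreeRegressor(max_depth=3)
        t.fit(X, residuals)            # each tree learns its predecessors' errors
        F += lr * t.predict(X)         # add the shrunken correction to the ensemble
        trees.append(t)
    return f0, trees

def gradient_boost_predict(X, f0, trees, lr=0.1):
    return f0 + lr * sum(t.predict(X) for t in trees)
```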
what are the advantages of ensemble methods
- able to learn complex problems
- generally high accuracy
what are the disadvantages of ensemble methods
- might not work well on simple problems
- lack of interpretability
- computationally expensive