12. Ensemble Method* Flashcards

1
Q

decision trees

A

while easy to implement, train, use and interpret, it is inaccurate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

core idea of ensemble method

A

when weak models are carefully trained and combined, they should be able to produce more accurate and robust prediction results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what are the ensemble methods

A
  1. bagging (bootstrap aggregation)
    - create diverse sample data with replacement
    - train model with subset of data and aggregate -> learn B weak learners
    - aggregate predictions from all learners
    - achieve good results by reducing variance
  2. boosting
    - training a sequence of weak models, each compensating the weaknesses of its predecessors
    - correct predicting error
    - achieve good results by decreasing the bias
  3. stacking
    - train multiple models from the same training dataset and use another model to aggregate prediction
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

random forest

A

improve variance deduction of bagging by reducing the correlation between base models without increasing the variance

p * sigma^2 (1 - p) * sigma^2 / n

how does random forest work..?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

adaboost**

A

put more weight on difficult instances and less on those already learned

  1. set weights of all samples
  2. bootstrap k samples based on weights
  3. fit leaner XX on sample data
  4. evaluate weak learner G on dataset and calculate loss
  5. compute impacting factor of alpha
  6. update sample weights based on e
  7. update G
  8. repeat till terminate
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

gradient boost**

A

learn the errors (pseudo residuals) made from predecessors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what are the advantages to ensemble

A
  1. able to learn complex problems
  2. generally high accuracy
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what are the disadvantages to ensemble

A
  1. might not fit well in simple problems
  2. lack of interpretability
  3. computationally expensive
How well did you know this?
1
Not at all
2
3
4
5
Perfectly