7. Tree-based algorithms Flashcards
Purpose of tree-based algorithms
Prediction
(applicable to regression and classification)
Impurity measures
Used to determine the quality of a split (see the sketch after the list)
- Gini Index
- Entropy
- Re-substitution (misclassification) error
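A minimal sketch of the three measures, computed from a node's class proportions p_k (plain NumPy; the function names are just illustrative):

```python
import numpy as np

def gini(p):
    # Gini index: sum_k p_k * (1 - p_k)
    p = np.asarray(p, dtype=float)
    return float(np.sum(p * (1.0 - p)))

def entropy(p):
    # Cross-entropy / deviance: -sum_k p_k * log(p_k), with 0 * log(0) treated as 0
    p = np.asarray(p, dtype=float)
    nz = p[p > 0]
    return float(-np.sum(nz * np.log(nz)))

def misclassification_error(p):
    # Re-substitution (classification) error of the node: 1 - max_k p_k
    return float(1.0 - np.max(np.asarray(p, dtype=float)))

# Example: class proportions in a node
p = [0.7, 0.2, 0.1]
print(gini(p), entropy(p), misclassification_error(p))
```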
Explain the decision tree algorithm
(0. Pre-processing, e.g. binarize the response for classification; otherwise use a regression tree instead of a classification tree)
1. Recursive binary splitting, choosing each split via an impurity measure
2. Improve with cost-complexity pruning –> grow a large tree and prune it back
(Select the tuning parameter via (k-fold) cross-validation; sketched below)
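A possible scikit-learn sketch of steps 1-2: grow a large tree, compute the cost-complexity pruning path, and choose the pruning parameter by k-fold cross-validation (the toy data set is only a placeholder):

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

# Placeholder data set standing in for the real one
X, y = make_classification(n_samples=300, n_features=8, random_state=0)

# Steps 1-2: grow a large tree and get the candidate cost-complexity parameters alpha
path = DecisionTreeClassifier(random_state=0).cost_complexity_pruning_path(X, y)

# Choose alpha by 5-fold cross-validation, then refit the pruned tree
scores = [
    cross_val_score(DecisionTreeClassifier(ccp_alpha=a, random_state=0), X, y, cv=5).mean()
    for a in path.ccp_alphas
]
best_alpha = path.ccp_alphas[int(np.argmax(scores))]
pruned_tree = DecisionTreeClassifier(ccp_alpha=best_alpha, random_state=0).fit(X, y)
```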
Decision Tree advantages
- Low pre-processing effort (normalization, scaling not required)
- Missing values have little effect
- Easy to explain and interpret (closely mimic human decision-making)
- Handle qualitative predictors without the need to create dummy variables (work with qualitative, quantitative, continuous, and discrete variables)
- Faster than RF
Decision Tree disadvantages
- Lower predictive accuracy than other regression/classification approaches
–> improve by aggregation, at the cost of interpretability and speed
- Overfitting risk (unlike Bagging/RF)
- Unstable: small changes in the data can change the tree substantially
- Can become quite complex (computationally expensive)
Effect of Bagging, Boosting, RF
+ Increase predictive accuracy (lower variance)
- Lower interpretability
- Lower speed (complexity)
RF: Adds random predictor selection at each split to bagging, producing decorrelated trees; less risk of overfitting (see the sketch below)
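A minimal scikit-learn sketch of the contrast; the random forest only adds the random predictor subset per split (max_features, here the usual sqrt(p) rule), and the data set is a placeholder:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier, RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=300, n_features=10, random_state=0)

# Bagging: trees on bootstrap samples, all predictors available at every split
bagging = BaggingClassifier(DecisionTreeClassifier(), n_estimators=200, random_state=0)

# Random forest: same idea, but only a random subset of predictors (sqrt(p)) per split
forest = RandomForestClassifier(n_estimators=200, max_features="sqrt", random_state=0)

for name, model in [("bagging", bagging), ("random forest", forest)]:
    print(name, cross_val_score(model, X, y, cv=5).mean())
```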
Explain Bagging
- Many weak learners are trained, each on its own bootstrap sample of the training data.
- Regression: take the mean of these estimates over the collection of bootstrap samples
–> Classification: the overall prediction is the most commonly occurring class among the predictions (= majority vote); see the sketch below
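A small from-scratch sketch of the idea, assuming a classification setting (bootstrap samples drawn with replacement, one tree per sample, majority vote; all names are illustrative):

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=300, n_features=8, random_state=0)
rng = np.random.default_rng(0)

# Train each weak learner on its own bootstrap sample (drawn with replacement)
trees = []
for _ in range(100):
    idx = rng.integers(0, len(X), size=len(X))
    trees.append(DecisionTreeClassifier().fit(X[idx], y[idx]))

# Classification: majority vote over the ensemble's predictions
votes = np.stack([t.predict(X) for t in trees])   # shape (n_trees, n_samples)
majority = np.apply_along_axis(lambda v: np.bincount(v).argmax(), 0, votes)
print("training accuracy of the bagged vote:", (majority == y).mean())
```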
Explain Boosting
Ensemble method that trains weak learners sequentially: each new model concentrates on the observations that previous iterations modelled poorly (in gradient boosting, by fitting the next tree to the current residuals).
(Unlike bagging, the trees are fit sequentially to modified versions of the original data rather than to independent bootstrap samples)
+ Improve performance and reduce variance
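A possible scikit-learn sketch; the parameter values are placeholders, but the three usual tuning knobs (number of trees, shrinkage/learning rate, tree depth) appear explicitly:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=300, n_features=10, random_state=0)

# Each small tree is fit to what the current ensemble still gets wrong (the residuals)
boosted = GradientBoostingClassifier(
    n_estimators=500,    # number of trees B
    learning_rate=0.05,  # shrinkage: how slowly the ensemble learns
    max_depth=2,         # depth of each weak learner (interaction depth)
    random_state=0,
)
print(cross_val_score(boosted, X, y, cv=5).mean())
```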
Boosting vs. RF vs. Bagging
Boosting:
— Selects from all predictive variables
— Trees are built sequentially; each depends on the errors of the previous iteration
— 3 tuning parameters (number of trees, shrinkage/learning rate, tree depth)
Benefit: learns from the errors of previous iterations
RF:
— Selects from a random subset of predictive variables at each split
— Trees are built independently at each iteration
— 2 tuning parameters (number of trees, number of predictors considered per split)
Bagging:
— Aggregation (mean for regression, majority vote for classification) of trees fit to bootstrap samples
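Side by side in scikit-learn (a sketch; the specific values are only placeholders):

```python
from sklearn.ensemble import (BaggingClassifier, GradientBoostingClassifier,
                              RandomForestClassifier)
from sklearn.tree import DecisionTreeClassifier

# Boosting: 3 tuning parameters (number of trees, shrinkage, tree depth)
boosting = GradientBoostingClassifier(n_estimators=500, learning_rate=0.05, max_depth=2)

# RF: 2 tuning parameters (number of trees, predictors considered per split)
forest = RandomForestClassifier(n_estimators=500, max_features="sqrt")

# Bagging: bootstrap aggregation of full trees, all predictors available at every split
bagging = BaggingClassifier(DecisionTreeClassifier(), n_estimators=500)
```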