6 Flashcards
What is the approximate depth of a decision tree trained with 1 million instances
Depth is calculated using log2(m) which roughly equals 20
Is a nodes Gini greater or lower then its parents
Its generally lower as the CART training equation aims to reduce impurity
If a decision tree is over fitting what should be done with the max_depth hyper parameter
Decrease max depth to constrain and regularize the model
If a decision tree is under fitting should the input features be scaled
It makes no difference
If it takes an hour to train a decision tree with 1 million instances how long will it take to train one with 10 million instances
11.7 hours
How does presort affect training speed
It speeds up training if the number of instances is less then a few thousand any more and it will slow it down