6 Flashcards

1
Q

What is the approximate depth of a decision tree trained with 1 million instances

A

Depth is calculated using log2(m) which roughly equals 20

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Is a nodes Gini greater or lower then its parents

A

Its generally lower as the CART training equation aims to reduce impurity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

If a decision tree is over fitting what should be done with the max_depth hyper parameter

A

Decrease max depth to constrain and regularize the model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

If a decision tree is under fitting should the input features be scaled

A

It makes no difference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

If it takes an hour to train a decision tree with 1 million instances how long will it take to train one with 10 million instances

A

11.7 hours

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How does presort affect training speed

A

It speeds up training if the number of instances is less then a few thousand any more and it will slow it down

How well did you know this?
1
Not at all
2
3
4
5
Perfectly