Chapter 9 Flashcards

1
Q

What is a tree leaf node?

A

In a classification tree each time there is a partition, the measurements are divided into 2 leaf nodes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How to determine which questions is better in decision trees?

A

1) balance - how much each questions splits up a group.

2) Purity - we want the groups toonly contain one group (e.g. type of animal)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is impurity?

A

A number that represents how impure the root is and also how impure it is after split.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Name impurity functions:

A

1) Entropy
2) Gini
3) ClassError

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is early stopping?

A

Instead of stopping at totally pure splits, stops at different criteria:

1) when branch contains fewer than x observations
2) Stop at certain tree depth
3) when purity gain of best split below a certain value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is pruning?

A

Cutting branches after stopped algorhitm.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly