B08 Decision Trees Flashcards
Decision trees are part of a group of learners that use all
the data available to them on a first-come, first-served
basis. As a result, they are known as __________ learners.
greedy
Decision Trees:
-Begin at the ____ node.
-Pass through _____ nodes.
-Data is split across ______ (outcomes).
-End at a ______ (final decision).
Root Node
Decision Nodes
Branches
Leaf/Terminal Node
Decision trees are built using a _______________
approach, which splits data into subsets and then
recursively splits the subsets into even smaller subsets
until one or more stopping criteria are met. This
approach is also known as ___________.
recursive partitioning
divide and conquer
Some of the criteria that trigger a stop to the recursive
partitioning process include when:
-All data in a leaf node are of the ______.
-All _______ have been exhausted.
-A specified _________ has been met.
same class
features
tree size limit
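As a rough illustration, the divide-and-conquer loop with these three stopping criteria can be sketched in pure Python. The names (`grow_tree`, `max_depth`) are hypothetical, and a real learner would pick the best split feature by an impurity measure rather than taking features in order:

```python
def grow_tree(rows, labels, features, depth=0, max_depth=3):
    """Recursively partition (rows, labels) until a stopping criterion fires."""
    # Stop 1: all data in the node are of the same class.
    if len(set(labels)) == 1:
        return {"leaf": labels[0]}
    # Stop 2: all features have been exhausted.
    # Stop 3: a specified tree-size (here, depth) limit has been met.
    if not features or depth >= max_depth:
        majority = max(set(labels), key=labels.count)
        return {"leaf": majority}
    feature = features[0]  # simplification: a real learner chooses the best feature
    tree = {"split_on": feature, "branches": {}}
    for value in set(row[feature] for row in rows):
        subset = [(r, l) for r, l in zip(rows, labels) if r[feature] == value]
        tree["branches"][value] = grow_tree(
            [r for r, _ in subset], [l for _, l in subset],
            features[1:], depth + 1, max_depth)
    return tree

# Toy data: play outside unless it is rainy.
rows = [{"outlook": "sunny"}, {"outlook": "sunny"}, {"outlook": "rainy"}]
labels = ["yes", "yes", "no"]
print(grow_tree(rows, labels, ["outlook"]))
```

Each recursive call sees only the subset of data that reached its branch, which is the sense in which the learner is greedy: every split is final once made.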
For most decision tree algorithms, the decision about
which feature to split upon is usually based on a
measure of impurity known as _______.
entropy
For decision trees, entropy is a quantification of the
_______________ within a set of class
values.
level of randomness or disorder
Entropy is highest when the split is _____. As one class dominates the other, entropy reduces to ______.
50-50
zero
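The two extremes above can be checked with a short Python sketch (the helper name `entropy` is ours, not from any particular library):

```python
from math import log2

def entropy(counts):
    """Shannon entropy (in bits) of a class distribution given as counts."""
    total = sum(counts)
    return sum(-c / total * log2(c / total) for c in counts if c > 0)

# A 50-50 split is maximally impure for two classes:
print(entropy([5, 5]))   # 1.0
# As one class dominates completely, entropy falls to zero:
print(entropy([10, 0]))  # 0.0
```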
To determine the optimal feature to split upon, the
decision tree algorithm calculates the change in
entropy that would result from a split on each possible
feature. This measure is known as _____________.
Information Gain
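Concretely, information gain is the parent node's entropy minus the size-weighted entropy of the child nodes a split would create. A minimal sketch (helper names are ours):

```python
from math import log2

def entropy(counts):
    total = sum(counts)
    return sum(-c / total * log2(c / total) for c in counts if c > 0)

def information_gain(parent, children):
    """Parent entropy minus the size-weighted entropy of the child nodes."""
    n = sum(parent)
    weighted = sum(sum(child) / n * entropy(child) for child in children)
    return entropy(parent) - weighted

# Separating 10 positives and 10 negatives into two pure children
# removes all uncertainty, gaining the full 1 bit:
print(information_gain([10, 10], [[10, 0], [0, 10]]))  # 1.0
# A split that leaves the class mix unchanged gains nothing:
print(information_gain([10, 10], [[5, 5], [5, 5]]))    # 0.0
```

The algorithm evaluates this quantity for every candidate feature and splits on the one with the largest gain.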
________ is a modification of information gain that
reduces its bias toward highly branching features by taking
into account the number and size of branches when
choosing a feature. It does this by normalizing information gain by the ____________ of a split.
Gain Ratio
Intrinsic information
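A sketch of the normalization, assuming intrinsic information is computed as the entropy of the branch sizes themselves (helper names are ours):

```python
from math import log2

def entropy(counts):
    total = sum(counts)
    return sum(-c / total * log2(c / total) for c in counts if c > 0)

def information_gain(parent, children):
    n = sum(parent)
    return entropy(parent) - sum(sum(c) / n * entropy(c) for c in children)

def gain_ratio(parent, children):
    """Information gain normalized by the intrinsic information of the
    split, i.e. the entropy of the branch sizes themselves."""
    intrinsic = entropy([sum(child) for child in children])
    return information_gain(parent, children) / intrinsic

# A clean 2-way split and a 4-way split both gain 1 bit, but the 4-way
# split's extra branching is penalized by its larger intrinsic information:
print(gain_ratio([10, 10], [[10, 0], [0, 10]]))                # 1.0
print(gain_ratio([10, 10], [[5, 0], [5, 0], [0, 5], [0, 5]]))  # 0.5
```

Without this correction, a feature with many distinct values (e.g. an ID column) would look artificially attractive to plain information gain.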
Instead of entropy, some decision tree algorithms use the __________ to determine the optimal feature to split upon. _______ is a measure of statistical dispersion.
Gini impurity measure
Gini
Gini impurity ranges from ___ to ____ (the latter approached
for an infinite number of even partitions).
0 to 1
A split occurs at the _______ value of the Gini impurity.
lowest
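The bounds and the two-class worst case can be verified with a short sketch (the helper name `gini` is ours):

```python
def gini(counts):
    """Gini impurity: the chance that two samples drawn at random (with
    replacement) from the node carry different class labels."""
    total = sum(counts)
    return 1 - sum((c / total) ** 2 for c in counts)

print(gini([10, 0]))  # 0.0 -- a pure node
print(gini([5, 5]))   # 0.5 -- the worst case for two classes
# With k even partitions the impurity is 1 - 1/k, so it approaches
# (but never reaches) 1 as k grows:
print(round(gini([1] * 100), 6))  # 0.99
```

Because lower impurity is better, the algorithm prefers the candidate split whose (size-weighted) Gini impurity is lowest, mirroring the way information gain prefers the largest entropy reduction.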
In R, the __________ uses entropy as a measure of
impurity, while the __________ uses Gini.
C5.0 algorithm (C50 package)
CART algorithm (rpart package)
Decision trees have a
tendency to _____ the
training data.
overfit
To mitigate this, the size of
the tree is reduced so that
it generalizes better. This is known as ________.
Pruning
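As a toy illustration only (not any library's actual procedure; real pruners use validation error or a complexity penalty), one of the simplest reductions collapses a split whose branches all predict the same class:

```python
def prune(tree):
    """Collapse subtrees whose leaves all agree into a single leaf.
    Trees are nested dicts: {"leaf": label} or {"split_on": ..., "branches": {...}}."""
    if "leaf" in tree:
        return tree
    branches = {v: prune(sub) for v, sub in tree["branches"].items()}
    leaves = [b["leaf"] for b in branches.values() if "leaf" in b]
    if len(leaves) == len(branches) and len(set(leaves)) == 1:
        return {"leaf": leaves[0]}  # the split adds size but no information
    return {"split_on": tree["split_on"], "branches": branches}

# A split whose branches all predict "yes" can be replaced by one leaf:
redundant = {"split_on": "wind", "branches": {
    "weak": {"leaf": "yes"}, "strong": {"leaf": "yes"}}}
print(prune(redundant))  # {'leaf': 'yes'}
```

The smaller tree makes identical predictions on every input while being less tied to quirks of the training sample.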