Decision Trees Flashcards
True or false: decision trees are a non-parametric alternative to regression
True
How do decision trees work?
They split the predictor space into regions, then predict the average response of a region in the regression setting and the most common class in the classification setting.
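A minimal sketch of both settings, assuming scikit-learn and synthetic data (neither is named on the cards):

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor, DecisionTreeClassifier

rng = np.random.default_rng(0)
X = rng.uniform(0, 10, size=(200, 2))                # two predictors
y_reg = np.sin(X[:, 0]) + rng.normal(0, 0.1, 200)    # continuous response
y_clf = (X[:, 0] + X[:, 1] > 10).astype(int)         # binary response

# Regression tree: each leaf predicts the average response in its region.
reg = DecisionTreeRegressor(max_depth=3).fit(X, y_reg)

# Classification tree: each leaf predicts the most common class in its region.
clf = DecisionTreeClassifier(max_depth=3).fit(X, y_clf)

print(reg.predict(X[:3]), clf.predict(X[:3]))
```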
What algorithm is used to grow the trees and how does it work?
Recursive Binary Splitting
At each step it selects the predictor and cut-point whose binary split minimizes the MSE. The algorithm is greedy: it optimizes only the current split, not future ones. Splitting continues until every region contains fewer than a specified number of observations.
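A toy sketch of the greedy search for a single split, written from the description above; the function name best_split and the data layout are illustrative, not from the cards:

```python
import numpy as np

def best_split(X, y):
    """Greedy search for the single binary split that minimizes total squared error."""
    best = (None, None, np.inf)          # (predictor index, cut-point, SSE)
    for j in range(X.shape[1]):
        for s in np.unique(X[:, j]):
            left, right = y[X[:, j] < s], y[X[:, j] >= s]
            if len(left) == 0 or len(right) == 0:
                continue
            sse = ((left - left.mean()) ** 2).sum() + ((right - right.mean()) ** 2).sum()
            if sse < best[2]:
                best = (j, s, sse)
    return best

# Recursive growing would call best_split on each resulting region and stop
# once a region contains fewer than a specified minimum number of observations.
```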
Why do we prune the tree?
The tree produced by recursive binary splitting is usually too large: more splits mean more flexibility, lower bias and higher variance.
True or false: there is no optimal number of splits that minimizes MSE?
True
What are the two pruning methods?
1. Cost complexity pruning (see the sketch below)
2. Weakest link pruning
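A small sketch of cost complexity pruning, assuming scikit-learn (whose ccp_alpha parameter and cost_complexity_pruning_path method implement this idea) and synthetic data:

```python
from sklearn.datasets import make_regression
from sklearn.tree import DecisionTreeRegressor

X, y = make_regression(n_samples=300, n_features=4, noise=10.0, random_state=0)

# Grow a large tree, then examine the sequence of subtrees produced by
# cost complexity (weakest link) pruning as alpha increases.
tree = DecisionTreeRegressor(random_state=0)
path = tree.cost_complexity_pruning_path(X, y)
print(path.ccp_alphas[:5])        # candidate alpha values

# Refit with a chosen alpha to obtain the corresponding pruned subtree.
pruned = DecisionTreeRegressor(ccp_alpha=path.ccp_alphas[len(path.ccp_alphas) // 2],
                               random_state=0).fit(X, y)
```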
What is the tuning parameter (alpha)?
The cost charged to the tree per terminal node: a larger alpha penalizes subtrees with more terminal nodes.
How is the tuning parameter selected?
Cross-validation
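A sketch of that cross-validation step, again assuming scikit-learn and synthetic data: each candidate alpha from the pruning path is scored by 5-fold cross-validated MSE and the best one is kept.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeRegressor

X, y = make_regression(n_samples=300, n_features=4, noise=10.0, random_state=0)

# Candidate alphas come from the pruning path of a fully grown tree.
alphas = DecisionTreeRegressor(random_state=0).cost_complexity_pruning_path(X, y).ccp_alphas

# Pick the alpha whose pruned tree has the best cross-validated MSE.
scores = [cross_val_score(DecisionTreeRegressor(ccp_alpha=a, random_state=0),
                          X, y, cv=5, scoring="neg_mean_squared_error").mean()
          for a in alphas]
best_alpha = alphas[int(np.argmax(scores))]
print(best_alpha)
```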
In a classification tree, what measure is used instead of the MSE as the quantity to minimize?
The classification error rate
For tree growing, why can’t we use the classification error rate and what can we use instead?
The classification error rate is not sensitive enough to changes in node purity.
Gini index or Cross-entropy
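A small sketch of the two impurity measures, written from their standard definitions (the function names are illustrative):

```python
import numpy as np

def gini(p):
    """Gini index of a node: sum over classes of p_k * (1 - p_k)."""
    p = np.asarray(p, dtype=float)
    return float(np.sum(p * (1.0 - p)))

def cross_entropy(p):
    """Cross-entropy (deviance) of a node: -sum over classes of p_k * log(p_k)."""
    p = np.asarray(p, dtype=float)
    p = p[p > 0]                      # avoid log(0)
    return float(-np.sum(p * np.log(p)))

# A pure node scores 0; both measures grow as the class mix becomes more even.
print(gini([1.0, 0.0]), gini([0.5, 0.5]))                     # 0.0, 0.5
print(cross_entropy([1.0, 0.0]), cross_entropy([0.5, 0.5]))   # 0.0, ~0.693
```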
True or false: the Gini index is the variance of observations?
True: the Gini index measures the total variance across the K classes in a node, so it is small when the node is nearly pure.
In a classification tree, what measures are used for pruning the tree and for splitting the tree?
- Pruning: the classification error rate
- Splitting: the Gini index or cross-entropy
What are the advantages of decision trees over linear regression?
- Easier to explain
- Closer to the way human decisions are made
- Tree can be graphed, making it easier to interpret
- Easier to handle categorical predictors (linear regression requires dummy variables)
What are the decision tree’s shortcomings?
1. Do not predict as well as linear regression
2. Not robust (a small change in the input data can have a big effect on the tree)
What methods can be used to address the decision tree’s shortcomings?
Bagging, random forest and boosting
What is the effect of bagging, random forest and boosting on the variance of decision trees?
They lower the variance
Explain the bagging method.
Bagging (bootstrap aggregation) is an application of bootstrapping (see the sketch after this list):
- Draw B bootstrap samples of size n from the original observations.
- Grow a tree on each of the B samples.
- Average the predictions of the B trees (or take a majority vote for classification).
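A hand-rolled sketch of the three steps above, assuming scikit-learn trees and synthetic data; scikit-learn's BaggingRegressor packages the same procedure.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.tree import DecisionTreeRegressor

X, y = make_regression(n_samples=300, n_features=4, noise=10.0, random_state=0)
n, B = len(y), 100
rng = np.random.default_rng(0)

# Step 1 and 2: draw B bootstrap samples (size n, with replacement) and grow a tree on each.
trees = []
for _ in range(B):
    idx = rng.integers(0, n, size=n)            # bootstrap sample of size n
    trees.append(DecisionTreeRegressor().fit(X[idx], y[idx]))

# Step 3: average the B trees' predictions.
bagged_pred = np.mean([t.predict(X) for t in trees], axis=0)
```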
What is a bootstrap sample?
Simulating a bootstrap sample of size n means drawing n items from the initial sample with replacement.
What is the effect of bagging on a simple tree’s variance?
It divides the variance by B (the number of bootstrap samples), assuming the trees are independent; in practice bagged trees are correlated, so the reduction is smaller.
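Written out, this is the usual variance-of-an-average argument; the correlation caveat below is the standard refinement, not something stated on the cards.

```latex
% Assuming the B tree predictions are i.i.d. with variance \sigma^2:
\operatorname{Var}\!\left(\frac{1}{B}\sum_{b=1}^{B} \hat{f}_b(x)\right)
  = \frac{1}{B^2}\sum_{b=1}^{B}\operatorname{Var}\!\left(\hat{f}_b(x)\right)
  = \frac{\sigma^2}{B}.
% With pairwise correlation \rho between trees, the variance is instead
% \rho\,\sigma^2 + \frac{1-\rho}{B}\,\sigma^2, which motivates random forests.
```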
Is there a danger of overfitting by making B too large in bagging?
No.
What is out-of-bag (OOB) validation?
For n sufficiently large, each bootstrap sample leaves out about one-third of the initial observations. For each tree, the test MSE can be estimated on the out-of-bag portion of the sample, which eliminates the need for cross-validation.
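A manual sketch of OOB validation under the same assumptions as the bagging sketch (scikit-learn trees, synthetic data):

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.tree import DecisionTreeRegressor

X, y = make_regression(n_samples=300, n_features=4, noise=10.0, random_state=0)
n, B = len(y), 200
rng = np.random.default_rng(1)

oob_sum = np.zeros(n)      # running sum of OOB predictions per observation
oob_count = np.zeros(n)    # how many trees left each observation out of the bag

for _ in range(B):
    idx = rng.integers(0, n, size=n)                 # bootstrap sample
    oob = np.setdiff1d(np.arange(n), idx)            # observations left out (~1/3)
    tree = DecisionTreeRegressor().fit(X[idx], y[idx])
    oob_sum[oob] += tree.predict(X[oob])
    oob_count[oob] += 1

mask = oob_count > 0
oob_mse = np.mean((y[mask] - oob_sum[mask] / oob_count[mask]) ** 2)
print(oob_mse)   # test-error estimate without a separate validation set
```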
Explain the random forest method
1. Specify a positive integer m (typically smaller than the total number of predictors, k).
2. At each split, m predictors are selected at random and only those are considered for the split (see the sketch below).
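A minimal sketch assuming scikit-learn, where max_features plays the role of m:

```python
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor

X, y = make_regression(n_samples=300, n_features=8, noise=10.0, random_state=0)

# m = 3 of the k = 8 predictors are considered at each split.
rf = RandomForestRegressor(n_estimators=500, max_features=3, random_state=0).fit(X, y)

# Setting max_features equal to the total number of predictors reduces this to bagging.
bagging_equivalent = RandomForestRegressor(n_estimators=500, max_features=8,
                                           random_state=0).fit(X, y)
```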
Is there a danger of overfitting by making B too large in random forest?
No.
Why would we use random forest over bagging?
Bagged trees may be highly correlated, for example when one strong predictor dominates the top splits. Randomly selecting only m of the k predictors at each split has the effect of decorrelating the trees.
True or false: if m = k, random forest is reduced to bagging?
True.
Is there a danger of overfitting by making B too large in boosting?
Yes. Unlike bagging and random forests, boosting fits trees sequentially, each one to the residuals of the current model, so making B too large can overfit; B is usually chosen by cross-validation.
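A sketch of tuning B for boosting, assuming scikit-learn's GradientBoostingRegressor and a held-out validation set in place of full cross-validation:

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.model_selection import train_test_split

X, y = make_regression(n_samples=500, n_features=8, noise=10.0, random_state=0)
X_tr, X_val, y_tr, y_val = train_test_split(X, y, test_size=0.3, random_state=0)

# A boosted model can overfit as B (n_estimators) grows, so B is tuned on held-out data.
boost = GradientBoostingRegressor(n_estimators=2000, learning_rate=0.05,
                                  max_depth=2, random_state=0).fit(X_tr, y_tr)

# staged_predict yields predictions after each additional tree, so we can locate the best B.
val_mse = [np.mean((y_val - pred) ** 2) for pred in boost.staged_predict(X_val)]
best_B = int(np.argmin(val_mse)) + 1
print(best_B)
```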