Week 10 DSE Flashcards by Hash Loi

What are the pros and cons of decision trees?

pros:
- no need to standardize the data
- perform variable selection
- computationally efficient and scale up very well
- allow interpretation

cons:
- simple trees will likely lose out to other methods on prediction (unlike kNN)
- don’t naturally lead to continuous models
- do not handle categorical variables well when there are many categories
- big, deep trees can increase predictive performance, but at the cost of interpretability

What is discontinuous data?

A change in the data can change which variables the tree picks out as important.
The fitted tree is highly dependent on the training data.

What is root node and leaf node?

root: the first (top) node, where the first split is made
leaf: a terminal node, which gives the final decision

What is covariate space?

the space of the x variables (the covariates)

A regression tree ______________ the _____________ into a set of rectangles and then fits a _______________ in each one.

partitions (effectively splits)
covariate space
simple model (a constant)

What are the steps of building a tree?

CART
- form the tree using recursive binary partitioning of the data
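
A minimal sketch of recursive binary partitioning, assuming a single numeric x variable, squared-error loss, and a hypothetical stopping rule `min_leaf` (the smallest number of observations allowed in a leaf). The function names and the toy data are made up for illustration; this is not the course's or any library's implementation.

```python
def sse(ys):
    """Sum of squared errors around the sample mean."""
    m = sum(ys) / len(ys)
    return sum((v - m) ** 2 for v in ys)


def best_split(xs, ys, min_leaf):
    """Return the threshold c of the best rule x < c, or None if no split is allowed."""
    best = None
    values = sorted(set(xs))
    # Candidate thresholds are midpoints between neighbouring distinct x values.
    for lo, hi in zip(values, values[1:]):
        c = (lo + hi) / 2
        left = [y for x, y in zip(xs, ys) if x < c]
        right = [y for x, y in zip(xs, ys) if x >= c]
        if len(left) < min_leaf or len(right) < min_leaf:
            continue
        loss = sse(left) + sse(right)
        if best is None or loss < best[0]:
            best = (loss, c)
    return None if best is None else best[1]


def grow(xs, ys, min_leaf=2):
    """Recursively partition the data; each leaf stores the sample mean of its region."""
    c = best_split(xs, ys, min_leaf)
    if c is None:
        return {"leaf": True, "value": sum(ys) / len(ys)}
    left = [(x, y) for x, y in zip(xs, ys) if x < c]
    right = [(x, y) for x, y in zip(xs, ys) if x >= c]
    return {
        "leaf": False,
        "threshold": c,
        "left": grow([x for x, _ in left], [y for _, y in left], min_leaf),
        "right": grow([x for x, _ in right], [y for _, y in right], min_leaf),
    }


def count_leaves(tree):
    if tree["leaf"]:
        return 1
    return count_leaves(tree["left"]) + count_leaves(tree["right"])


# Toy data, invented for illustration.
x = [1.0, 2.0, 3.0, 4.0, 10.0, 11.0, 12.0, 13.0]
y = [5.0, 5.0, 6.0, 6.0, 20.0, 20.0, 21.0, 21.0]
tree = grow(x, y, min_leaf=2)
print(tree["threshold"], count_leaves(tree))
```

The first split separates the low and high groups of y; each side is then split again until the leaves would become smaller than `min_leaf`.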

To make a prediction, just find the ___________ to which the new observation belongs and equate the _________ to the __________ of that region.

interval
forecast
sample mean
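
The prediction rule can be sketched as a lookup, assuming one x variable whose range has already been cut into intervals. The cut points `cuts` and the training data below are hypothetical values chosen only to illustrate the idea.

```python
from bisect import bisect_right

# Hypothetical cut points: intervals are x < 3, 3 <= x < 7, and x >= 7.
cuts = [3.0, 7.0]

# Toy training data, invented for illustration.
train_x = [1.0, 2.0, 4.0, 5.0, 6.0, 8.0, 9.0]
train_y = [10.0, 12.0, 20.0, 22.0, 24.0, 40.0, 44.0]


def region_of(x):
    """Index of the interval that x falls into."""
    return bisect_right(cuts, x)


# Sample mean of the training responses in each interval.
sums = [0.0] * (len(cuts) + 1)
counts = [0] * (len(cuts) + 1)
for xi, yi in zip(train_x, train_y):
    r = region_of(xi)
    sums[r] += yi
    counts[r] += 1
region_means = [s / n for s, n in zip(sums, counts)]


def predict(x):
    """The forecast is the sample mean of the interval containing x."""
    return region_means[region_of(x)]


print(predict(4.5))
```

Every new observation inside the same interval gets the same forecast, which is why the fitted function is a step function rather than a continuous one.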

What is the difference between the splits if xi is numeric vs categorical?

For numeric xi, the rule is based on the threshold xi < c.
For categorical xi , the rule lists the set of categories sent left.
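
The two kinds of rule can be written as one small function. This is only a sketch: the rule encoding (a tuple of kind and specification) and the example values are assumptions made for illustration.

```python
def goes_left(value, rule):
    """Decide whether an observation is sent to the left child.

    rule is ("numeric", c) for the threshold rule value < c,
    or ("categorical", categories) for the set of categories sent left.
    """
    kind, spec = rule
    if kind == "numeric":
        return value < spec
    if kind == "categorical":
        return value in spec
    raise ValueError("unknown rule kind: " + str(kind))


# Hypothetical rules, invented for illustration.
numeric_rule = ("numeric", 30.0)
categorical_rule = ("categorical", {"red", "blue"})

print(goes_left(25.0, numeric_rule))
print(goes_left("green", categorical_rule))
```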

What is used to measure complexity of decision trees?

number of leaves/terminal nodes (T)

What is usually the loss function of decision trees?

MSE

What does alpha stand for in the tree choice minimisation problem?

the penalty parameter on tree complexity
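
For reference, the minimisation problem is usually written in the following form for a regression tree with squared-error loss. The notation is an assumption, since the cards do not spell it out: T_0 is the big initial tree, |T| is the number of leaves of a subtree T, and R_m is the region of leaf m.

```latex
\hat{T}_\alpha
  = \arg\min_{T \subseteq T_0}
    \; \sum_{m=1}^{|T|} \sum_{i:\, x_i \in R_m}
       \left( y_i - \bar{y}_{R_m} \right)^2
    \; + \; \alpha \, |T|
```

The first term is the training loss and the second is the complexity penalty, so a larger alpha favours trees with fewer leaves.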

What happens when alpha is 0?

There is no penalty on complexity, so we would choose the tree that perfectly fits the training data, resulting in overfitting.

What is the algorithm for choosing tree size?

cost-complexity pruning (weakest link pruning)

What are the steps for the tree minimisation problem?

Step 1: Grow big. Use recursive binary splitting with a stopping condition.
Rationale: a seemingly worthless split high in the tree may play an important role lower down.

Step 2: Prune back. Recursively prune back the big tree: examine every pair of leaves and eliminate the pair whose removal results in the SMALLEST increase in loss. This gives a sequence of subtrees of the initial big tree that contains T hat.
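
Step 2 can be sketched as follows, taking the card's description literally: at each round, collapse the pair of sibling leaves whose removal increases the squared-error loss the least. This is a simplified illustration, not a full cost-complexity pruning implementation; the tree encoding (each node keeps the training responses `ys` in its region) and the hand-built tree are assumptions.

```python
def sse(ys):
    """Sum of squared errors around the sample mean."""
    m = sum(ys) / len(ys)
    return sum((v - m) ** 2 for v in ys)


def leaf(ys):
    return {"ys": ys, "left": None, "right": None}


def node(left, right):
    return {"ys": left["ys"] + right["ys"], "left": left, "right": right}


def is_leaf(t):
    return t["left"] is None


def leaves(t):
    return [t] if is_leaf(t) else leaves(t["left"]) + leaves(t["right"])


def total_loss(t):
    return sum(sse(l["ys"]) for l in leaves(t))


def prunable(t):
    """Internal nodes whose two children are both leaves."""
    if is_leaf(t):
        return []
    if is_leaf(t["left"]) and is_leaf(t["right"]):
        return [t]
    return prunable(t["left"]) + prunable(t["right"])


def increase(n):
    """Increase in loss if the two leaves under n are merged."""
    return sse(n["ys"]) - sse(n["left"]["ys"]) - sse(n["right"]["ys"])


def prune_sequence(tree):
    """Prune in place; return (number of leaves, loss) for each subtree in the sequence."""
    seq = [(len(leaves(tree)), total_loss(tree))]
    while not is_leaf(tree):
        weakest = min(prunable(tree), key=increase)
        weakest["left"] = weakest["right"] = None  # merge the pair of leaves
        seq.append((len(leaves(tree)), total_loss(tree)))
    return seq


# Hand-built big tree with four leaves, invented for illustration.
big = node(
    node(leaf([5.0, 5.0]), leaf([6.0, 6.0])),
    node(leaf([20.0, 20.0]), leaf([30.0, 30.0])),
)
seq = prune_sequence(big)
print(seq)
```

The pair with responses 5 and 6 is merged first because merging it costs far less than merging the pair with responses 20 and 30. The resulting sequence of subtrees is what the penalty parameter then chooses from.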

What does smaller cp mean?

It means a smaller α: a small penalty on complexity and hence a bigger tree.
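
A small sketch of how the penalty picks a tree size, assuming a pruning sequence is already available. The (number of leaves, training loss) pairs are hypothetical numbers chosen for illustration, and `choose_size` is a made-up helper name.

```python
# Hypothetical pruning sequence: (number of leaves, training loss).
subtrees = [(1, 860.0), (2, 100.0), (3, 10.0), (4, 0.0)]


def choose_size(alpha):
    """Number of leaves of the subtree minimising loss + alpha * (number of leaves)."""
    return min(subtrees, key=lambda t: t[1] + alpha * t[0])[0]


for alpha in [0, 20, 200, 1000]:
    print(alpha, choose_size(alpha))
```

With alpha equal to 0 the biggest tree wins because only the training loss counts; as alpha grows, the chosen tree shrinks.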

_________ has the CV-min cp.

Middle tree

Can decision trees help to improve a linear regression model?