Lecture 8 Tree Flashcards

Question 1

Q

Tree

Answer

A

Non- linear
Doesn’t care about scaling of distribution
Interpretable

Question 2

Q

Building decision tress

Answer

A

Individual tree is built on a subset of data

Question 3

Q

Criteria ( classification

Answer

A

Gino index
Cross entropy

Question 4

Q

Regression tree

Answer

A

MAE, MSE, predict mean, without regularization / pruning- each lead often contains a single point to be pure

Question 5

Q

Parameter tuning

Answer

A

Pre- pruning and post

Depth, leaf nodes, samples split

Question 6

Q

Drawback

Answer

A

Extrapolation— only based on current range of the training — nearest leaf node — no ability to generate new response

Question 7

Q

Instability

Answer

A

Split data different may get different root nood, unstable feature importance , may take one or multiple from a group of correlated features

Question 8

Q

Splitting method

Answer

A

Linear models used if extrapolation is needed

Question 9

Q

Ensemble models

Answer

A

Method that combine multiple machine learning method to create more powerful method

Question 10

Q

Poor man’s ensemble

Answer

A

More models —> better if they are not correlated—> average the result

Question 11

Q

Bagging

Answer

A

Generic way to build slightly different models

Question 12

Q

Bias and variance

Answer

A

Generalization depends on strength of individual classifiers and inversely on their correlation

Strength: ability to accurately predict the target variable

High strength— low bias

Uncorrelating—> help

Lecture 8 Tree Flashcards

(12 cards)