2. Decision Tree Flashcards

1
Q

What is the two different methods of decision tree?

A

Classification and regression trees

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Which type of learning can you use with the methods?

A

Both methods can take supervised and un-supervised learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

When do you use classification trees?

A

When target variable is categorical, splitting based on similar categories and final prediction is a class label

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How do you evaluate a classification tree?

A

Classification trees are evaluated through purity and impurity measures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

When do you use regression tree?

A

Regression tree is used when the target variable is continous, splitting data into segments of similar value and final prediction being numerical value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do you evaluate regression model?

A

A regression model is evaluated through computing average loss per observation (either L1, L2) or through MSE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What output does classification produce?

A

Classification tree produce discrete output, learning a function that maps a data object to a discrete class

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does classification tree classify?

A

The relation between attributes and class

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What can a classifier help predict?

A

It can help predict the class of a new data object

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How can we explain the classification tree?

A

It is represented by a rooted tree, where each node represents a partition of the input space

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does branches and leafs represent in classification tree?

A

Branches = attributes
Leafs = decisions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What sentence can explain classification tree?

A

Ask a series of questions until a conclusion is reached

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the foundation of the impurity measure I(r) or I(vk)?

A

Impurity measure is ensuring the best split for each split for each node. How impure is a set of data points is with respect to their class labels.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a good impurity score for classification problem?

A

Zero is all data points belong to a single class
High values of impurity indicate even mix of classes and is bad

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

In classification what is the three different impurity measures?

A

Gini index, entropy and misclassification error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

In classification, when do use each of the three different measures?

A

Gini index - focus on higher performance (accuracy)
Entropy - information gain, theoretical measure less sensitive to small changes in probability
ClassError - focus on fraction of misclassified instances, incorrect predictions

17
Q

In classification which method is used to see how good the model is?

A

Purity gain

18
Q

In classification how does the purity gain work?

A

Also called impurity reduction, evaluating how well a particular feature or attribute splits the data in a decision tree

19
Q

In regression, which method is used to evaluate the model?

A

Absolute error, average loss per observation

20
Q

In regression, there is two different error calculations?

A

Mean absolute error (MAE)
Mean squared error (MSE)

21
Q

In regression when should you use MAE and MSE?

A

MAE - penalizes all errors linearly and is more robust to outliers, used when data is more alike
MSE - penalizes larger errors more heavily, used for larger deviations in the data

22
Q

What different Euclidean loss are there and where are they used?

A

L1 - MAE, less sensitive to outliers
L2 - MSE, average of the squared difference between predicted values and the actual
L ∞ norm - measure the maximum error, focus on worst case error