Decision Trees Flashcards

Question 1

Q

What role does Entropy play?

Answer

A

Controls how the DT splits the data. It’s the measure of impurity in a bunch of examples. Impurity being how uniform are the classes in the example set.

Question 2

Q

What is the formula for Entropy?

Answer

A

Entropy = Sum(i) { p(i) * log2(p(i)) }, where p(i) = fraction of examples in class i, and sum(i) sums over all classes.

Question 3

Q

What is the entropy of all examples being same class?

Question 4

Q

What is information gain?

Answer

A

entropy(parent) - [weighted average]*entropy(children)

Question 5

Q

How does the decision tree utilize information gain?

Answer

A

It maximizes information gain to determine the splits.

Question 6

Q

Give intuitive explanation for how to remember bias

Answer

A

I can train the model with all sorts of data but it’s bias towards it’s original behavior and doesn’t change

Question 7

Q

Give intuitive explanation for how to remember variance

Answer

A

It cares so very much about the data it’s being trained on and will change it’s behavior to match it’s behavior to whatever data it sees

Question 8

Q

What are DT strengths and weaknesses?

Answer

A

Strengths: Easy to use, graphically interpretable (knowledge representation), can build bigger classifiers from them with ensemble methods
Weaknesses: Prone to overfitting especially with lots of features,

Question 9

Q

Give an example of remembering xor logic gate

Answer

A

When someone asks do you want to go to the movie or bowling. usually they mean xor as in pick one or the other but not both and not neither

Question 10

Q

Decision tree space - compare xor and or

Answer

A

xor - exponential space for nodes

or - linear as you add nodes

Question 11

Q

What is Inductive Bias

Answer

A

The inductive bias of a learning algorithm is the set of assumptions that the learner uses to predict outputs given inputs that it has not encountered. A classical example of an inductive bias is Occam’s Razor, assuming that the simplest consistent hypothesis about the target function is actually the best.

Question 12

Q

What is Preference Bias?

Answer

A

A preference bias is when a learning algorithm incompletely searches a complete hypothesis
space. It chooses which part of the hypothesis space to search. An example is decision trees

Question 13

Q

What is Representation Bias?

Answer

A

A representation bias completely searches and incomplete hypothesis space. It searches the
whole space, but it is a small incomplete space.

Decision Trees Flashcards

(13 cards)