Random Forest Flashcards
Inductive Learning
Also known as discovery learning: a process where the learner discovers rules by observing examples. This differs from deductive learning, where the learner is given rules and then applies them.
Decision Tree Structure
Consists of a root node (where the tree starts)
Branches (splits with children)
Leaf nodes (end of the tree; represent possible outcomes)
Internal nodes (where a parent and its children meet)
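A minimal sketch of this structure in Python (the `Node` class and the "outlook" example are illustrative, not from any particular library):

```python
from dataclasses import dataclass, field
from typing import Dict, Optional

@dataclass
class Node:
    attribute: Optional[str] = None   # attribute tested at this node (None on leaves)
    children: Dict[str, "Node"] = field(default_factory=dict)  # branch value -> child
    label: Optional[str] = None       # predicted class (set only on leaf nodes)

# Root tests "outlook"; each branch value leads to a child node or a leaf.
root = Node(attribute="outlook", children={
    "sunny": Node(label="no"),        # leaf node
    "overcast": Node(label="yes"),    # leaf node
})
```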
Experience Table
A labeled data set with your target variable and all of the features for which data was collected
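As a hedged illustration, a tiny experience table with made-up values ("outlook" and "humidity" features plus a "play" target), re-used by the sketches on later cards:

```python
# Each row holds the observed feature values plus the target ("play").
# The values are invented for illustration only.
experience_table = [
    {"outlook": "sunny",    "humidity": "high",   "play": "no"},
    {"outlook": "sunny",    "humidity": "normal", "play": "no"},
    {"outlook": "overcast", "humidity": "high",   "play": "yes"},
    {"outlook": "rain",     "humidity": "normal", "play": "yes"},
]
```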
What kind of algorithm will we use for our decision trees?
ID3
Decision Tree Algorithm
(1) Choose the best attribute to split the remaining instances; that attribute becomes the root
(2) Repeat the process with the children
(3) Stop when all instances have the same target attribute value, there are no more attributes, or there are no more instances (see the sketch after this list)
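A minimal, self-contained sketch of these steps in Python (ID3-style; the nested-dict tree representation and helper names are illustrative choices):

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(rows, attribute, target):
    """Entropy reduction achieved by splitting `rows` on `attribute`."""
    before = entropy([r[target] for r in rows])
    after = 0.0
    for value in {r[attribute] for r in rows}:
        subset = [r[target] for r in rows if r[attribute] == value]
        after += len(subset) / len(rows) * entropy(subset)
    return before - after

def id3(rows, attributes, target):
    """Grow a tree as nested dicts: {attribute: {value: subtree_or_label}}."""
    labels = [r[target] for r in rows]
    if len(set(labels)) == 1:        # stop: all instances share one target value
        return labels[0]
    if not attributes:               # stop: no attributes left -> majority class
        return Counter(labels).most_common(1)[0][0]
    # (1) choose the attribute with the highest information gain
    best = max(attributes, key=lambda a: information_gain(rows, a, target))
    remaining = [a for a in attributes if a != best]
    # (2) repeat the process on each child's subset of instances
    return {best: {value: id3([r for r in rows if r[best] == value], remaining, target)
                   for value in {r[best] for r in rows}}}

# With the made-up experience_table from the earlier card:
# print(id3(experience_table, ["outlook", "humidity"], "play"))
# -> {'outlook': {'sunny': 'no', 'overcast': 'yes', 'rain': 'yes'}}
```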
How do you identify the best attribute to become the root of your decision tree?
Information gain
What makes a good decision tree?
It must be small AND classify accurately
Small trees are less susceptible to overfitting and are easier to understand
Information Gain and Impurity Levels
{xxxxxyxxxxyxxx} not pure
{xxxxxxxxxxxxxx} as pure as it gets
{xxxxxxxyyyyyyyy} least pure
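To make these purity levels concrete, Shannon entropy scores each set (0 bits = perfectly pure). The helper is restated here so this card runs on its own, and each character is treated as a class label:

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy in bits: 0 for a pure set, ~1 for a 50/50 binary split."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

print(round(entropy("xxxxxyxxxxyxxx"), 2))   # 0.59 -> not pure
print(round(entropy("xxxxxxxxxxxxxx"), 2))   # 0.0  -> as pure as it gets
print(round(entropy("xxxxxxxyyyyyyyy"), 2))  # 1.0  -> least pure (near 50/50)
```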
Information Gain
We want to determine which attribute in a given set of training feature vectors is most useful for discriminating between classes to be learned.
Information gain tells us how important a given attribute of the feature vectors is
We use it to decide the order of attributes in the nodes of a decision tree
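As a quick check, re-using the `information_gain` helper and the made-up `experience_table` from the earlier cards:

```python
# "outlook" separates the classes perfectly, so it wins the root split;
# "humidity" carries no information about the target in that tiny table.
print(information_gain(experience_table, "outlook", "play"))   # 1.0
print(information_gain(experience_table, "humidity", "play"))  # 0.0
```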
Decision Tree CONS
They suffer from errors propagating throughout the tree (this becomes more of an issue as the number of classes increases)
Error Propagation
Since decision trees work by a series of local decisions, what happens when one of these local decisions is wrong? Everything beyond that point is incorrect, and we may never return to the right path
Noisy data in decision trees
When two instances have the same attribute/value pairs but different classifications
Some values of the attributes are incorrect because of errors in the data acquisition process or the preprocessing phase
Some attributes may be irrelevant to the decision-making process (e.g., the color of the die being rolled)
Overfitting in Decision Trees
Irrelevant attributes can VERY EASILY lead to overfitting
Too little training data can also lead to overfitting
How to avoid overfitting in Decision Trees
Stop growing the tree when the data split is not statistically significant
Acquire more training data
Remove irrelevant attributes
Grow a full tree, then post-prune (see the sketch after this list)
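If you use scikit-learn, these ideas map onto `DecisionTreeClassifier` hyperparameters. A sketch, assuming scikit-learn 0.22+ (for `ccp_alpha`) and its bundled iris data as a stand-in for real training data:

```python
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# Pre-pruning: stop growing when a split would cover too few instances
# (a rough stand-in for "the split is not statistically significant").
pre_pruned = DecisionTreeClassifier(min_samples_split=20, min_samples_leaf=10).fit(X, y)

# Post-pruning: grow a full tree, then prune it back with minimal
# cost-complexity pruning (larger ccp_alpha -> smaller tree).
post_pruned = DecisionTreeClassifier(ccp_alpha=0.02).fit(X, y)

print(pre_pruned.get_n_leaves(), post_pruned.get_n_leaves())
```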
How to select the best decision tree
Measure performance over training data
Measure performance over separate validation sets
Add a complexity penalty to the performance measure (see the sketch after this list)
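A sketch combining the last two ideas: score candidate trees on a held-out validation set, minus an arbitrary penalty of 0.001 per leaf (the penalty weight is an assumption, not a standard value):

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.3, random_state=0)

best_score, best_tree = float("-inf"), None
for alpha in (0.0, 0.01, 0.02, 0.05):
    tree = DecisionTreeClassifier(ccp_alpha=alpha, random_state=0).fit(X_train, y_train)
    # Accuracy on the validation set, minus a small penalty per leaf so
    # that, between equally accurate trees, the smaller one wins.
    score = tree.score(X_val, y_val) - 0.001 * tree.get_n_leaves()
    if score > best_score:
        best_score, best_tree = score, tree

print(best_tree.get_n_leaves(), round(best_score, 3))
```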