Chapter 4: Information-based Learning Flashcards
How do you build predictive machine learning models?
Use the most informative features
In this context what is an informative feature?
A descriptive feature whose values split the instances in the dataset into the most homogeneous sets with respect to the target feature value
How do you calculate the average number of questions you have to ask per game?
Add the number of questions asked along the path to each person and divide by the number of people
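A minimal sketch of this calculation in Python, assuming a hypothetical guessing game where we already know how many questions it takes to reach each person (the names and counts are illustrative, not from the chapter):

```python
# Hypothetical question counts: how many yes/no questions it takes to
# identify each person in a small guessing game.
questions_per_person = {"Alice": 1, "Bob": 2, "Carol": 3, "Dan": 3}

# Average questions per game, assuming each person is equally likely to be
# the answer: sum the question counts and divide by the number of people.
average_questions = sum(questions_per_person.values()) / len(questions_per_person)
print(average_questions)  # (1 + 2 + 3 + 3) / 4 = 2.25
```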
What do we consider the effects of different answers in terms of?
- How the domain is split up after the answer is received
- The likelihood of each of the answers
What does a decision tree consist of?
- Root node (starting node)
- Interior nodes
- Leaf nodes (terminating nodes)
What is some important information about non-leaf nodes and leaf nodes?
- Each non-leaf node specifies a test to be carried out on one of the query’s descriptive features
- Each leaf node contains a class label that specifies a predicted classification for the query
What is the process of using a decision tree to make a prediction for a query instance?
- Start by testing the value of the descriptive feature at the root node of the tree
- The result of the test determines which of the root node’s children the process should descend to
- The two steps of testing the descriptive feature and descending a level are repeated until the process comes to a leaf node at which a prediction can be made
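A minimal Python sketch of this descent; the nested-dict tree structure and the feature names are assumptions for illustration, not the book's representation:

```python
def predict(node, query):
    """Descend from the root node until a leaf is reached, then return its label."""
    # Leaf node: it holds a class label, which is the prediction for the query.
    if "label" in node:
        return node["label"]
    # Non-leaf node: test the descriptive feature it specifies and descend to
    # the child corresponding to the query's value for that feature.
    value = query[node["feature"]]
    return predict(node["children"][value], query)

# Hypothetical tree: the root tests "outlook"; its children are leaves or further tests.
tree = {
    "feature": "outlook",
    "children": {
        "sunny": {"label": "no"},
        "overcast": {"label": "yes"},
        "rainy": {
            "feature": "windy",
            "children": {True: {"label": "no"}, False: {"label": "yes"}},
        },
    },
}

print(predict(tree, {"outlook": "rainy", "windy": False}))  # -> yes
```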
What is the preference for decision trees?
Shallower trees
How do we make shallow trees?
- Testing the most informative features early in the tree
- We do that using ENTROPY, a computational measure of the impurity of a set
What is Shannon’s entropy model?
- It defines a computational measure of the impurity of the elements of a set
How is entropy related to the probability of an outcome?
High probability -> Low entropy
Low probability -> High entropy
How do we map probability to entropy value?
Take the log of the probability and multiply it by -1
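In symbols, a sketch of that mapping (the notation I(x) for the information of an outcome x is an assumption; base 2 matches the later card on the base of the calculation):

```latex
% Information (surprise) of a single outcome x with probability P(x).
% Base-2 logs give a result in bits.
I(x) = -\log_{2}\big(P(x)\big)
% Example: P(x) = 0.5 gives 1 bit; P(x) = 1 gives 0 bits.
```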
What is Shannon’s entropy model?
- A weighted sum of the logs of the probabilities of each of the possible outcomes when we make a random selection from a set
- It is the cornerstone of modern information theory and is an excellent measure of the impurity (heterogeneity) of a set
What are the weights used in the sum?
The weights used in the sum are the probabilities of the outcomes themselves, so that outcomes with high probabilities contribute more to the overall entropy of a set than outcomes with low probabilities
Why is there a minus sign at the beginning of the equation?
It is added to convert the negative numbers returned by the log function (probabilities are at most 1, so their logs are zero or negative) to positive ones
What is the base of our calculation?
We always use base 2 so that entropy is calculated in bits
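Putting the last few cards together, a sketch of the full entropy formula in common notation (H, t for the target feature, D for the dataset, and levels(t) for its possible values are notational assumptions not defined in these cards):

```latex
% Shannon's entropy of dataset D with respect to target feature t:
% a sum over the possible target levels of each level's probability
% times its base-2 log, with a leading minus sign to make the result positive.
H(t, D) = - \sum_{l \in \mathrm{levels}(t)} P(t = l) \times \log_{2}\big(P(t = l)\big)
```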
What is the relationship between a measure of heterogeneity of a set and predictive analytics?
If we can construct a sequence of tests that splits the training data into pure sets with respect to the target feature values, then we can label a query by applying the same sequence of tests to it and labeling it with the target feature value of the instances in the set it ends up in
What is our intuition for information gain?
Our intuition is that the ideal discriminatory feature will partition the data into pure subsets where all the instances in each subset have the same classification
What is the information gain of a descriptive feature?
It is a measure of the reduction in the overall entropy of a prediction task achieved by testing on that feature
What is the first step of the three-step process for computing information gain?
- Compute the entropy of the original dataset with respect to the target feature.
- This gives a measure of how much information is required to organize the data into pure sets
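A minimal Python sketch of this first step, computing the entropy of a dataset with respect to its target feature from the column of target values (the function name and example values are illustrative assumptions):

```python
from collections import Counter
from math import log2

def entropy(target_values):
    """Entropy of a dataset with respect to its target feature,
    computed from the list of target values (base 2, so in bits)."""
    counts = Counter(target_values)
    total = len(target_values)
    return -sum((c / total) * log2(c / total) for c in counts.values())

# A perfectly mixed set has maximal entropy (1 bit for two equally likely levels);
# a pure set has entropy 0.
print(entropy(["spam", "ham", "spam", "ham"]))  # 1.0
print(entropy(["spam", "spam", "spam"]))        # -0.0 (pure set)
```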