Classification: Basic Concepts Flashcards
What is classification?
model or classifier is constructed to predict class (categorical) labels
What is a numeric prediction?
model constructed predicts a continuous-valued function, or ordered value, as opposed to a class label
What is regression analysis?
statistical methodology that is most often used for numeric prediction
What are the two major types of prediction problems?
Classification and numeric predictions
What are the two steps in data classification?
learning step and classification step
what is the learning step
The training phase where classification algorithm builds teh classifier by analyzing or learning form a training set with the associated class labels.
What is classification step?
model used to predict class label for a given data
What is the accuracy of a cllassifier?
In a given test set, is the percentage of test set tuples that are correctly classified by the classifier.
What is desicion tree induction?
learning of decision trees from class-labeled training tuples
What is a desicion tree?
flowchart-like tree structure
What does each internal node (non leaf node) denotes?
a test on an attribute
What does each branch in a desicion tree represent?
outcome of the test
What does each leaf node represent?
terminal node holds the class label
How are decision trees used for classification?”
Given a tuple, X, for which the associated class label is unknown, the attribute values of the tuple are tested against the decision tree. A path is traced from the root to a leaf node, which holds the class prediction for that tuple
Why do we do attribute selection measures?
used to select the attribute that bests partitions the tuples into distinct classes.