Rule-Based Classifiers Flashcards
Rule-based classifier
A machine learning algorithm that makes predictions based on a set of explicitly defined rules, which are based on the relationships between the features and the target classes in the training data.
Coverage of a rule
Fraction of records that satisfy the antecedent of a rule
Coverage (r) = |A| / |D|
|A| - The number of instances covered by the rules.
|D| - The total number of instances in the dataset.
Accuracy of a rule
Fraction of records that satisfy the antecedent that also satisfy the consequent of a rule.
Accuracy (r) = |A n y| / |A|
3 Type of Rule
Mutually exclusive rules - Every record is covered by at most one rule
Exhaustive rules - Each record is covered by at least one rule
Default rule - Not trigger any rules
2 Type of Rule-Set
Ordered Rule Set - Rules ranked in decreasing order according to their priority
Rule-based Ordering Scheme - Ranked based on their quality.
Class-based Ordering Scheme - Shorter length, higher priority
Unordered Rule Set
Hard Voting (Majority Voting) - The highest number of votes.
Soft Voting (Weighted Voting) - Vote weighted by the rule’s accuracy..
2 Rule Extraction Methods
Direct method - Extract rules directly from data
Indirect method - Extract rules from other classification models; E.g. Decision Tree
4 Step of Direct Method - Sequential Covering
Start from an empty rule
Grow a rule using the Learn-One-Rule function
Remove training records covered by the rule
Repeat Step (2) and (3) until stopping criterion is met
5 Aspects of Sequential Covering
Rule Growing
Instance Elimination
Rule Evaluation
Stopping Criterion
Rule Pruning
3 Metric of Rule Evaluation
Accuracy = nc / n
Laplace = (nc + 1) / (n + k)
M-estimate = (nc + kp) / (n + k)