Classification Flashcards

1
Q

Classification

A

The process of categorising data into predefined classes based on their attributes, characteristics, or features.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

2 Approach of Building Model

A

Induction - Learn model
Deduction - Apply model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

4 Types of Classification

A

Binary Classifier
Multiclass Classifier
Multilabel Classifier
Multioutput Classifier (Multioutput–multiclass classification)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Class vs Label

A

Class: A group or category of data points with similar characteristics or attributes.
Label: A name or identifier assigned to a particular data point that indicates the class to which it belongs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Batch Learning

A

Training the model on the entire dataset at once; No update

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

2 Design Issues of Decision Tree Induction

A

Splitting criterion: Attribute test condition to divide records into smaller subsets
Stopping criterion: All records belong to the same class, Early termination

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

1 Approach Used in Splitting Criterion

A

Greedy approach - Nodes with purer class distribution are preferred

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

3 Step to Find the Best Split

A

Compute impurity measure (P) before splitting
Compute impurity measure (M) after splitting
Choose the attribute test condition that produces the highest gain.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

4 Advantage of Decision Tree Based Classification

A

Relatively inexpensive to construct
Easy to interpret for small-sized trees
Fast at classifying unknown records
Versatile

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

2 Disadvantage of Decision Tree Based Classification

A

Each decision boundary involves only a single attribute
Instability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

3 Measures of Node Impurity

A

Gini Index - min. = 0; max. = 0.5
Entropy - min. = 0; max. = 1
Misclassification error - min. = 0; max. = 0.5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly