Lecture 3 - Multi-class classification and regression Flashcards
Give an example of multi-class classification
- Disease type diagnosis
- topic classification
what are the two approaches to turn binary classifier into multi-class
- One versus rest
- One versus one
What is one versus rest
Each classifier distinguishes between one specific class and all other classes combined. The class with the highest confidence score from its respective classifier is chosen as the final prediction.
what is the process of learning and inference one-versus-rest
Learning: traink or k-1 seperate classifiers where k is the number of classes
Inference: use all the and form a code word based on the output of the classidier. Next compare the code word against all the rows and dinf the cnearest row in the code matrix.
what is one-versus-one
strategy for multi-class classification in machine learning where a separate binary classifier is trained for every possible pair of classes.
For n classes, this results is n(n−1)/2 classifiers for symmetric, and n(n-1) for asymetric.
who does this apply to?
one versus one
what is the process of training and inference of one-versus-one
Training: traiin seperate classifiers for each pair of classes
Inference: use all the classifications for a code word based on the output of the classifier. Next, compare the code word against all rows and find the nearest row in the code matrix. Take a voting scheme when distances are not unique.
How to get the accuracy in a confuzion matrix of a three-class confuzion matrix
add up all the True Positives divided by the total
How to get the precision in a confuzion matrix of a three-class confuzion matrix
get the precision for each class, then multiply it by the distribution with the total, then add all the three weighted precisions together.
how can we know how good a classifier can be?
- macro-average
- micro-average
what is macro-average
macro-averate will compute the metric independently for each class and then take the average.
what is micro-average
micro-average will aggregate the contributions of all classes to compute the average metric.
how should AUC curves be used for multi-class classifiers
The average AUC over binary classification tasks, eigher in a one-versus-rest of one-versus-one.
what does ROC stand for
Receiver operating characteristics
What is regression loss
Regression models are evaluated by applying a loss function to the residuals. f(x)- ^f(x)