Intro To Classification Flashcards
What is classification?
The prediction of objects by class
What is binary classification?
The model is making a prediction between two classes, example dog or cat and yes or no
What is multiclass classification?
A model prediction between multiple classes, example: meat, dairy, fruit or vegetables
What is balanced or unbalanced classification?
Evaluating how balanced the classes are such as both sides have equal amount if dogs and cats. The balance of dataset will help us interpret of classification
What is decision trees for classification?
Works similar to Regression except final prediction is not an average but the class itself, also called classification trees
What is false positives? (Type 1 errors)
Also known as false positives is a common error type, for example phrase like want to find employees but avoid bad hires(false positives) even turning away qualified people(false negatives)
What is False negatives? (Type 2 errors)
Is another common error Type where we predict that something is negative when it is in fact, positive. Example a false negative for a covid test but in reality the person does have covid
What is a confusion matrix?
A table that is often used to describe the performance of a classification model (or “classifier”) on a set of test data for which the true values are known.
What is accuracy?
Accuracy is the most intuitive metric, in other worlds accuracy is correct predictions our model made out of the total number of precitions
What is Recall?
When we want to reduce the number of false negatives, we want to improve recall
What is Precision?
When we want to reduce the numbers of the false positives, we want to improve precision
What is class probabilities?
Calculates the probabilities of each class and returns the most likely
What is Decision Thresholds?
When working with a binary classification problem, the model prediction will, by default, be the class whoe probability is greater than 0.5 or 50%
What is K Nearest Neighbors (KNN)?
K-nearest neighbors is a model that uses the “k” most similar observations in order to make a prediction.
What are the Advantages of KNN?
Easier to understand and explain than other machine learning algorithms. Can be used for classification or Regression or can be used for multi-class classification problems