Lecture 4 (classification) Flashcards
Why is classification different from regression?
We have a finite number of labels. Unlike numbers, the labels are not ordered.
What are some practical applications of classification? (Name at least 5 of the examples from the slides)
- Will this treatment help that person?
- Will this person pay back that loan?
- Will this person like that book?
- Is this email spam or not?
- Is this review positive or negative or neutral?
- What musical genre does this song belong to?
- What breed of dog does this picture show?
What is a good classifier?
One that performs well on previously unseen parts of the data set, i.e. examples that were not used to train it.
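A minimal sketch of how "performs well on unseen data" could be checked: hold back part of the data set, then measure accuracy only on that held-back part. The helper names `train_test_split` and `accuracy`, the 25% test fraction, and the fixed seed are illustrative assumptions, not something from the slides.

```python
import random

def train_test_split(examples, test_fraction=0.25, seed=0):
    """Shuffle (features, label) pairs and split them into train and test parts."""
    shuffled = list(examples)
    random.Random(seed).shuffle(shuffled)
    # Keep at least one example in the unseen test part.
    n_test = max(1, int(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]

def accuracy(classifier, examples):
    """Fraction of examples whose predicted label equals the true label."""
    correct = sum(1 for features, label in examples if classifier(features) == label)
    return correct / len(examples)
```

The classifier would be fitted on the first returned list only, and `accuracy` would then be computed on the second.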
Why do we use a confusion matrix?
To understand the strengths and weaknesses of a classifier: it shows, for each true label, how many examples were assigned to each predicted label.
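As a rough illustration, a confusion matrix can be built by counting (true label, predicted label) pairs. The function name `confusion_matrix` and the spam/ham labels below are made up for this sketch.

```python
from collections import Counter

def confusion_matrix(true_labels, predicted_labels):
    """Count how often each (true label, predicted label) pair occurs."""
    return Counter(zip(true_labels, predicted_labels))

# Made-up example data: five emails with true and predicted labels.
true_labels = ["spam", "spam", "ham", "ham", "ham"]
predicted_labels = ["spam", "ham", "ham", "ham", "spam"]
matrix = confusion_matrix(true_labels, predicted_labels)

# Pairs where the two labels differ are the classifier's mistakes.
for (true, predicted), count in sorted(matrix.items()):
    print(f"true={true:5} predicted={predicted:5} count={count}")
```

Pairs with identical labels are correct predictions; the other pairs show which classes get mixed up with each other.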
What is the K-NN algorithm?
K-NN stands for K-nearest neighbours. It is a classification algorithm that finds the K training points nearest to a new data point and assigns that point the most common label among them. Nearest here means closest by distance.
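A minimal K-NN sketch, assuming Euclidean distance and a simple majority vote among the K neighbours. The function name `knn_predict` and the default `k=3` are illustrative choices, and ties are not handled in any special way.

```python
import math
from collections import Counter

def knn_predict(training_points, query, k=3):
    """Predict a label for `query` from (features, label) training pairs."""
    # Sort training points by Euclidean distance to the query, keep the k closest.
    neighbours = sorted(
        training_points,
        key=lambda item: math.dist(item[0], query),
    )[:k]
    # The most common label among the k neighbours becomes the prediction.
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]
```

For example, with three points labelled "a" near the origin and three labelled "b" near (5, 5), a query at (0.5, 0.5) would be labelled "a".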
Is it appropriate to use decision trees for classification?
Yes, decision trees are very appropriate for classification.
What is the formula to calculate gini impurity?
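Gini = 1 - p(+)^2 - p(-)^2, where p(+) and p(-) are the fractions of positive and negative examples in the set (for example, in a decision tree node).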
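The formula can be checked with a few lines of code. This sketch assumes the two-class case from the card and takes raw counts of positive and negative examples; the function name `gini_impurity` is an illustrative choice.

```python
def gini_impurity(n_positive, n_negative):
    """Gini impurity of a set with two classes, given the class counts."""
    total = n_positive + n_negative
    if total == 0:
        raise ValueError("Gini impurity is undefined for an empty set")
    p_pos = n_positive / total  # p(+)
    p_neg = n_negative / total  # p(-)
    return 1 - p_pos ** 2 - p_neg ** 2
```

A set containing only one class gives 0 (pure), and an even 50/50 split gives 0.5, the maximum for two classes.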