Föreläsning 4 (classification) Flashcards

1
Q

Why is classification different from regression?

A

We have a finite number of
labels. Unlike numbers the labels
are not ordered.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are some practical applications of classification? (Name at least 5 of the examples from slides)

A
  • Will this treatment help that person?
  • Will this person pay back that loan?
  • Will this person like that book?
  • Is this email spam or not?
  • Is this review positive or negative or neutral?
  • What musical genre does this song belong to?
  • What breed of dog does this picture show?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a good classifier?

A

One that performs well on previously unseen parts of the data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why do we use a confusion matrix?

A

For understanding the strengths and weaknesses of a classifier

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the K-NN algorithm?

A

K-NN means K-nearest neighbours and is a classification algorithm that chooses the nearest neighbours to a data point. Nearest in this case means distancewise

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Is it appropriate to use decision trees for classification?

A

yes woho very appropriate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the formula to calculate gini impurity?

A

Gini = 1 – p(+)^2 – p(-)^2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly