C4 Flashcards

Question 1

Q

accuracy is not a good measure in most cases of classification

Answer

A

classes are often unbalanced: high accuracy in one class might mean low accuracy in another class
--> use precision and recall

Question 2

Q

precision

Answer

A

what proportion of the assigned labels are correct? (for one class)

Precision = |A * T| / |A| = tp / (tp + fp)

Question 3

Q

recall

Answer

A

what proportion of true labels was assigned?

Recall = |A * T| / |T| = tp / (tp + fn)

Question 4

Q

F-score

Answer

A

average of precision and recall

F1 = 2 * precision * recall / (precision + recall)

Question 5

Q

what determines the quality of the classifier

Answer

A

Question 6

Q

analysis for classification

Answer

A

per-item evaluation: which are the most difficult items, one classifier may work better for one subset, another for another subset

per-category evaluation: what is the precision and recall per class (especially with class imbalance)

error analysis: confusion matrix

Question 7

Q

evaluation for regression

Answer

A

mean squared error:

Question 8

Q

evaluation for rankings

Answer

A

option 1: proportion of correct answers at position 1

option 2: proportion of items that have the correct answer in the top n

option 3: the rank of the correct answer or the rank of the highest ranked relevant answer (multiple answers)

(8 cards)