Supervised Learning Flashcards

1
Q

Example Scenario: Email Spam Detection

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Consider an email service provider implementing a spam detection system. The goal is to

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

emails as either “spam” or “not spam” (also known as “ham”).

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Precision:

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Precision measures the proportion of corectly identified spam emails out of all emails

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

by the model.

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Formula: Precision = True Positives / (True Positives + False Positives)

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

True Positives (TP): The number of emails correctly classified as spam.

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

False Positives (FP): The number of non-spam emails incorrectly classified as

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Precision Example: Let’s say the spam detection system flagged 100 emails as Sp

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

inspection

A

90 of them were actually spam (True Positives)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Precision = 90 /(90 + 10) = 90%

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

So

A

in this scenario

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

flagged as spam

A

90% of them were actually spam

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Recall:

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Recall measures the proportion of correctly identified spam emails out of all actual

A
17
Q

dataset.

A
18
Q

Formula: Recall =True Positives / (True Positives + False Negatives)

A
19
Q

False Negatives (FN): The number of spam emails incorrectly classified as no

A
20
Q

Recall Example: Suppose there were I 50 actual spam emails in the dataset. The spa

A
21
Q

correctly identified 90 of them as spam (True Positives)

A

but it missed 60 spam emails

22
Q

Recall = 90/(90 + 60) = 60%

A
23
Q

In this case. the recall ofthe spam detection system is 60%. This indicates that out of all

A
24
Q

the system managed to identify 60% of them correctly

A

but it missed 40%.

25
Q

Interpretation:

A
26
Q

• High Precision: Indicates that when the model preuicts an email as spam

A

it 19

27
Q

correct. This reduces the chances of legitimate emails being incorrectly marke

A
28
Q

crucial for user trust and satisfaction

A