Lesson 11.2 - Spam Flashcards

1
Q

What is the problem with SPAM?

A
  1. Filters: Someone must design filters
  2. Storage: email servers must store SPAM
  3. Security problems: Phishing
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How much of email traffic is SPAM

A

95%?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How to differentiate “SPAM” from “Ham”

A
  1. Content-based: Looked at what in the mail
  2. IP address of sender (blacklist)
  3. Behavioral features - if the mail is sent at a time or in a batch.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What’s the problem with Content-based filtering?
A: Too slow
B: Easy for attackers to evade
C: Words are difficult to parse

A

B: Easy to evade

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Behavioral features

A
  1. Geographical location of sender/receiver
  2. Set of target recipients
  3. Upstream ISP
  4. Botnet?

This is challenging because you must understand network behavior and then build classifiers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

BGP “Agility” make blacklists ineffective why?

A
  1. Hijack IP prefix (within 10 minutes)
  2. Send SPAM
  3. Withdraw IP prefix. This makes IP blacklists ineffective.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Features that worked well that the receiver could make a decision make on based on the first packet?

A
Single packet features:
- Distance between sender/rec
- Density IP space 
- Time of day
- AS
Single message 
- # of recipients
- length of message
Aggregates (group of messages9
- variation in message length
How well did you know this?
1
Not at all
2
3
4
5
Perfectly