Lesson 11.2 - Spam Flashcards
1
Q
What is the problem with SPAM?
A
- Filters: Someone must design filters
- Storage: email servers must store SPAM
- Security problems: Phishing
2
Q
How much of email traffic is SPAM
A
95%?
3
Q
How to differentiate “SPAM” from “Ham”
A
- Content-based: Looked at what in the mail
- IP address of sender (blacklist)
- Behavioral features - if the mail is sent at a time or in a batch.
4
Q
What’s the problem with Content-based filtering?
A: Too slow
B: Easy for attackers to evade
C: Words are difficult to parse
A
B: Easy to evade
5
Q
Behavioral features
A
- Geographical location of sender/receiver
- Set of target recipients
- Upstream ISP
- Botnet?
This is challenging because you must understand network behavior and then build classifiers
6
Q
BGP “Agility” make blacklists ineffective why?
A
- Hijack IP prefix (within 10 minutes)
- Send SPAM
- Withdraw IP prefix. This makes IP blacklists ineffective.
7
Q
Features that worked well that the receiver could make a decision make on based on the first packet?
A
Single packet features: - Distance between sender/rec - Density IP space - Time of day - AS Single message - # of recipients - length of message Aggregates (group of messages9 - variation in message length