E9 Flashcards
1
Q
Data science tasks can be split into two groups
A
- Unsupervised methods
- Supervised methods
2
Q
Main difference unsupervised methods and supervised methods
A
Unsupervised: There is NO specific target variable
Supervised: There is a specific target variable
3
Q
Unsupervised methods
A
- Affinity Grouping
- Similarity Matching
- Clustering
- Sentiment Analysis
4
Q
Supervised methods
A
- Predictive Modeling
2. Causal (Explanatory) Modeling
5
Q
Affinity Grouping
A
a.k.a. associations, market-baset analysis
Which items are commonly purchased together?
6
Q
Similarity Matching
A
Which other companies are similar to ours?
7
Q
Clustering
A
Do my customers form natural groups?
8
Q
Sentiment Analysis
A
What is the sentiment of my users?
9
Q
Predictive Modeling
A
- Will customer X default on her loan?
- Which customer accounts have been defrauded?
- How much will client X spend?
- Who might be “friends” in our social networking site?
10
Q
Causal (Explanatory) Modeling
A
- How much does one additional year of education contribute to a person’s income?
- How much would client X spend if I gave her a discount?