Week 10 Flashcards
What is Association Rule Mining (ARM)?
A “rule”-based machine learning method
What is the objective of ARM?
To discover interesting relations between variables in large DBs.
Association Rule Mining aims to identify _____ rules discovered in DBs, using some measures of interestingness
strong
T/F - Association Rule Mining assumes all data is numeric
False. ARM assumes all data is categorical
T/F - ARM features good algorithms for numeric data
False
What was Associated Rule Mining initially used for?
Market analysis, to determine how items purchased by customers are related
A customer buys bread, there is an 80% chance they buy butter.
bread > butter
or
A customer buys {onions, potatoes} > buys{tomatoes}
What is this an example of?
Associated Rule Mining
It was useful for marketing activities like product promotion or product pricing
Bread > Butter[20%, 45%]
What is the Antecedent, Consequent?
What is the Support, Confidence?
Antecedent = Bread
Consequent = Butter
Support = 20%
Confidence = 45%
In a store where there are 100 total transactions, what does a support of 20% for bread mean?
It means 20 transactions contained bread
With 20 transactions that contain bread there is a 45% confidence that the transaction also contained butter. How many transactions had butter?
9
T/F - Recommendation systems like Amazon’s “also viewed” use ARM
True
Here are 3 algorithms for association rule:
Apriori Algorithm
Elcat Algorithm
F.P. Growth Algorithm
Which one is the best?
Apriori
What do we find with this equation?
of transactions where X appears / Total # of transactions
Support (X)
To find this we divide Support(X,Y) with Support (X)
Confidence
Algorithm for k products:
- User sets a minimum support criterion
- Generate a list of one-item sets that meet the support criterion
- Use the list of one-item sets to generate list of two-item sets
- Use the list of two-item sets to generate list of three-item sets
- Continue up through k-item set
- All item sets must meet the support criterion
What do we get by following this algo.?
The Apriori algo. for Generating Frequent Item sets