Week 9: MBA Flashcards
What is “support”?
The fraction of transactions that contain that item/item set
What is confidence of a rule (A→B)?
The conditional probability that a transaction containing the items on the LHS also contains the items on the RHS
How to calculate confidence?
Support(A ∪ B) / Support(A), where Support(A ∪ B) is the fraction of transactions that contain every item in both A and B
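A minimal sketch of the support and confidence calculations. The five baskets and the function names `support` and `confidence` are made up for illustration; they are not from the course material.

```python
# Hypothetical toy data: five market baskets, each a set of items.
transactions = [
    {"bread", "milk"},
    {"bread", "diapers", "beer", "eggs"},
    {"milk", "diapers", "beer", "cola"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "cola"},
]

def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)

def confidence(lhs, rhs, transactions):
    """Support of LHS and RHS together, divided by support of the LHS."""
    both = set(lhs) | set(rhs)
    return support(both, transactions) / support(lhs, transactions)

# diapers appears in 4 of 5 baskets; diapers and beer together in 3 of 5.
print(support({"diapers"}, transactions))
print(support({"diapers", "beer"}, transactions))
# Of the 4 baskets with diapers, 3 also contain beer.
print(confidence({"diapers"}, {"beer"}, transactions))
```

Counting transactions directly (3 of 4) gives the same confidence as dividing the two supports, since the total number of transactions cancels out.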
What is lift and how do you calculate it?
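It is the ratio of the confidence of a rule to its expected confidence, i.e. the confidence you would see if A and B were independent, which is simply Support(B).
Lift = Confidence(A→B) / Support(B). A lift above 1 means A and B occur together more often than expected under independence; a lift below 1 means less often.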
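A small sketch of the lift calculation, assuming the same kind of made-up basket data as a list of sets. The data and the helper names `support` and `lift` are illustrative assumptions only.

```python
# Hypothetical toy data: five market baskets, each a set of items.
transactions = [
    {"bread", "milk"},
    {"bread", "diapers", "beer", "eggs"},
    {"milk", "diapers", "beer", "cola"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "cola"},
]

def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)

def lift(lhs, rhs, transactions):
    """Confidence(lhs -> rhs) divided by the expected confidence, Support(rhs)."""
    both = set(lhs) | set(rhs)
    conf = support(both, transactions) / support(lhs, transactions)
    return conf / support(rhs, transactions)

# Confidence(diapers -> beer) is 3/4 and Support(beer) is 3/5,
# so the lift is greater than 1: the two co-occur more than independence predicts.
print(lift({"diapers"}, {"beer"}, transactions))
```

Lift is symmetric: swapping the LHS and RHS gives the same value, because both forms reduce to Support(A ∪ B) / (Support(A) × Support(B)).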
What is the Apriori principle?
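It states that if an itemset is frequent, then all of its subsets must also be frequent. Equivalently, if an itemset is infrequent, then all of its supersets must be infrequent, which is what lets the algorithm discard candidates early.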
What are the steps in the algorithm to generate frequent item sets?
- Start with all item sets with a single item and compute their support (or count)
- Remove the ones that do not have minimum support
- Generate all two-item item sets from the items retained in the previous step and compute their support
- Remove the ones that do not have minimum support
- Continue increasing the number of items in the item sets, each time building candidates only from the item sets retained in the previous step, until no larger item set meets the minimum support (or count) or the desired maximum size is reached
- Output: A list of all frequent itemsets in the dataset that meet the support threshold.
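The steps above can be sketched roughly as follows. This is a simple, unoptimized illustration rather than a reference implementation; the function name `frequent_itemsets`, its parameters, and the basket data are all assumptions made for the example.

```python
from itertools import combinations

def frequent_itemsets(transactions, min_support, max_size=None):
    """Return {itemset: support} for every itemset meeting min_support."""
    transactions = [frozenset(t) for t in transactions]
    n = len(transactions)

    def support(itemset):
        return sum(itemset <= t for t in transactions) / n

    # Start with all single-item item sets.
    candidates = {frozenset([item]) for t in transactions for item in t}
    frequent = {}
    k = 1
    while candidates and (max_size is None or k <= max_size):
        # Compute support and remove candidates below the minimum.
        kept = {}
        for c in candidates:
            s = support(c)
            if s >= min_support:
                kept[c] = s
        frequent.update(kept)

        # Build (k+1)-item candidates from the retained k-item sets.
        # Apriori pruning: keep a candidate only if all of its
        # k-item subsets were themselves retained.
        k += 1
        candidates = set()
        for a, b in combinations(kept, 2):
            union = a | b
            if len(union) == k and all(
                frozenset(sub) in kept for sub in combinations(union, k - 1)
            ):
                candidates.add(union)
    return frequent

# Hypothetical toy data: five market baskets.
transactions = [
    {"bread", "milk"},
    {"bread", "diapers", "beer", "eggs"},
    {"milk", "diapers", "beer", "cola"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "cola"},
]

result = frequent_itemsets(transactions, min_support=0.6)
for itemset, s in sorted(result.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), s)
```

With this data and a minimum support of 0.6, eggs and cola are dropped in the first pass, so no pair or triple containing them is ever generated or counted.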