Week 9: MBA Flashcards

Question 1

Q

What is “support”?

Answer

A

The fraction of all transactions that contain a given item or item set.
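
A minimal sketch of that definition in Python. The `transactions` list and the `support` helper are hypothetical, made up for illustration rather than taken from the course material.

```python
# Hypothetical toy baskets, used only for illustration.
transactions = [
    {"bread", "milk"},
    {"bread", "diapers", "beer", "eggs"},
    {"milk", "diapers", "beer", "cola"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "cola"},
]

def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= t)  # subset test
    return hits / len(transactions)

print(support({"milk"}, transactions))             # 4 of 5 baskets -> 0.8
print(support({"diapers", "beer"}, transactions))  # 3 of 5 baskets -> 0.6
```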

Question 2

Q

What is confidence of a rule (A→B)?

Answer

A

The conditional probability that a transaction containing the items on the left-hand side (LHS) also contains the items on the right-hand side (RHS).

Question 3

Q

What is confidence of a rule (A→B)?

Answer

A

The conditional probability that a transaction containing the items on the left-hand side (LHS) also contains the items on the right-hand side (RHS).

Question 4

Q

How to calculate confidence?

Answer

A

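Confidence(A→B) = Support(A and B together) / Support(A), where the numerator is the fraction of transactions that contain every item in both A and B.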
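
A small sketch of the calculation, assuming a hypothetical toy dataset. The `count` and `confidence` helpers are illustrative names only. Raw counts are divided instead of two support fractions, which gives the same ratio without floating-point rounding.

```python
# Hypothetical toy baskets, used only for illustration.
transactions = [
    {"bread", "milk"},
    {"bread", "diapers", "beer", "eggs"},
    {"milk", "diapers", "beer", "cola"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "cola"},
]

def count(itemset, transactions):
    """Number of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(1 for t in transactions if itemset <= t)

def confidence(lhs, rhs, transactions):
    """Confidence of lhs -> rhs: count(lhs and rhs) / count(lhs).

    Dividing counts equals dividing supports, because the
    total number of transactions cancels out.
    """
    both = set(lhs) | set(rhs)
    return count(both, transactions) / count(lhs, transactions)

# 4 baskets contain diapers; 3 of those also contain beer.
print(confidence({"diapers"}, {"beer"}, transactions))  # 0.75
```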

Question 5

Q

What is lift and how do you calculate it?

Answer

A

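Lift is the ratio of a rule's confidence to its expected confidence, i.e. the confidence the rule would have if the LHS and RHS were independent, which is simply the support of the RHS. A lift above 1 means the two sides occur together more often than independence would predict.

Lift(A→B) = Confidence(A→B) / Support(B)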
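
A short sketch of the lift calculation on a hypothetical toy dataset. The `count` and `lift` helpers are illustrative names. The formula is rearranged into counts, which is algebraically the same as Confidence(A→B) / Support(B).

```python
# Hypothetical toy baskets, used only for illustration.
transactions = [
    {"bread", "milk"},
    {"bread", "diapers", "beer", "eggs"},
    {"milk", "diapers", "beer", "cola"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "cola"},
]

def count(itemset, transactions):
    """Number of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(1 for t in transactions if itemset <= t)

def lift(lhs, rhs, transactions):
    """Lift of lhs -> rhs = confidence(lhs -> rhs) / support(rhs).

    With n transactions this equals
    (count(lhs and rhs) * n) / (count(lhs) * count(rhs)).
    """
    n = len(transactions)
    n_both = count(set(lhs) | set(rhs), transactions)
    n_lhs = count(lhs, transactions)
    n_rhs = count(rhs, transactions)
    return (n_both * n) / (n_lhs * n_rhs)

# Confidence is 3/4 = 0.75 and the support of beer is 3/5 = 0.6.
print(lift({"diapers"}, {"beer"}, transactions))  # 1.25
```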

Question 6

Q

What is the Apriori principle?

Answer

A

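It states that if an itemset is frequent, then all of its subsets must also be frequent. Equivalently, if an itemset is infrequent, every superset of it is infrequent too, so those supersets never need to be counted.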
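
The principle follows from the fact that a subset can never have lower support than the itemset it comes from. The sketch below checks this on a hypothetical toy dataset; `support` and `subsets_at_least_as_frequent` are illustrative helper names.

```python
from itertools import combinations

# Hypothetical toy baskets, used only for illustration.
transactions = [
    {"bread", "milk"},
    {"bread", "diapers", "beer", "eggs"},
    {"milk", "diapers", "beer", "cola"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "cola"},
]

def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(1 for t in transactions if itemset <= t) / len(transactions)

def subsets_at_least_as_frequent(itemset, transactions):
    """True if every non-empty proper subset has support >= the itemset's."""
    items = sorted(itemset)
    base = support(items, transactions)
    for size in range(1, len(items)):
        for combo in combinations(items, size):
            if support(combo, transactions) < base:
                return False
    return True

print(subsets_at_least_as_frequent({"bread", "milk", "diapers"}, transactions))
```

Any transaction that contains the full itemset also contains each of its subsets, so the check holds for every itemset, not just this one.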

Question 7

Q

What are the steps in the algorithm to generate frequent item sets?

Answer

A

Start with all item sets with a single item and compute their support (or count)
Remove the ones that do not have minimum support
Generate all two-item item sets using the results retained in the previous step and compute their support
Remove the ones that do not have minimum support
Keep increasing the number of items per item set, repeating the generate-and-remove steps, until you have the item sets of every size (or up to a desired maximum size) that meet the required minimum support (or count)
Output: A list of all frequent itemsets in the dataset that meet the support threshold.
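
The steps above can be sketched as a level-wise loop. This is a minimal, unoptimized illustration under assumed inputs, not a reference implementation. The function name `frequent_itemsets`, its parameters, and the toy data are all hypothetical.

```python
from itertools import combinations

def frequent_itemsets(transactions, min_support, max_size=None):
    """Return {itemset: support} for all itemsets meeting min_support."""
    n = len(transactions)

    def support(itemset):
        return sum(1 for t in transactions if itemset <= t) / n

    # Step 1: single-item sets, keeping only those with minimum support.
    singles = {frozenset([item]) for t in transactions for item in t}
    current = {s: support(s) for s in singles}
    current = {s: v for s, v in current.items() if v >= min_support}

    result = {}
    k = 1
    while current:
        result.update(current)
        if max_size is not None and k >= max_size:
            break
        k += 1
        # Build size-k candidates from pairs of retained (k-1)-item sets.
        candidates = {a | b for a, b in combinations(list(current), 2)
                      if len(a | b) == k}
        # Apriori pruning: every (k-1)-subset must itself have been retained.
        candidates = {c for c in candidates
                      if all(frozenset(sub) in current
                             for sub in combinations(c, k - 1))}
        # Compute support and remove candidates below the threshold.
        current = {c: support(c) for c in candidates}
        current = {c: v for c, v in current.items() if v >= min_support}
    return result

# Hypothetical toy baskets, used only for illustration.
transactions = [
    {"bread", "milk"},
    {"bread", "diapers", "beer", "eggs"},
    {"milk", "diapers", "beer", "cola"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "cola"},
]

result = frequent_itemsets(transactions, min_support=0.6)
for itemset, value in sorted(result.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), value)
```

On this toy data with a 0.6 threshold, four single items and four pairs are retained, and no three-item set survives.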

(7 cards)