Week 9: MBA Flashcards

1
Q

What is “support”?

A

The fraction of transactions that contain that item/item set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is confidence of a rule (A→B)?

A

The conditional probability that a transaction that contains the items on the LHS also contain those on the right

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is confidence of a rule (A→B)?

A

The conditional probability that a transaction that contains the items on the LHS also contain those on the right

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How to calculate confidence?

A

Support (A->B)/Support A

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is lift and how do you calculate it?

A

It is the ratio by which the confidence of a rule exceeds the expected confidence.

Confidence(A–>B) / Expected confidence (Support B)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the Apriori principle?

A

It states that if an itemset is frequent, then all of its subsets must also be frequent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the steps in the algorithm to generate frequent item sets?

A
  1. Start with all item sets with a single item and compute their support (or count)
  2. Remove the ones that do not have minimum support
  3. Generate all two-item item sets using the results retained in the previous step and compute their support
  4. Remove the ones that do not have minimum support
  5. Continue increasing number of items in item sets till you have item sets of all sizes (or desired max size) with the required minimum support (or count)
  6. Output: A list of all frequent itemsets in the dataset that meet the support threshold.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly