Quiz 2 Flashcards

1
Q

What is association rule mining?

A

Looking at how the data collected can be associated to each other. The frequency of purchasing an item and purchasing another item.

For Example: Diapers and Beer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q
  1. Give two examples of how association rule mining can be used in other than the market-basket use case. [Hint: Think through the various examples we discussed in class Tuesday (Jan 29 2024)]
A

Identifying common topics where students struggle

Health care and medical diagnosis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q
  1. Why is the confidence of association rules always less than 1?
A

Partial Coverage, Diverse Outcomes, Incomplete data/Noise, Conditional Nature of Confidence.

Reaching complete confidence is uncommon/unrealistic.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q
  1. Consider a transactional database where a pen is bought 15 times and a bag is bought 25 times. What is the maximum number of times that a pen and a bag are bought together?

A. 10
B. 15
C. 20
D. 25

A

B. 15

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q
  1. What is the main goal of the market-basket model for frequent itemset mining in supermarket shelf management?

A. Maximize the number of items on the shelf
B. Identify items that are bought together by sufficiently many customers
C. Minimize the sales data collected with barcode scanners
D. Reduce variety of products in the supermarket

A

B. Identify items that are bought together by sufficiently many customers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q
  1. In association rules, what does the “interest” of a rule measure?
    A. The likelihood of a rule being significant
    B. The difference between confidence and support
    C. The probability of the items being picked up
    D. The overall popularity of the items in the rule
A

A. The likelihood of a rule being significant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q
  1. What information is typically stored in a supermarket’s market-basket data?
    A. Individual item prices
    B. Customer demographics
    C. Lists of items purchased by each customer on a specific day
    D. Sales revenue generated by each item
A

C. Lists of items purchased by each customer on a specific day

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q
  1. True | False In the market basket analysis, association rules with high confidence are always considered interesting, regardless of the support of the itemsets involved.
A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q
  1. True | False The support threshold in market basket analysis determines the minimum number of baskets in which an itemset must appear to be considered frequent.
A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q
  1. True | False The Naïve algorithm to find frequent pairs takes n2 bytes of memory, where n is the number of universal items available.
A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q
  1. True | False The triangular matrix approach to count the number of pairs is considered to be more efficient than the triples approach in all cases.
A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q
  1. True| False All rules with high confidence are considered as interesting rules.
A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q
  1. True | False An item can be part of a frequent itemset even if it is individually not a frequent item.
A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q
  1. True | False If the confidence of association rule A, B, C -> D is low, then the confidence of
    A, B -> C, D is also low.
A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly