Association FINAL Flashcards

Question 1

Q

What are association rules interested in?

Answer

A

Observing which objects occur together

Question 2

Q

Are association rules about recommending items or about which items co-occur?

Answer

A

Seeing which items co-occur

Question 3

Q

Association Rule Mining

Answer

A

Given a set of transactions, find the rules that will predict the occurrence of an item based on the occurrences of other items in the transaction

Question 4

Q

Does implication mean causality?

Answer

A

No, it means co-occurrence

Question 5

Q

{} -> {}

Answer

A

Antecedent -> Consequent

Question 6

Q

3 types of database

Answer

A

Binary, Transaction, Vertical
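
As a rough illustration of the three representations, the sketch below builds a binary and a vertical view from a small transaction database. The data and the variable names (transactions, binary, vertical) are made up for this example.

```python
# Toy transaction (horizontal) database: tid -> itemset. The data is invented.
transactions = {
    1: {"A", "B", "D"},
    2: {"B", "C"},
    3: {"A", "B", "C"},
}

# All distinct items, in a fixed order so the binary columns line up.
items = sorted(set().union(*transactions.values()))

# Binary database: one row per tid, one 0/1 column per item.
binary = {
    tid: [1 if item in itemset else 0 for item in items]
    for tid, itemset in transactions.items()
}

# Vertical database: item -> set of tids whose transaction contains that item.
vertical = {
    item: {tid for tid, itemset in transactions.items() if item in itemset}
    for item in items
}

print(items)          # ['A', 'B', 'C', 'D']
print(binary[1])      # [1, 1, 0, 1]
print(vertical["B"])  # tids 1, 2 and 3
```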

Question 7

Q

Items

Answer

A

I = {x1, x2, …, xm}

Question 8

Q

A set X within the set of items

Question 9

Q

An itemset of cardinality k

Answer

A

k-itemset

Question 10

Q

I^(k)

Answer

A

set of all k-itemsets

Question 11

Q

Transaction identifiers, tids

Answer

A

T = {t1, t2, …, tn}

Question 12

Q

t within T

Question 13

Q

Transaction

Answer

A

Tuple in the form (t, X) where t is a unique transaction identifier and X is an itemset

Question 14

Q

Support

Answer

A

The support of an itemset X in a dataset D denoted sup(X, D) is the number of transactions in D that contain X

Question 15

Q

Relative Support

Answer

A

The relative support of X is the fraction of transactions that contain X: rsup(X, D) = sup(X, D) / |D|
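
A minimal sketch of both measures, assuming the dataset D is stored as a list of (tid, itemset) tuples. The helper names sup and rsup and the toy data are assumptions for illustration only.

```python
def sup(X, D):
    """Support: number of transactions in D that contain every item of X."""
    X = set(X)
    return sum(1 for _tid, itemset in D if X <= itemset)


def rsup(X, D):
    """Relative support: fraction of transactions in D that contain X."""
    return sup(X, D) / len(D)


# Toy dataset of (tid, itemset) tuples; the contents are invented.
D = [
    (1, {"A", "B", "D"}),
    (2, {"B", "C"}),
    (3, {"A", "B", "C"}),
    (4, {"A", "C"}),
]

print(sup({"A", "B"}, D))   # 2 (transactions 1 and 3)
print(rsup({"A", "B"}, D))  # 0.5
```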

Question 16

Q

We use F to

Answer

A

denote the set of all frequent itemsets

Question 17

Q

We use F^(k)

Answer

A

to denote the set of frequent k-itemsets

Question 18

Q

Itemset mining problem

Answer

A

Given a minimum support threshold (minsup), find all itemsets X such that sup(X) >= minsup

Question 19

Q

Frequent itemsets

Answer

A

An itemset X is frequent if sup(X) >= minsup, where minsup is a user-specified minimum support threshold (if minsup is a fraction, then relative support is implied)

Question 20

Q

Total possible subset

Question 21

Q

Naive approach to generate all itemsets that are frequent

Answer

A

For each itemset X that is a subset of I:
compute sup(X, D)
if sup(X, D) >= minsup
add X to the list of frequent itemsets
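
The naive approach can be sketched as follows. This is only an illustration: the function name brute_force_frequent, the (tid, itemset) layout of D, and the toy data are assumptions. It checks every non-empty subset of the items, which is 2^m - 1 candidates for m items.

```python
from itertools import combinations


def brute_force_frequent(D, minsup):
    """Enumerate every non-empty itemset and keep those with support >= minsup."""
    items = sorted(set().union(*(itemset for _tid, itemset in D)))
    frequent = {}
    for k in range(1, len(items) + 1):
        for X in combinations(items, k):
            # Count the transactions that contain all items of X.
            support = sum(1 for _tid, itemset in D if set(X) <= itemset)
            if support >= minsup:
                frequent[frozenset(X)] = support
    return frequent


# Toy dataset of (tid, itemset) tuples; the contents are invented.
D = [
    (1, {"A", "B", "D"}),
    (2, {"B", "C"}),
    (3, {"A", "B", "C"}),
    (4, {"A", "C"}),
]

result = brute_force_frequent(D, minsup=2)
print(len(result))  # 6 frequent itemsets: A, B, C, AB, AC, BC
```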

Question 22

Q

The brute force method

Answer

A

Explores the entire itemset search space, regardless of minsup

Question 23

Q

Goal of Association Rule Mining

Answer

A

Given a set of transactions T, find all the rules having:
support >= minsup
confidence >= minconf

Question 24

Q

Apriori principle

Answer

A

If an itemset is frequent, then all of its subsets must be frequent as well

Question 25

Q

Apriori principle 2

Answer

A

If an itemset is infrequent, then all of its supersets must be infrequent as well
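
A level-wise search that uses both statements of the principle might look like the sketch below. It is a simplified Apriori-style illustration, not a tuned implementation: candidates are formed by taking unions of pairs of frequent k-itemsets rather than a prefix-based join, and the function name apriori, the (tid, itemset) layout of D, and the toy data are assumptions.

```python
from itertools import combinations


def apriori(D, minsup):
    """Level-wise frequent itemset search with subset-based candidate pruning."""

    def support(X):
        return sum(1 for _tid, itemset in D if X <= itemset)

    items = set().union(*(itemset for _tid, itemset in D))
    level = {frozenset([i]) for i in items}  # candidate 1-itemsets
    frequent = {}
    k = 1
    while level:
        # Count candidates and keep only those that meet minsup.
        current = {X: support(X) for X in level}
        current = {X: s for X, s in current.items() if s >= minsup}
        frequent.update(current)
        # Build (k+1)-candidates from unions of frequent k-itemsets.
        k += 1
        candidates = {a | b for a in current for b in current if len(a | b) == k}
        # Prune: a candidate with an infrequent (k-1)-subset cannot be frequent.
        level = {
            c for c in candidates
            if all(frozenset(s) in current for s in combinations(c, k - 1))
        }
    return frequent


# Toy dataset of (tid, itemset) tuples; the contents are invented.
D = [
    (1, {"A", "B", "D"}),
    (2, {"B", "C"}),
    (3, {"A", "B", "C"}),
    (4, {"A", "C"}),
]

found = apriori(D, minsup=2)
print(len(found))  # 6 frequent itemsets: A, B, C, AB, AC, BC
```

In this toy run the item D is infrequent, so no superset of D is ever generated or counted.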

Question 26

Q

A rule is frequent if

Answer

A

the itemset XY is frequent, sup(XY) >= minsup

Question 27

Q

A rule is strong if

Answer

A

conf >= minconf

Question 28

Q

Rules are pruned using

Answer

A

confidence

Question 29

Q

confidence(X -> Y)

Answer

A

sup(XY) / sup(X)
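
A small sketch of the confidence computation, assuming D is a list of (tid, itemset) tuples and that sup(X) > 0. The helper names sup and conf and the toy data are assumptions for illustration.

```python
def sup(X, D):
    """Number of transactions in D that contain every item of X."""
    return sum(1 for _tid, itemset in D if set(X) <= itemset)


def conf(X, Y, D):
    """Confidence of X -> Y: sup(XY) / sup(X). Assumes sup(X) > 0."""
    return sup(set(X) | set(Y), D) / sup(X, D)


# Toy dataset of (tid, itemset) tuples; the contents are invented.
D = [
    (1, {"A", "B", "D"}),
    (2, {"B", "C"}),
    (3, {"A", "B", "C"}),
    (4, {"A", "C"}),
]

# sup(ABC) = 1 and sup(AB) = 2, so the confidence of AB -> C is 1/2.
print(conf({"A", "B"}, {"C"}, D))  # 0.5
```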

Question 30

Q

Unlike support, confidence does not exhibit

Answer

A

the monotone property

Question 31

Q

If a rule X -> Y \ X does not satisfy the confidence threshold, then

Answer

A

any rule X' -> Y \ X', where X' is a subset of X, must not satisfy the confidence threshold either
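
The reason is that all rules generated from the same itemset Y share the numerator sup(Y), while shrinking the antecedent can only keep or raise the denominator. The check below illustrates this on invented data. The names sup, conf_from and pruning_property_holds are assumptions, and every antecedent is assumed to have non-zero support.

```python
from itertools import combinations

# Toy dataset of (tid, itemset) tuples; the contents are invented.
D = [
    (1, {"A", "B", "D"}),
    (2, {"B", "C"}),
    (3, {"A", "B", "C"}),
    (4, {"A", "C"}),
]


def sup(X):
    return sum(1 for _tid, itemset in D if set(X) <= itemset)


def conf_from(Y, X):
    """Confidence of the rule X -> Y \\ X, i.e. sup(Y) / sup(X)."""
    return sup(Y) / sup(X)


def pruning_property_holds(Y):
    """Check that a smaller antecedent never gives a higher confidence."""
    for k in range(2, len(Y)):
        for X in combinations(sorted(Y), k):
            for j in range(1, k):
                for X_sub in combinations(X, j):
                    if conf_from(Y, X_sub) > conf_from(Y, X):
                        return False
    return True


Y = {"A", "B", "C"}
print(conf_from(Y, {"A", "B"}))   # 0.5
print(pruning_property_holds(Y))  # True
```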

Question 32

Q

What happens if minsup is too high

Answer

A

We may miss interesting patterns involving low-support items. For example, such items may correspond to expensive products that customers rarely purchase, but whose patterns are still interesting for the retailer to mine

Question 33

Q

What happens if minsup is too low

Answer

A

We get information overload: too many frequent itemsets and too many spurious rules

Question 34

Q

How can some high confidence rules be misleading?

Answer

A

High confidence might not imply a meaningful relationship if the consequent is already common in the dataset, irrespective of the antecedent

Question 35

Q

Confidence measure ignores

Answer

A

the support of the itemset appearing in the rule consequent

Question 36

Q

Which metric accounts for the consequent?

Question 37

Q

lift

Answer

A

lift(X -> Y) = conf(X -> Y) / rsup(Y)
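
A short sketch of the lift computation, assuming D is a list of (tid, itemset) tuples and that both X and Y occur at least once. The helper names sup, rsup, conf and lift and the toy data are assumptions for illustration.

```python
def sup(X, D):
    """Number of transactions in D that contain every item of X."""
    return sum(1 for _tid, itemset in D if set(X) <= itemset)


def rsup(X, D):
    """Fraction of transactions in D that contain X."""
    return sup(X, D) / len(D)


def conf(X, Y, D):
    """Confidence of X -> Y. Assumes sup(X) > 0."""
    return sup(set(X) | set(Y), D) / sup(X, D)


def lift(X, Y, D):
    """Lift of X -> Y: confidence divided by the relative support of Y."""
    return conf(X, Y, D) / rsup(Y, D)


# Toy dataset of (tid, itemset) tuples; the contents are invented.
D = [
    (1, {"A", "B", "D"}),
    (2, {"B", "C"}),
    (3, {"A", "B", "C"}),
    (4, {"A", "C"}),
]

# conf(A -> B) = 2/3 and rsup(B) = 3/4, so the lift is (2/3) / (3/4) = 8/9.
print(lift({"A"}, {"B"}, D))
```

Here the lift is below 1, so in this toy data A and B occur together slightly less often than their individual supports would suggest.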

Question 38

Q

A lift value close to 1 implies

Answer

A

that the support of the rule is what would be expected given the supports of X and Y, that is, X and Y look roughly independent

Question 39

Q

Good lifts, bad lifts

Question 40

Q