Chapter 6 test deck Flashcards
frquent pattern
A set of items subsequences substructures that occure frequently in a data set
an intrinsic and important p[roprty if dataset
frequent itemset
a set of items that appear frequently together in a transaction data set, e.g. milk and bread
grquent sequential pattern
buying your first pc, then a digital camera and then a memory car
substructures
refer to different strucural forms such as subgraphs, subtrees, or sublattice, which may be combined with itemsets or subsequences. If a substructure occurs frequently it is called a frquent structured pattern
fequent pattern mining
searches for recurring relationships in a given data setand is the foundation fro many essential data mining tasks
market basket analysis
the earliest form of frequent pattern mining for association rule
association rule
each item has a boolean variable representing the presence or absence of that item
each baskit can then be represented by a boolean vector of values
support
usefulness
confidence
certainty of rules
the teliability of the inference made by a rule
the higher the confidence the more likely it is for b to be present in transactions that contain A
itemset
a set of one or more items
occurenace frequency
frquency of an itemset x, supportm suoport count, or count of the itemset
relative support
the fraction of transaction that contain x
Association rule does not necessarily imply causality
mining association rules
- find all frequent itemsets: each of these item sets will occur at least as frequently as a predetermined min support count
- generate strong association rules from the frequent itemsets: these rules must satisfy minimum support and minimum confidence
apriori property
all nonempty subsets of a frequent itemset must also be frequent