UNIT 2 - Introduction To Continuous Association Rule Mining Algorithm (CARMA) Flashcards

Question 1

Q

Why use CARMA?

Answer

A

Efficient: uses less space and time than Apriori, and needs at most two scans of the data to produce the rules
Uses rule support instead of antecedent support (which Apriori uses)
Allows rules with multiple consequents
Allows the support threshold to be changed during execution
Limitation: only supports binary/flag variables
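
The difference between rule support and antecedent support can be made concrete with a small calculation. The sketch below uses made-up transactions and a hypothetical rule bread -> milk; the helper name `support` is an illustration, not part of any tool.

```python
# Made-up market-basket data, one set of items per transaction.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
    {"bread", "milk"},
]

def support(itemset, transactions):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)

antecedent = {"bread"}
consequent = {"milk"}

# Antecedent support: transactions containing the antecedent only.
antecedent_support = support(antecedent, transactions)
# Rule support: transactions containing antecedent AND consequent.
rule_support = support(antecedent | consequent, transactions)
# Confidence relates the two measures.
confidence = rule_support / antecedent_support

print(antecedent_support, rule_support, confidence)
```

Rule support can never exceed antecedent support, so a threshold on rule support is the stricter of the two.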

Question 2

Q

What are the two phases of CARMA?

Answer

A

Phase I
Identifies frequent itemsets in the data by constructing a lattice of all potentially frequent itemsets. The lattice is built in a single scan of the data.

Phase II
Removes itemsets that are not frequent, then generates rules from the lattice of frequent itemsets.

Question 3

Q

What should be considered when choosing a method for association analysis?

Answer

A

a) characteristics of data to be analysed

b) strengths and weaknesses of the techniques under consideration.

Question 4

Q

What should be done to numeric inputs before using APRIORI and CARMA?

Answer

A

Numeric fields have to be categorized or binned
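
As an illustration of binning, the sketch below turns a numeric field into categories and then into flag fields. The cut points, labels, and function names (`bin_value`, `to_flags`) are hypothetical choices made for this example.

```python
def bin_value(value, edges, labels):
    """Return the label of the first bin whose upper edge is >= value.

    edges holds the upper bounds of all bins except the last one,
    so labels must have one more entry than edges.
    """
    for upper, label in zip(edges, labels):
        if value <= upper:
            return label
    return labels[-1]

def to_flags(label, labels, prefix):
    """Expand one categorical value into a set of binary flag fields."""
    return {f"{prefix}={name}": (name == label) for name in labels}

ages = [19, 34, 52, 67]
edges = [29, 49]                        # hypothetical cut points
labels = ["young", "middle", "senior"]  # hypothetical category names

binned = [bin_value(a, edges, labels) for a in ages]
flags = [to_flags(b, labels, "age") for b in binned]
print(binned)
print(flags[1])
```

The first step gives a categorical field; the second step produces the flag representation needed by a method that accepts only binary inputs.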

Question 5

Q

CARMA can handle inputs with more than two categories whereas Apriori can only handle binary inputs. True or False?

Answer

A

False.

CARMA can only handle binary inputs whereas Apriori can handle inputs with more than two categories

Question 6

Q

If rules with many consequents are desired, which is the preferred method to use?

Answer

A

CARMA, because it allows rules with multiple consequents.

Question 7

Q

Which methods are flexible in terms of choice in evaluation methods?

Answer

A

CARMA and APRIORI

The Apriori method allows the choice of four different rule evaluation measures as discussed previously, whereas the CARMA method uses the rule support and rule confidence for rule evaluation. In addition, CARMA also allows users to specify the rule size and to vary the support threshold.

Question 8

Q

What is ARSD?

Answer

A

Association Rule Summary Diagram
- Assembles related rules into a single diagram that is succinct and easier to interpret.

Question 9

Q

What is ARSD used for?

Answer

A

- Assembles related rules into a single diagram that is succinct and easier to interpret.
- Important when analysts need to present mining results to a non-technical audience such as business managers.

Question 10

Q

What is Phase 1 of CARMA?

Answer

A

For each transaction T read from the data:
Increment the count of every itemset in the lattice L that also occurs in T.
For each subset v of T, if v is not in L and all subsets of v are in L, insert v into L and update the statistics kept for v. This is what enables continuous feedback.
Prune the lattice by removing itemsets with low support (< Min Rule Support).
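
The three steps above can be sketched in code. This is a simplified illustration under stated assumptions, not the full CARMA algorithm: the function name, the `prune_every` parameter, and the crude support upper bound (observed count plus every transaction seen before the itemset was inserted) are choices made for this example. It also enumerates all subsets of each transaction, which is only practical for short transactions.

```python
from itertools import combinations

def proper_subsets(v):
    """Yield every non-empty proper subset of itemset v."""
    items = sorted(v)
    for size in range(1, len(items)):
        for combo in combinations(items, size):
            yield frozenset(combo)

def carma_phase1_sketch(transactions, min_support, prune_every=2):
    lattice = {}  # itemset -> {"count": occurrences seen, "first": insert index}

    def upper_bound(v, i):
        # Assumption: v may have occurred in every transaction before
        # it was inserted, so add first - 1 possible missed occurrences.
        entry = lattice[v]
        return (entry["count"] + entry["first"] - 1) / i

    for i, t in enumerate(transactions, start=1):
        t = frozenset(t)
        # Step 1: increment the count of lattice itemsets contained in T.
        for v, entry in lattice.items():
            if v <= t:
                entry["count"] += 1
        # Step 2: insert subsets of T whose proper subsets were all already
        # in the lattice (before this transaction) and look frequent.
        known = set(lattice)
        for size in range(1, len(t) + 1):
            for combo in combinations(sorted(t), size):
                v = frozenset(combo)
                if v in lattice:
                    continue
                if all(w in known and upper_bound(w, i) >= min_support
                       for w in proper_subsets(v)):
                    lattice[v] = {"count": 1, "first": i}
        # Step 3: periodically prune multi-item itemsets with low support.
        if i % prune_every == 0:
            stale = [v for v in lattice
                     if len(v) > 1 and upper_bound(v, i) < min_support]
            for v in stale:
                del lattice[v]
    return lattice

data = [{"a", "b"}, {"a", "b"}, {"a", "c"}, {"a", "b"}]
lattice = carma_phase1_sketch(data, min_support=0.5)
for itemset, stats in lattice.items():
    print(sorted(itemset), stats)
```

Because an itemset is inserted only after its subsets have been seen, its count can undercount the true support. That is why a second phase is needed to determine precise support.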

Question 11

Q

What is Phase 2 of CARMA?

Answer

A

Determines the precise support of all itemsets
Removes infrequent itemsets and their supersets to reduce lattice size (downward closure principle)
Generates rules from the lattice of frequent itemsets
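
A minimal sketch of these steps follows, assuming a list of candidate itemsets is already available from Phase I. The function name and parameters are hypothetical, and the sketch simply recounts every candidate over the whole dataset, which is one straightforward way to obtain precise support.

```python
from itertools import combinations

def carma_phase2_sketch(transactions, candidates, min_support, min_confidence):
    n = len(transactions)
    sets = [frozenset(t) for t in transactions]

    # Determine the precise support of every candidate itemset.
    support = {v: sum(v <= t for t in sets) / n for v in candidates}

    # Remove infrequent itemsets. By downward closure, a superset of an
    # infrequent itemset is also infrequent, so this filter drops it too.
    frequent = {v: s for v, s in support.items() if s >= min_support}

    # Generate rules: split each frequent itemset into antecedent and
    # consequent. The consequent may contain more than one item.
    rules = []
    for v, s in frequent.items():
        for size in range(1, len(v)):
            for combo in combinations(sorted(v), size):
                antecedent = frozenset(combo)
                if antecedent not in frequent:
                    continue
                confidence = s / frequent[antecedent]
                if confidence >= min_confidence:
                    rules.append((antecedent, v - antecedent, s, confidence))
    return frequent, rules

data = [{"a", "b"}, {"a", "b"}, {"a", "c"}, {"a", "b"}]
candidates = [frozenset(x) for x in ({"a"}, {"b"}, {"c"}, {"a", "b"})]
frequent, rules = carma_phase2_sketch(data, candidates, 0.5, 0.7)
for antecedent, consequent, s, conf in rules:
    print(sorted(antecedent), "->", sorted(consequent), s, conf)
```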

Question 12

Q

What are the advantages of ARSD?

Answer

A

In general, ARSD can be used to organise different rules that have part-whole relationships into a diagrammatic representation. There are two interesting properties that can be used to validate the correctness of ARSD. Firstly, the support percentage of the second rule must be less than or equal to the support percentage of the first rule (i.e., y ≤ w). Secondly, the number of arrows corresponds to the number of rules represented by ARSD.
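
The first property follows from how support is counted: a rule whose itemset contains another rule's itemset cannot match more transactions. The check below is a small illustration with made-up transactions and two hypothetical rules, A -> B with support w and A, B -> C with support y.

```python
# Made-up transactions for illustration only.
transactions = [
    {"A", "B", "C"},
    {"A", "B"},
    {"A", "C"},
    {"B"},
]

def rule_support(items, transactions):
    """Fraction of transactions containing all items of the rule."""
    items = set(items)
    return sum(items <= t for t in transactions) / len(transactions)

w = rule_support({"A", "B"}, transactions)       # first rule:  A -> B
y = rule_support({"A", "B", "C"}, transactions)  # second rule: A, B -> C

# Adding an item to a rule can only keep or shrink its support.
print(w, y, y <= w)
```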

Question 13

Q

Why use the Both role instead of Target or Input?

Answer

A

We want the software to consider each attribute as a possible antecedent or consequent.

Question 14

Q

Why use the None role?

Answer

A

Fields given the None role are excluded because they will not be used in this illustration.

Question 15

Q

Advantages of CARMA?

Answer

A

> Efficient
- uses less space/time than Apriori
- needs at most two scans of the data to produce the rules
> Online, interactive, feedback-oriented technique: the user can continuously change the support threshold during the process.
> Suitable for learning from large datasets, and for cases where transactions are read from a network.

Question 16

Q

What is the Web node used for?

Answer

A

A link line indicates a relation between two nodes A and B; the darker or wider the line, the stronger the relation.
However, the line does not show which item is the antecedent and which is the consequent, so the Web node is usually used for visualization of possible associations.
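
The kind of undirected link strength a web display conveys can be approximated by counting how often two items appear together. This is a sketch under that assumption, with made-up transactions and a hypothetical function name; it is not the computation of any particular tool.

```python
from collections import Counter
from itertools import combinations

def co_occurrence(transactions):
    """Count how many transactions contain each unordered pair of items."""
    links = Counter()
    for t in transactions:
        # Sorting makes (a, b) and (b, a) the same key: no direction.
        for a, b in combinations(sorted(t), 2):
            links[(a, b)] += 1
    return links

baskets = [
    {"bread", "milk"},
    {"bread", "milk", "butter"},
    {"milk"},
]
links = co_occurrence(baskets)
for pair, strength in links.most_common():
    print(pair, strength)
```

Each pair is stored once with no direction, which mirrors why such a display cannot say which item is the antecedent and which is the consequent.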

(16 cards)