UNIT 2 - Introduction To Continuous Association Rule Mining Algorithm (CARMA) Flashcards

1
Q

Why use CARMA?

A
  • Efficient: uses less space and time than Apriori, and needs at
    most two scans of the data to produce the rules
  • CARMA uses rule support instead of the antecedent
    support used by Apriori
  • Allows rules with multiple consequents
  • Allows the support threshold to be changed during execution
  • Limitation: only supports binary/flag variables
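
The difference between antecedent support and rule support can be illustrated with a small sketch. The basket data and the `support` helper below are hypothetical and only meant to show how the two measures are computed for a rule such as bread => milk; they are not taken from any particular tool.

```python
# Hypothetical toy basket data; each transaction is a set of items.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
    {"bread", "milk"},
]

def support(itemset, transactions):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)

antecedent = {"bread"}
consequent = {"milk"}

# Antecedent support: how often the "if" part occurs on its own.
antecedent_support = support(antecedent, transactions)
# Rule support: how often the "if" and "then" parts occur together.
rule_support = support(antecedent | consequent, transactions)
# Confidence: share of antecedent transactions that also hold the consequent.
confidence = rule_support / antecedent_support

print(antecedent_support, rule_support, confidence)
```

Rule support can never exceed antecedent support, because every transaction that contains the whole rule also contains its antecedent.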
2
Q

What are the two phases of CARMA?

A

Phase I
identifies frequent itemsets in the data by constructing a lattice of all potentially frequent itemsets. The lattice is built in a single scan of the data.

Phase II
removes itemsets that are not frequent and then generates rules from the lattice of frequent itemsets

3
Q

What should be considered when choosing a method for association analysis?

A

a) characteristics of data to be analysed

b) strengths and weaknesses of the techniques under consideration.

4
Q

What should be done to numeric inputs before using APRIORI and CARMA?

A

Numeric fields have to be categorized or binned
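
A minimal sketch of this preparation step, assuming a hypothetical numeric `age` field: the value is first binned into categories, and each category is then expanded into a binary flag field of the kind CARMA expects. The cut points and field names are illustrative assumptions only.

```python
def bin_age(age):
    """Map a numeric age to a category label (cut points are illustrative)."""
    if age < 30:
        return "young"
    if age < 50:
        return "middle"
    return "senior"

def to_flags(category, categories):
    """Expand one categorical value into binary flag fields."""
    return {f"age_{c}": (c == category) for c in categories}

categories = ["young", "middle", "senior"]
ages = [23, 41, 67]

binned = [bin_age(a) for a in ages]                 # categorical version
flags = [to_flags(b, categories) for b in binned]   # binary/flag version

for age, row in zip(ages, flags):
    print(age, row)
```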

5
Q

CARMA can handle inputs with more than two categories whereas Apriori can only handle binary inputs. True or False?

A

False.

CARMA can only handle binary inputs, whereas Apriori can handle inputs with more than two categories.

6
Q

If rules with many consequents are desired, which is the preferred method to use?

A

CARMA

7
Q

Which methods are flexible in terms of choice in evaluation methods?

A

CARMA and APRIORI

The Apriori method allows a choice of four different rule evaluation measures, as discussed previously, whereas the CARMA method uses rule support and rule confidence for rule evaluation. In addition, CARMA allows users to specify the rule size and to vary the support threshold.

8
Q

What is ARSD?

A

Association Rule Summary Diagram.
It assembles related rules into a single diagram that is succinct and easier to interpret.

9
Q

What is ARSD used for?

A

- Assembling related rules into a single diagram that is succinct and easier to interpret.
- Presenting mining results to a non-technical audience, such as business managers.

10
Q

What is Phase 1 of CARMA?

A
  1. Increment the count of every itemset in the lattice L that also occurs in the current transaction T
  2. For each subset v of T, if v is not in L and all subsets of v are in L, insert v into L and update the statistics of v (this provides the continuous feedback)
  3. Prune the lattice by removing itemsets with low support (< Min Rule Support)
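
The three steps above can be sketched roughly as follows. This is a simplified illustration, not the published CARMA algorithm: the function name `phase1`, the `max_size` cap, and the bookkeeping are assumptions. In particular, the only statistics kept per itemset are a count and the transaction number at which it was inserted, and pruning uses a loose upper bound (count plus the number of transactions seen before insertion) in place of the tighter bounds the real algorithm maintains.

```python
from itertools import combinations

def phase1(transactions, min_support, max_size=3):
    """Single pass over the data; returns {itemset: [count, first_seen]}."""
    lattice = {}
    for n, t in enumerate(transactions, start=1):
        t = frozenset(t)
        # Step 1: increment every lattice itemset that occurs in T.
        for v, stats in lattice.items():
            if v <= t:
                stats[0] += 1
        # Step 2: insert new subsets of T, smallest first, when all of
        # their immediate subsets are already in the lattice.
        for size in range(1, min(len(t), max_size) + 1):
            for combo in combinations(sorted(t), size):
                v = frozenset(combo)
                if v in lattice:
                    continue
                parents = [v - {item} for item in v if len(v) > 1]
                if all(p in lattice for p in parents):
                    lattice[v] = [1, n]  # count, first transaction seen
        # Step 3: prune itemsets that could not reach min_support even if
        # they had occurred in every transaction before their insertion,
        # plus any itemset left without one of its immediate subsets.
        for v in sorted(lattice, key=len):
            count, first_seen = lattice[v]
            too_rare = (count + first_seen - 1) / n < min_support
            orphaned = any(v - {i} not in lattice for i in v if len(v) > 1)
            if too_rare or orphaned:
                del lattice[v]
    return lattice

# Hypothetical usage with four small transactions.
data = [{"a", "b"}, {"a", "b"}, {"a", "c"}, {"a", "b"}]
for itemset, (count, first_seen) in phase1(data, min_support=0.5).items():
    print(sorted(itemset), count, first_seen)
```

Because an itemset inserted late may have been missed in earlier transactions, the counts from this pass are only lower bounds, which is why a second phase is needed to determine precise support.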
11
Q

What is Phase 2 of CARMA?

A
  1. Determines the precise support of all itemsets
  2. Removes infrequent itemsets and their supersets to reduce lattice size (downward closure principle)
  3. Generates rules from the lattice of frequent itemsets
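
A rough sketch of these three steps, assuming the candidate itemsets from Phase 1 are available as a collection of frozensets. The function name `phase2` and the `min_confidence` parameter are illustrative assumptions rather than part of any specific implementation.

```python
from itertools import combinations

def phase2(candidates, transactions, min_support, min_confidence):
    transactions = [frozenset(t) for t in transactions]
    n = len(transactions)
    # Step 1: a second scan gives the precise support of every candidate.
    support = {v: sum(v <= t for t in transactions) / n for v in candidates}
    # Step 2: drop infrequent itemsets. A superset of an infrequent itemset
    # cannot be frequent either, so it is dropped as well (with exact
    # supports this check is redundant, but it shows the principle).
    infrequent = [v for v, s in support.items() if s < min_support]
    frequent = {
        v: s for v, s in support.items()
        if s >= min_support and not any(bad <= v for bad in infrequent)
    }
    # Step 3: split each frequent itemset into antecedent => consequent.
    # The consequent may contain more than one item.
    rules = []
    for v, rule_support in frequent.items():
        for size in range(1, len(v)):
            for combo in combinations(sorted(v), size):
                antecedent = frozenset(combo)
                if antecedent not in frequent:
                    continue
                confidence = rule_support / frequent[antecedent]
                if confidence >= min_confidence:
                    rules.append(
                        (antecedent, v - antecedent, rule_support, confidence)
                    )
    return frequent, rules

# Hypothetical usage: candidates as they might come out of Phase 1.
data = [{"a", "b"}, {"a", "b"}, {"a", "c"}, {"a", "b"}]
cands = [frozenset(s) for s in ({"a"}, {"b"}, {"c"}, {"a", "b"}, {"a", "c"})]
frequent, rules = phase2(cands, data, min_support=0.5, min_confidence=0.7)
for antecedent, consequent, sup, conf in rules:
    print(sorted(antecedent), "=>", sorted(consequent), sup, conf)
```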
12
Q

What are the advantages of ARSD?

A

In general, ARSD can be used to organise different rules that have part-whole relationships into a diagrammatic representation. There are two interesting properties that can be used to validate the correctness of ARSD. Firstly, the support percentage of the second rule must be less than or equal to the support percentage of the first rule (i.e., y ≤ w). Secondly, the number of arrows corresponds to the number of rules represented by ARSD.
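
The first property can be checked with a short sketch. It assumes that the second rule extends the first by one more item (a part-whole relationship), and it uses hypothetical basket data; `w` and `y` follow the notation above.

```python
def rule_support(items, transactions):
    """Fraction of transactions that contain all items of the rule."""
    items = frozenset(items)
    return sum(items <= frozenset(t) for t in transactions) / len(transactions)

# Hypothetical toy basket data.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
    {"bread", "milk"},
]

# First rule: bread => milk. Second rule: bread, milk => butter.
w = rule_support({"bread", "milk"}, transactions)
y = rule_support({"bread", "milk", "butter"}, transactions)

# Every transaction matching the second rule also matches the first,
# so the second rule's support cannot be larger.
print(w, y, y <= w)
```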

13
Q

Why use the Both role instead of Target or Input?

A

We want the software to consider each attribute as a possible antecedent or consequent.

14
Q

Why use the None role?

A

Fields set to None will not be used in this illustration.

15
Q

Advantages of CARMA?

A

- Efficient: uses less space and time than Apriori, and needs at most two scans of the data to produce the rules.
- Online, interactive, feedback-oriented technique: the user can change the support threshold continuously during the process.
- Suitable for learning from large datasets, and for cases where transactions are read from a network.

16
Q

What is the Web node used for?

A
  • A link line indicates a relation between two nodes A and B; the
    darker or wider the line, the stronger the relation.
  • The link does not show which node is the antecedent and which is the consequent, so the Web node is usually used to visualize possible associations.
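
As a rough illustration of what such a link could encode, the sketch below counts how often each pair of items occurs together. This is an assumption about how link strength might be derived, not a description of how the Web node computes it; the function name `link_strengths` and the data are hypothetical. Note that the pairs are unordered, which mirrors the fact that the diagram does not distinguish antecedent from consequent.

```python
from collections import Counter
from itertools import combinations

def link_strengths(transactions):
    """Count co-occurrences for every unordered pair of items."""
    links = Counter()
    for t in transactions:
        for a, b in combinations(sorted(set(t)), 2):
            links[(a, b)] += 1
    return links

# Hypothetical toy basket data.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
    {"bread", "milk"},
]

links = link_strengths(transactions)
# Stronger links (higher counts) would be drawn darker or wider.
for (a, b), count in links.most_common():
    print(a, "-", b, count)
```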