lec 8(done) Flashcards

1
Q

data transformation

A

the data are transformed or consolidated into forms appropriate for mining, so that:

1-The resulting mining process may be more efficient.
2-The patterns found may be easier to understand.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Data Transformation Strategies:

A

1-Attribute construction
2-Aggregation
3-Normalization
4-Discretization

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Attribute construction (or feature construction)

A

1-New attributes are constructed and added from the given set of attributes to help the mining process.

2-Can help improve accuracy and understanding of structure in high dimensional data.

For example, we may wish to add the attribute areabased on the attributes heightand width.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Aggregation

A

Summary or aggregation operations are applied to the data.
Typically used in constructing a data cube for data analysis at multiple abstraction levels.

For example, the daily sales data may be aggregated as to compute monthly and annual total amounts.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Normalization :

A

1-The attribute data are scaled so as to fall within a smaller range
such as [-1,1] or [0.0,1.0]
2-Helps avoid dependence on the choice of measurement units.
3-Normalizing the data attempts to give all attributes an equal weight.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Data Normalization Methods:

A

1-min-max normalization
2-z-score normalization
3-normalization by decimal scaling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Min –Max Normalization

A

slide 7

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Z-Score Normalization

A

slide 8

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Normalization by Decimal Scaling

A

slide 9

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Data discretization

A

transforms numeric data by mapping values to interval or concept labels

slide 10

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Discretization techniques

A

1-Binning
2-Histogram analysis
3-Cluster analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

For Nominal data=> Concept hierarchy generation can be used to transform the data into multiple levels of granularity

Example: street attribute can be generalized to higher-level concepts, like city or country.

A

slide 11

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Concept Hierarchy Generation for Nominal Data

A

slide 12-13

How well did you know this?
1
Not at all
2
3
4
5
Perfectly