KDD Process Steps Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

What are the steps in KDD process? (1st)

A

Data Cleaning:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is Data Cleaning?

A

Removal of noisy and irrelevant data from datasets

  • Cleaning in case of Missing values.
  • Cleaning noisy data, where noise is a random or variance error.
  • Cleaning with Data discrepancy detection and Data transformation tools.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the steps in KDD process? (2nd)

A

Data Integration

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the steps in KDD process? (3rd)

A

Data Selection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the steps in KDD process? (4th)

A

Data Transformation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the steps in KDD process? (5th)

A

Data Mining

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the steps in KDD process? (6th)

A

Pattern Evaluation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the steps in KDD process? (7th)

A

Knowledge representation or Data Visualisation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is Data Integration?

A

heterogeneous data from multiple sources combined in a common source (DataWarehouse).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does Data Integration consist of?

A
  1. Data Migration tools
  2. Data Synchronization tools
  3. ETL(Extract-Transform-Load) process
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Data Selection?

A

The process where data relevant to the analysis is decided and retrieved from the data collection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does Data selection uses?

A
  1. Neural network.
  2. Decision Trees.
  3. Naive bayes.
  4. Clustering, Regression, etc.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is Data Transformation?

A

The process of transforming data into appropriate form required by mining procedure.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the Data Transformation process consist of?

A

Data Mapping: Assigning elements from source base to destination to capture transformations.

Code generation: Creation of the actual transformation program.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is Data Mining?

A

Clever techniques that are applied to extract patterns potentially useful.
- Extraction of interesting potentially useful patterns
or knowledge from huge amount of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does data mining do?

A
  1. Transforms task relevant data into patterns.

2. Decides purpose of model using classification or characterization.

17
Q

What is Pattern Evaluation?

A

Identifying strictly increasing patterns representing knowledge based on given measures.

18
Q

What does Pattern Evaluation do?

A
  1. Find interestingness score of each pattern.

2. Uses summarization and Visualization to make data understandable by user.

19
Q

What is Knowledge representation?

A

It is a technique which utilizes visualization tools to represent data mining results.

20
Q

How does knowledge representation work?

A
  1. Generate reports
  2. Generate tables
  3. Generate discriminant rules, classification rules, characterization rules, etc
21
Q

Additional Information

A
  1. KDD is an iterative process where evaluation measures can be enhanced, mining can be refined, new data can be integrated and transformed in order to get different and more appropriate results.
  2. Preprocessing of databases consists of Data cleaning and Data Integration.