Chapter 16 Flashcards

1
Q

Does data transformation also includes data cleansing

A

Yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is ETL cycle

A

From where we start and then which thing comes on the way through ETL

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is load

A

The process of writing the data into the target source

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is transform

A

The process of transforming the extracted data from its original state into a consistent state so that it can be placed into another database.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is extract

A

The process of reading data from different sources

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the first step in ETL

A

Data extraction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are types of data extraction

A
  • Logical extraction

- Physical extraction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are logical extraction types

A
  • Full extraction

- Incremental extraction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are physical extraction types

A
  • Online extraction
  • Offline extraction
  • Legacy vs OLTP
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is full extraction

A

Extract all data from the system with out taking extra information.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is incremental extraction

A

Does not drag all data at once, but get data in chunks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is online extraction

A

Keep the system on while extracting data and use a intermediary system for data transformation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is offline extraction

A

Data extraction not from source. First data to be saved in dump files, databases or other sources. and then move to destination.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Please explain OLTP vs legacy systems

A

Data in hand written sheets. First data entered in system from hand written sheets and then entered in destination.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are basic steps for data transformation

A
1- Selection
2- Splitting/joining
3- Conversion
4- Summarization
5- Enrichment (gather data in unified format and insert missing data chunks)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are data loading strategies

A
  • Data freshness
  • System performance (bulk loading is more efficient than frequent loading)
  • Data volatility
17
Q

What are 3 data loading strategies after transformation

A
  • Full data refresh (Load empty dataware house from data, loading speed faster)
  • Incremental data refresh (Load dataware house which already loaded)
  • Continuous feed (It informs hacking or disaster immediately)