Chapter 44 Flashcards

1
Q

What is Data profiling

A

Data profiling is a process which involves gathering of information about column through execution of certain queries with intention to identify erroneous records. It should also provide us a detailed view about the quality of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is standardization

A

Standardization process involves the consistency of number and types of columns, date formats, and storing conventions in database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What we identify during data profiling

A
  • Total number of values in a column
  • Number of distinct values in a column
  • Domain of a column
  • Values out of domain of a column
  • Validation of business rules
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How much time we should perform data profiling for the effectiveness of transformation

A

Twice. Once before transformation and once after transformation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What function is used for + or - in dates

A

CAST function

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is golden copy

A

original copy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly