DAT Data Quality Flashcards

1
Q

Define Data Profiling

A

The process of doing the initial data assessment for a data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the steps in Data Profiling?

A
  1. Individual fields - Check content against data/domain definitions. Count violations and nulls.
  2. Tables - Check inter-field relationships and keys. Check inter-table relationships.
  3. Check whole data set - Are business rules fulfilled?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Data Quality?

A

Rather than absolute quality, data quality is data being of sufficient quality for the purpose it’s being used for. This depends on how it’s used.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How can errors be handled?

A

Accept
Reject
Correct (rectify)
Estimate (Interpolate/extrapolate)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the Data Quality dimensions

A

AACCTUV

Accuracy
Appropriateness (correct data to solve problem)
Completeness (relevant data, not missing)
Consistency (compatible sources)
Timeliness (suitable temporal period)
Uniqueness (each record corresponds to a real world item)
Validity (data follows rules)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the acronym relating to Data Security?

A

CIA
Confidentiality (auth access only)
Integrity (auth changes only)
Availability (when needed, may use SLA [Service Level Agreement])

How well did you know this?
1
Not at all
2
3
4
5
Perfectly