Data Quality Flashcards
Data Quality Indicators (TARMAC)
Trackability, Acceptibility, Relevance, Measureability, Accountability, Controllability
TARMAC - trackability
Can measure data quality over time
TARMAC - Acceptability
be able to define what good looks like
TARMAC - Relevance
Make sure measuring something relevant to the business
TARMAC - Measureability
What will actually be measured? how can it be measured?
TARMAC - Accountability/Stewardship
Who will be held accountable if it goes wrong?
TARMAC - Controllability
Defining remedial actions in advance of the thing going wrong
What must you know to be able to define quality quality?
the purpose of use of the data
How can you manage data quality when you don’t know the purpose?
don’t over assume - stick to basic validity
Reference Data
data not subject to change e.g., identifiers
Master Data
Descriptive attributes of business entities
What can be defined as data standards?
data types, acceptable values, attribute domains, metadata format
What are the two type of data quality management?
- Governing/Strategic
- Tactical
What is tactical data quality management?
short terms fixing of problems
What is governing/strategic data quality management?
Overarching long term goals e.g., root cause analysis
Common DQ mistakes
- failing to consider the intended use of the data
- Confusing Validity and accuracy
- treating it as a one time activity
- not fixing at the source
- applying software quality principle’s
- laziness, blaming the system
- believing good data quality is the end goal (not the use of that data)
- believing that quantity beats quality
Data quality firewall
taking external data and applying data cleansing before it is stored in the DB
Impacts of of poor data
- aggravation
- loss of reputation
- loss of business
- regulatory risk
- loss of life
Why is it important to communicate the cost of poor quality data?
to raise awareness
Data Quality Management Cycle
Plan -> Deploy -> Monitor -> Action
What are the four data quality governance steps
- standardisation
- assignment
- escalation
- completion
Causes of data issues
Human causes
Organisational (system)
Physical causes
roles of a the data quality oversight board
- setting data quality improvement priorities
- establishing communications & feedback mechanisms
- producing certification & compliance policies
- approving data quality strategies