W8 ni aya Flashcards
is the overall utility of a dataset as a function of its ability to be
easily processed and analyzed for a database, data warehouse, or data analytics
system.
DATA QUALITY
(RDQA) is a simplified version of
the Data Quality Audit (DQA) which allows programs and projects to verify
and assess the quality of their reported data.
Routine Data Quality Assessment Tool
An _________ is a project management tool that shows how a
project will evolve at a high level.
Implementation Plan
analyze information and identify
incomplete or incorrect data. Cleansing such data follows after the completion of
the profiling of data concerns, and could range anywhere from removing
abnormalities to merging repeated information.
Data Quality Tools
A _________ is a class of problem-solving methods aimed at
identifying the root causes of problems or events instead of simply addressing the obvious symptoms.
Root Cause Analysis
Answers the question “What do you want to accomplish?”
Define Goals/Objectives
Outline the high-level schedule in the implementation phase
Schedule Milestones
Determine whether you have sufficient resources, and decide how you will procure what’s missing
Allocate Resources
Create a general team plan with overall roles that each team member will play.
Designate Team Member Responsibilities
How will you determine if you have achieved your goal? (Smartsheet, 2017)
Define Metrics for Success
Refers to the decomposition of fields into component parts and the formatting of values into consistent layouts based on industry standards, patterns, and user-defined business rules
PARSING AND STANDARDIZATION
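As a small illustration of this card, the sketch below parses a phone number into its component parts and formats it into one consistent layout. The function name `standardize_phone`, the 10-digit assumption, and the `NNN-NNN-NNNN` layout are hypothetical choices for the example, not rules taken from the source.

```python
import re


def standardize_phone(raw: str) -> str:
    """Parse a 10-digit phone number and format it as NNN-NNN-NNNN.

    The 10-digit rule and the output layout are assumptions for this sketch.
    """
    digits = re.sub(r"\D", "", raw)  # parsing step: keep only the digits
    if len(digits) != 10:
        raise ValueError(f"expected 10 digits, got {len(digits)}")
    # Decompose the field into its component parts.
    area, prefix, line = digits[:3], digits[3:6], digits[6:]
    # Standardization step: emit one consistent layout.
    return f"{area}-{prefix}-{line}"


print(standardize_phone("(555) 123 4567"))
```

Inputs written in different styles, such as `555.123.4567` or `(555) 123 4567`, all end up in the same layout, which is the point of standardization.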
Means the modification of data values to meet domain restrictions, integrity constraints, or other rules that define data quality as sufficient for the organization.
GENERALIZED CLEANSING
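A minimal sketch of what such cleansing might look like follows. The allowed status values, the age range of 0 to 120, and the function name `cleanse_record` are all hypothetical rules invented for the example.

```python
ALLOWED_STATUS = {"active", "inactive"}  # hypothetical domain restriction


def cleanse_record(record: dict) -> dict:
    """Return a copy of the record with values modified to fit the rules."""
    cleaned = dict(record)

    # Domain restriction: status must be one of the allowed values.
    status = str(record.get("status", "")).strip().lower()
    cleaned["status"] = status if status in ALLOWED_STATUS else "unknown"

    # Range rule (assumed): age must be an integer between 0 and 120.
    age = record.get("age")
    cleaned["age"] = age if isinstance(age, int) and 0 <= age <= 120 else None

    return cleaned


print(cleanse_record({"status": " Active ", "age": 200}))
```

Values that can be repaired (extra spaces, mixed case) are fixed in place, while values that violate a rule outright are replaced with an explicit placeholder.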
This is the identification and merging of related entries within or across data sets
MATCHING
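One simple way to picture matching is to treat two entries as related when they share a normalized key, then merge their fields. The sketch below assumes an `email` field is the match key and that the first non-empty value wins; real matching tools often use fuzzier comparisons, so this is an illustration only.

```python
def merge_matches(records: list[dict]) -> list[dict]:
    """Merge records that share the same normalized email address."""
    merged: dict[str, dict] = {}
    for rec in records:
        # Identification step: normalize the key so near-duplicates match.
        key = rec["email"].strip().lower()
        target = merged.setdefault(key, {"email": key})
        # Merging step: keep the first non-empty value seen for each field.
        for field, value in rec.items():
            if field != "email" and value and not target.get(field):
                target[field] = value
    return list(merged.values())


records = [
    {"email": "Ana@Example.com", "name": "Ana", "phone": ""},
    {"email": "ana@example.com ", "name": "", "phone": "555-0100"},
]
print(merge_matches(records))
```

The two input rows describe the same person, so they collapse into one entry that carries both the name and the phone number.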
Refers to the analysis of data to capture statistics or metadata to determine the quality of data and identify data quality issues
PROFILING
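To make the idea concrete, the sketch below captures three basic statistics for one field: how many rows there are, how many values are missing, and how many distinct values appear. The function name `profile` and the choice of statistics are assumptions for the example.

```python
def profile(rows: list[dict], field: str) -> dict:
    """Capture simple statistics about one field across all rows."""
    values = [row.get(field) for row in rows]
    # Treat None and the empty string as missing (an assumed convention).
    present = [v for v in values if v not in (None, "")]
    return {
        "count": len(values),
        "missing": len(values) - len(present),
        "distinct": len(set(present)),
    }


rows = [{"city": "Manila"}, {"city": ""}, {"city": "Manila"}, {}]
print(profile(rows, "city"))
```

A high missing count or an unexpectedly large distinct count is the kind of signal that points to a data quality issue worth investigating.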
The deployment of controls to ensure conformity of data to the business rules set by the organization
MONITORING
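A monitoring control can be sketched as a set of named business rules that are checked against incoming records. The two rules and the names `RULES` and `check` below are hypothetical; an organization would substitute its own rules.

```python
# Hypothetical business rules: each maps a rule name to a check on a record.
RULES = {
    "age_is_non_negative": lambda r: r.get("age", 0) >= 0,
    "email_has_at_sign": lambda r: "@" in r.get("email", ""),
}


def check(records: list[dict]) -> list[tuple]:
    """Return (record index, rule name) for every rule a record violates."""
    violations = []
    for i, rec in enumerate(records):
        for name, rule in RULES.items():
            if not rule(rec):
                violations.append((i, name))
    return violations


print(check([{"age": 5, "email": "a@b.c"}, {"age": -1, "email": "nope"}]))
```

Running such a check on a schedule and reporting the violations is one way to keep data conforming to the rules over time.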