Data Quality Flashcards
IT SIGNIFIES THE DATA’S APPROPRIATENESS TO SERVE ITS PURPOSE IN A GIVEN CONTEXT
Data Quality
Is the overall utility of a dataset(s) as a function of its ability to be processed easily and analyzed for a database, data warehouse, or data analytics system
Data Quality
Data Quality
Used in the areas of:
Customer relationship management (CRM)
Data integration
Regulation requirements
Generates costs and affects customer satisfaction, company reputation, and even the strategic decisions of management
Poor data quality
A tool that allows the use of small random samples (19 sxs) to distinguish between different groups of data elements (or lots) with high and low data quality
Has been widely applied in the health care industry for decades and used for quality assurance of products
Lot Quality Assurance Sampling (LQAS)
- Smallest sample size that can be used and still be statistically accurate. Samples larger than ____ are more expensive, while samples smaller than ____ are not accurate.
19 sxs
Is adopted in the context of District Health Information System (DHIS) data quality assurance (DQA)
Lot Quality Assurance Sampling (LQAS)
Formula for report timeliness rate
= (# of on-time reports / total # of reports for that section) x 100
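The timeliness formula above can be sketched in Python as follows. This is only an illustrative sketch: the function name report_timeliness_rate and the sample counts (15 on-time reports out of 20) are assumptions made for the example, not figures from the source.

```python
def report_timeliness_rate(on_time_reports: int, total_reports: int) -> float:
    """Return the share of on-time reports for a section, as a percentage."""
    if total_reports <= 0:
        raise ValueError("total_reports must be greater than zero")
    if not 0 <= on_time_reports <= total_reports:
        raise ValueError("on_time_reports must be between 0 and total_reports")
    # (# of on-time reports / total # of reports for that section) x 100
    return on_time_reports / total_reports * 100


# Hypothetical counts, chosen only to illustrate the arithmetic.
rate = report_timeliness_rate(on_time_reports=15, total_reports=20)
print(f"Report timeliness rate: {rate:.1f}%")
```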
Level of acceptable error =
70% +/-10% (60 - 80%)
• It is a simplified version of the Data Quality Audit (DQA) tool which allows programs and projects to verify and assess the quality of reported data
Routine Data Quality Assessment (RDQA)
Rapidly verify the quality of reported data
Implement corrective measures with action plans for strengthening data management and reporting system and improving data quality
RDQA
EXAMPLE: DENGUE PREVENTION AND CONTROL PROGRAM IN MINDANAO
• External auditors are important to check for flaws in the system and their visits can be more frequent, more organized, and less resource intensive to benefit the institution at the end of the day
RDQA
• A project management tool that illustrates how a project is expected to progress at a high level
• Important in ensuring the efficient flow of communication between those involved in the project
• Minimize issues that would delay delivery of the project
Development Implementation Plan
TOOLS THAT ARE CRUCIAL IN MAINTAINING ACCURACY & RELEVANCY IN HEALTH INFORMATION
Data Quality Tools
• Analyzes information and identifies incomplete or incorrect data
Data Quality Tools
- Removal of abnormalities in the data or of repeated information
Data cleansing
By maintaining ________, the process enhances the reliability of the information used by an organization
data integrity
• Decomposition of fields into component parts and formatting the values into consistent layouts based on industry standards and patterns and user-defined business rules
Parsing and standardization
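As a rough illustration of parsing and standardization, the sketch below splits a "Last, First" name field into component parts and rewrites dates into one consistent layout. The field layout, the list of accepted date formats, and the function names are all assumptions made for this example; a real system would follow its own industry standards and business rules.

```python
from datetime import datetime

# Hypothetical input layouts that this sketch accepts for dates.
KNOWN_DATE_FORMATS = ("%Y-%m-%d", "%m/%d/%Y", "%Y%m%d")


def parse_name(raw: str) -> dict:
    """Decompose a 'Last, First' field into parts with consistent casing."""
    last, _, first = raw.partition(",")
    # Collapse repeated whitespace and apply title case to each part.
    clean = lambda text: " ".join(text.split()).title()
    return {"last_name": clean(last), "first_name": clean(first)}


def standardize_date(raw: str) -> str:
    """Rewrite a date given in any known layout as YYYY-MM-DD."""
    text = raw.strip()
    for fmt in KNOWN_DATE_FORMATS:
        try:
            return datetime.strptime(text, fmt).date().isoformat()
        except ValueError:
            continue  # try the next known layout
    raise ValueError(f"unrecognized date layout: {raw!r}")


print(parse_name("  dela cruz,  juan "))
print(standardize_date("03/15/2024"))
```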
Is the modification of data values to meet domain restrictions, constraints on integrity, or other rules that define data quality as sufficient for the organization
Generalized cleansing
Identification and merging of related entries within or across data sets
Matching
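A minimal sketch of matching is shown below: entries are treated as related when a normalized name and birth date agree, and related entries are merged by keeping the first non-empty value for each field. The matching key, the field names, and the patient records are hypothetical; real matching tools often use fuzzier comparison than this exact-key approach.

```python
def match_key(record: dict) -> tuple:
    """Build a comparison key: lowercased name with collapsed spaces, plus birth date."""
    name = " ".join(record["name"].lower().split())
    return (name, record["birth_date"])


def match_and_merge(records: list) -> list:
    """Group records sharing a key and merge each group into one entry."""
    merged = {}
    for record in records:
        target = merged.setdefault(match_key(record), {})
        for field, value in record.items():
            # Keep the first non-empty value seen for each field.
            if value and not target.get(field):
                target[field] = value
    return list(merged.values())


# Hypothetical records; the first two describe the same person.
patients = [
    {"name": "Juan Dela Cruz", "birth_date": "1990-01-05", "phone": ""},
    {"name": "juan  dela cruz", "birth_date": "1990-01-05", "phone": "555-0100"},
    {"name": "Maria Santos", "birth_date": "1985-07-20", "phone": "555-0199"},
]
for entry in match_and_merge(patients):
    print(entry)
```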
Refers to the analysis of data to capture statistics or metadata to determine the quality of the data and identify data quality issues
Profiling
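Profiling can be illustrated with a small sketch that captures per-field statistics (missing values, distinct values, completeness) so that quality issues become visible. The choice of statistics, the function name profile, and the sample rows are assumptions for this example only.

```python
def profile(records: list) -> dict:
    """Collect simple per-field statistics from a list of dict records."""
    fields = sorted({field for record in records for field in record})
    stats = {}
    for field in fields:
        values = [record.get(field) for record in records]
        # Treat None and empty strings as missing.
        present = [v for v in values if v is not None and v != ""]
        stats[field] = {
            "missing": len(values) - len(present),
            "distinct": len(set(present)),
            "completeness_pct": 100 * len(present) / len(values),
        }
    return stats


# Hypothetical rows with gaps in the "sex" field.
rows = [
    {"id": 1, "sex": "F"},
    {"id": 2, "sex": ""},
    {"id": 3, "sex": "F"},
    {"id": 4},
]
for field, field_stats in profile(rows).items():
    print(field, field_stats)
```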
Refers to the deployment of controls to ensure conformity of data to business rules set by the organization
Monitoring
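One way to picture monitoring is as a set of business rules applied to every record, with each violation reported. The sketch below assumes two hypothetical rules (an age range and a coded sex value); the rule names, thresholds, and records are illustrative and not taken from the source.

```python
# Hypothetical business rules: each maps a rule name to a check on one record.
RULES = {
    "age_in_range": lambda record: 0 <= record["age"] <= 120,
    "sex_is_coded": lambda record: record["sex"] in {"M", "F"},
}


def check_conformity(records: list, rules: dict = RULES) -> list:
    """Return (record index, rule name) pairs for every rule a record breaks."""
    violations = []
    for index, record in enumerate(records):
        for rule_name, rule in rules.items():
            if not rule(record):
                violations.append((index, rule_name))
    return violations


# Hypothetical records; the second and third each break one rule.
entries = [
    {"age": 34, "sex": "F"},
    {"age": 150, "sex": "M"},
    {"age": 20, "sex": "x"},
]
print(check_conformity(entries))
```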
Enhancement of the value of the data by using related attributes from external sources such as consumer demographic attributes or geographic descriptors
Enrichment
PROBLEM SOLVING METHOD THAT IDENTIFIES THE “ROOT CAUSE” OF PROBLEMS OR EVENTS INSTEAD OF SIMPLY ADDRESSING THE OBVIOUS SYMPTOMS.
Root Cause Analysis