Metadata Management Flashcards
The definition of ___________ is the planning, implementation, and control activities to enable access to high-quality, integrated metadata.
metadata management
Describe the goals of metadata management (per the DMBoK).
Goals:
1. Provide organizational understanding of business terms and usage.
2. Collect and integrate metadata from diverse sources.
3. Provide a standard way to access metadata.
4. Ensure metadata quality and security.
What type of metadata focuses largely on the content and condition of the data and includes details related to data governance?
Business metadata
Name 3 examples of business metadata
- Definitions and descriptions of data sets, tables, and columns
- Business rules, transformation rules, calculations, and derivations
- Data models
- Data quality rules and measurement results
- Schedules by which data is updated
- Data provenance and data lineage
- Data standards
- Designations of the system of record for data elements
- Valid value constraints
- Stakeholder contact information (e.g., data owners, data stewards)
- Security/privacy level of data
- Known issues with data
- Data usage notes
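Several of the business metadata items above (definition, steward contact, system of record, valid value constraints, security level) can be pictured as fields on one glossary entry. The following is a minimal sketch only; the `BusinessTerm` class, its field names, and the sample values are assumptions made for illustration and do not come from the DMBoK.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BusinessTerm:
    """One hypothetical glossary entry carrying business metadata."""
    name: str
    definition: str
    steward: str            # stakeholder contact information
    system_of_record: str   # designated system of record
    valid_values: tuple     # valid value constraint
    security_level: str     # security/privacy level of the data


def violations(term, observed_values):
    """Return the observed values that break the term's valid value constraint."""
    return [value for value in observed_values if value not in term.valid_values]


# Illustrative entry; every value here is made up for the example.
account_status = BusinessTerm(
    name="Account Status",
    definition="Current lifecycle state of a customer account.",
    steward="jane.doe@example.com",
    system_of_record="CRM",
    valid_values=("active", "closed", "suspended"),
    security_level="internal",
)

bad_values = violations(account_status, ["active", "ACTIVE", "pending"])
print(bad_values)
```

The check is case-sensitive on purpose, so "ACTIVE" is reported alongside "pending".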
What type of metadata provides information about the technical details of data, the systems that store data, and the processes that move it within and between systems?
Technical metadata
Give 3 examples of technical metadata
- Physical database table and column names
- Column properties
- Database object properties
- Access permissions
- Data CRUD (create, read, update, and delete) rules
- Physical data models, including data table names, keys, and indexes
- Documented relationships between the data models and the physical assets
- ETL job details
- File format schema definitions
- Source-to-target mapping documentation
- Data lineage documentation, including upstream and downstream change impact information
- Program and application names and descriptions
- Content update cycle job schedules and dependencies
- Recovery and backup rules
- Data access rights, groups, roles
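Much of the technical metadata listed above (physical table and column names, column properties, keys, indexes) already lives in a database's system catalog. The sketch below reads it from an in-memory SQLite database using Python's standard `sqlite3` module. The `customer` table, its columns, and the helper function names are hypothetical and exist only for this example.

```python
import sqlite3

# Hypothetical table and index, created only so there is a catalog to read.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE customer ("
    " customer_id INTEGER PRIMARY KEY,"
    " email TEXT NOT NULL,"
    " created_at TEXT)"
)
conn.execute("CREATE INDEX idx_customer_email ON customer (email)")


def describe_table(connection, table):
    """Return column-level technical metadata for one table."""
    # PRAGMA table_info yields: cid, name, type, notnull, dflt_value, pk
    rows = connection.execute(f"PRAGMA table_info({table})").fetchall()
    return [
        {
            "column": name,
            "type": col_type,
            "not_null": bool(notnull),
            "primary_key": bool(pk),
        }
        for _cid, name, col_type, notnull, _default, pk in rows
    ]


def list_indexes(connection, table):
    """Return the names of the indexes defined on one table."""
    # The index name is the second field of each PRAGMA index_list row.
    return [row[1] for row in connection.execute(f"PRAGMA index_list({table})")]


columns = describe_table(conn, "customer")
indexes = list_indexes(conn, "customer")
for column in columns:
    print(column)
print(indexes)
```

Other database engines expose the same kind of information through their own catalogs, so the queries would differ while the idea stays the same.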
Which type of metadata describes details of the processing and accessing of data?
Operational
Give 3 examples of operational metadata
- Logs of job execution for batch programs
- History of extracts and results
- Schedule anomalies
- Results of audit, balance, control measurements
- Error logs
- Reports and query access patterns, frequency, and execution time
- Patch and version maintenance plan and execution, current patching level
- Backup, retention, date created, disaster recovery provisions
- SLA requirements and provisions
- Volumetric and usage patterns
- Data archiving and retention rules, related archives
- Purge criteria
- Data sharing rules and agreements
- Technical roles and responsibilities, contacts
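Operational metadata such as job execution logs and volumetrics is usually captured per run and then summarized. Here is a small sketch of that idea; the `JobRun` record, its fields, and the sample runs are assumptions for illustration, not a format defined by the DMBoK.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class JobRun:
    """One hypothetical batch job execution record."""
    job_name: str
    started_at: datetime
    finished_at: datetime
    status: str        # "success" or "failed"
    rows_written: int  # simple volumetric measure


# Made-up log entries for a nightly load job.
runs = [
    JobRun("load_orders", datetime(2024, 1, 1, 2, 0), datetime(2024, 1, 1, 2, 10), "success", 1000),
    JobRun("load_orders", datetime(2024, 1, 2, 2, 0), datetime(2024, 1, 2, 2, 30), "failed", 0),
    JobRun("load_orders", datetime(2024, 1, 3, 2, 0), datetime(2024, 1, 3, 2, 14), "success", 1200),
]


def summarize(job_runs):
    """Aggregate run count, failures, average duration, and volume."""
    minutes = [(r.finished_at - r.started_at).total_seconds() / 60 for r in job_runs]
    return {
        "runs": len(job_runs),
        "failed": sum(1 for r in job_runs if r.status == "failed"),
        "avg_minutes": sum(minutes) / len(minutes),
        "rows_written": sum(r.rows_written for r in job_runs),
    }


summary = summarize(runs)
print(summary)
```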
This ISO Metadata Registry Standard provides a framework for defining a Metadata registry. It is designed to enable Metadata-driven data exchange, based on exact definitions of data, beginning with data elements.
ISO/IEC 11179
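To make "exact definitions of data, beginning with data elements" more concrete, the sketch below loosely follows the standard's idea that a data element pairs a data element concept (an object class plus a property) with a value domain. The class names, field names, and the tiny `register` function are a simplification assumed for this example; they are not the standard's normative model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DataElementConcept:
    object_class: str   # the thing being described, e.g. "Person"
    property_name: str  # the characteristic, e.g. "Birth Date"


@dataclass(frozen=True)
class ValueDomain:
    datatype: str
    description: str


@dataclass(frozen=True)
class DataElement:
    concept: DataElementConcept
    value_domain: ValueDomain
    definition: str

    @property
    def name(self):
        # Name the element after its object class and property.
        return f"{self.concept.object_class} {self.concept.property_name}"


registry = {}


def register(element):
    """Add a data element to the registry, rejecting duplicate names."""
    if element.name in registry:
        raise ValueError(f"{element.name!r} is already registered")
    registry[element.name] = element
    return element


birth_date = register(
    DataElement(
        concept=DataElementConcept("Person", "Birth Date"),
        value_domain=ValueDomain("date", "Calendar date in YYYY-MM-DD form"),
        definition="The date on which the person was born.",
    )
)
print(birth_date.name)
```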
_______________ metadata is data that defines and describes the characteristics of other systems.
Process metadata
Give 3 examples of process metadata
Data stores and data involved
Government / Regulatory bodies
Organization owners and stakeholders
Process dependencies and decomposition
Process feedback loop documentation
Process name
Process order and timing
Process variations due to input or timing
Roles and Responsibilities
Value chain activities
_________ is data about the business's subject matter experts, their processes, and responsibility assignments.
Data stewardship metadata
Give 3 examples of data stewardship metadata
Business drivers / goals
Data CRUD rules
Data definitions - business and technical
Data owners
Data Sharing rules and agreements / contracts
Data Stewards, roles and responsibilities
Data Stores and systems involved
Data subject areas
Data Users
Government / regulatory bodies
Governance organization structure and responsibilities
A metadata architecture that consists of a single metadata repository that contains copies of live metadata from various sources
Centralized Metadata Architecture
A metadata architecture that consists of a single access point. The metadata retrieval engine responds to user requests by retrieving data from source systems in real time; there is no persistent repository.
Distributed Metadata Architecture
A metadata architecture that is a combined alternative. Metadata still moves directly from the source systems into the repository; however, the repository design only accounts for the user-added metadata, the critical standardized items, and the additions from manual sources.
Hybrid Metadata Architecture
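The three architectures differ mainly in where metadata lives at lookup time. The sketch below contrasts them under simplifying assumptions: source systems are plain dictionaries, and all class names, keys, and descriptions are hypothetical. In particular, the hybrid class is one possible reading of the definition above, keeping only user-added notes locally and fetching everything else from the sources.

```python
class CentralizedRepository:
    """Keeps copies of source metadata; lookups never touch the sources."""

    def __init__(self):
        self._store = {}

    def harvest(self, sources):
        # Copy every entry out of every source system.
        for system, entries in sources.items():
            for key, description in entries.items():
                self._store[key] = {"source": system, "description": description}

    def lookup(self, key):
        return self._store.get(key)


class DistributedAccessPoint:
    """Single access point with no persistent store; reads sources per request."""

    def __init__(self, sources):
        self._sources = sources

    def lookup(self, key):
        for system, entries in self._sources.items():
            if key in entries:
                return {"source": system, "description": entries[key]}
        return None


class HybridRepository:
    """Stores only user-added notes; everything else is fetched from sources."""

    def __init__(self, sources):
        self._live = DistributedAccessPoint(sources)
        self._notes = {}

    def annotate(self, key, note):
        self._notes[key] = note

    def lookup(self, key):
        found = self._live.lookup(key)
        if found is None:
            return None
        return {**found, "steward_note": self._notes.get(key)}


# Hypothetical source systems exposing their own metadata.
sources = {
    "crm": {"customer.email": "Customer e-mail address"},
    "billing": {"invoice.total": "Invoice total in USD"},
}

central = CentralizedRepository()
central.harvest(sources)
distributed = DistributedAccessPoint(sources)
hybrid = HybridRepository(sources)
hybrid.annotate("customer.email", "Contains PII")

# The source changes after the harvest.
sources["crm"]["customer.email"] = "Primary customer e-mail address"

stale = central.lookup("customer.email")["description"]
live = distributed.lookup("customer.email")["description"]
merged = hybrid.lookup("customer.email")
print(stale)
print(live)
print(merged)
```

The centralized copy stays at its harvested value until the next harvest, while the distributed and hybrid lookups reflect the change immediately.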
Why is data lineage important? Name 3 reasons.
* To show WHAT transformations have been made to the data as it passes from one process to another;
* To show WHICH business processes have been applied to the data;
* To highlight WHAT type of actions those processes perform on the data (Create, Read, Update, Delete);
* To provide an AUDIT TRAIL showing WHO has supplied, accessed, updated, approved, and deleted the data and WHEN this occurred;
* To trace back when IMPACT ANALYSIS needs to be undertaken after a data issue is encountered downstream;
* To confirm that the CORRECT PROCESS has actually been undertaken;
* To affirm that the CORRECT MASTER AND REFERENCE DATA is used in the process;
* To COMPLY with certain aspects of REGULATORY ACTS (e.g. Sarbanes-Oxley, HIPAA, Basel, BCBS 239 …)
How does data lineage provide a view of data flow from origin to destination? Name 3 ways.
* Shows the rules and transformations for each flow
* Enables “what-if analysis” for a change in an ETL (Extract, Transform, Load) flow
* Helps identify the right source and optimum data flow for any new requirement
* Provides the meaning of a specific field in a report
* Eliminates data redundancy and ensures completeness
* Provides information on report usage
* Identifies data quality associated with a data element, thus increasing trust
* Provides operational metadata
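Lineage is often stored as a directed graph of source-to-target flows, each tagged with its transformation rule. The sketch below shows how such a graph supports downstream impact ("what-if") analysis and upstream tracing from a report back to its origins. The dataset names, rules, and function names are all hypothetical.

```python
from collections import deque

# Hypothetical lineage edges: (source, target, transformation rule).
EDGES = [
    ("crm.customer", "staging.customer", "trim and lowercase email"),
    ("staging.customer", "dw.dim_customer", "deduplicate on email"),
    ("billing.invoice", "dw.fact_sales", "convert currency to USD"),
    ("dw.dim_customer", "report.sales_by_customer", "join on customer key"),
    ("dw.fact_sales", "report.sales_by_customer", "sum invoice totals"),
]


def _walk(start, neighbors):
    """Breadth-first traversal returning every node reachable from start."""
    seen, queue = [], deque([start])
    while queue:
        node = queue.popleft()
        for nxt in neighbors.get(node, []):
            if nxt not in seen:
                seen.append(nxt)
                queue.append(nxt)
    return seen


def downstream(node):
    """Impact analysis: everything affected if this node changes."""
    graph = {}
    for src, dst, _rule in EDGES:
        graph.setdefault(src, []).append(dst)
    return _walk(node, graph)


def upstream(node):
    """Trace back: every origin that feeds this node."""
    graph = {}
    for src, dst, _rule in EDGES:
        graph.setdefault(dst, []).append(src)
    return _walk(node, graph)


def rules_into(node):
    """Transformation rules applied on the flows that land in this node."""
    return [rule for _src, dst, rule in EDGES if dst == node]


impacted = downstream("crm.customer")
origins = upstream("report.sales_by_customer")
report_rules = rules_into("report.sales_by_customer")
print(impacted)
print(origins)
print(report_rules)
```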
What program in data management benefits from the two items below?
- An end-to-end view eases identification of business rule discrepancies and data incompleteness
- An end-to-end view improves this program’s response to regulations like Sarbanes-Oxley, HIPAA, Basel II
Data Governance
What are some tools that are used to store metadata?
Name 3-4
- Metadata repo
- Business glossary
- Business intelligence
- Configuration management database (CMDB) for IT assets
- Data dictionary
- Data integration tools
- Database management / system catalogs
- Data mapping mgmt tools
- Data Quality tools
- Event messaging tools: move data between diverse systems
- Modeling tools
- MDM and Reference Data repos
- Service registries: service-oriented architecture (SOA) perspective
When preparing to implement metadata, a _____________ should be conducted via objective inspection of existing Metadata, along with interviews with key stakeholders.
Metadata assessment
Implementation of an enterprise data governance strategy needs senior management support and engagement to help drive _________.
Organization and Cultural Change Management