Metadata Flashcards
Most important use of metadata
lineage and impact analysis
Tag big data
describe what the data really means
Harvest
collecting data from various sources into a repository
Metadata metrics
coverage, quality, usage, quantity, maturity
Data Modellers both…
produce and consume metadata
Metadata
Data that describes data, gives context
Metadata can contain
definition, business rules, format, abbreviation, required etc
Three types of metadata capture
automatic, manual, mixed
user defined metadata
additional metadata for further data above what the system will store
Metadata should cover…
the six interrogatives of Data: Who, What, Where, Why, When, How
Automatic capture
by product of using the tool e.g., when was it created, when was it last updated, who created the data
Manual capture
Data which isn’t collected automatically E.g., Why are we storing this data? Who owns the data? What are the business rules? Who is the steward of this data?
Metadata Solution
- whats required
- what the sources are
- how will it be harvested
Metadata sources
- BI
- business process models
- data models
- data transformation
- data quality tools
- ERP, CRM
- photos
- IoT
ERP acronym
enterprise resource planning
CRM acronym
customer relations management
Format of metadata
key | metadata1, metadat2, metadata3 …
Application code will…
collect metadata
CMDB
configuration management database
PREMIS
Preservation Metadata: Implementation Strategies Scheme
COBOL copybooks
flat file that describes the layout of records and fields in a COBOL data file.
Three categories of metadata
- business metadata
- technical metadata
- operational metadata
Business Metadata
relates to the business perspective to the metadata user e.g., regulatory constraints, business rules, known issues with data
Technical Metadata
technical details of the data in IT systems e.g, column names, DDL, access rights
Operational Metadata
targeted at IT operations users needs e.g., loading, archiving, purging, error logs
Information Science types of metadata
Descriptive e.g., author
Structural e.g., indexes
Administrative e.g., versions
Other real world types of metadata
process metadata (roles and responsibilities), data stewardship metadata (subject areas, owners)
Major data governance aspects held in metadata include
- business glossary
- data dictionary
- data stewardship
- data standards
- privacy and security
- traceability and audit
Business Glossary
defining the business metadata definitions for business data elements
Data Dictionary
Defining the technical metadata definitions for data objects e.g., fields, files, tables
Data Stewardship
Aligning data stewardship or ownership roles to key data objects
Data standards
provide governance and rules for current and future development based on both business and IT input
Privacy & Security
Identification of privacy and security levels for business data
Traceability and Audit
Understanding of how data is used across the organisation
Data Values Standards (ISO)
- Country codes
- currency codes
- date / time
- language codes
- units of measurement
- classification of disease
Alternative standards codes
FIPS, IOC
ISO 8601
Date time standards YYYY-MM-DD
Metadata container standards
So that metadata gets shipped with the rest of the file
EXIF (image) ID3 (audio)
EDID
Metadata standard for monitor communications
CWM acronym
Common Warehouse Metadata
ISO / IEC 11179
Metadata Registry Standard
DCMI
Dublin Core metadata intitiative
XMI
XML metadata interchange
Role of physics data model in a meta data repository
To describe how and where our data is stored in our systems applications or packages
Updating the metadata repository is a recommended activity during project close out in the SDLC
TRUE
What type of metadata provides developers and administrators with knowledge and information about systems
techincal & operational metadata
Examples of process meta data
data stores, data involved, government / regulatory bodies, roles and responsibilities, process dependencies and decomposition
Which metadata scheme focusses specifically on documents?
Preservation metadata
Metrics associated with meta data management
Steward representation coverage
metadata repository availability
metadata management maturity
Example of preservation metadata (focussed on documents)
PREMIS
ISO/IEC 1179
standard for framework for defining a metadata repository
Which metadata architecture cannot support user defined entries
Distributed metadata architecture
Which level of a metamodel describes the relationships between systems?
High level conceptual model
A metadata repository scanning process uses and produces all of these files
Control, backup, log, reuse
What levels of granularity should be checked for data quality
Data element value, data instance/record, dataset
types of metadata (unstructured)
Descriptive
Structural
Administrative
Business
Process
Technical/operational
data Stewardship
Descriptive metadata
This provides information about the content of a resource such as its title, author and keywords.
Structural metadata
provides information about the organization and relationships between different components of a resource
Administrative metadata
provides information about the creation management and usage of resources such as rights restrictions and technical specifications.
Business metadata
provides context and meaning to data in terms of business rules, definitions and requirements that business users can understand.
Process metadata
provides information about the processes used to create, transform and manipulate data, including workflows, algorithms and data lineage.
Technical/Operational Metadata
describes the technical aspects of data storage, processing and management, such as database schemas, data models and API specifications.
Data Stewardship Metadata
describes the data governance operating model and expectations.
Electronic Data Interchange (EDI)
concept of businesses electronically communicating information that was traditionally communicated on paper, such as purchase orders and invoices.
Preservation metadata
a subset of administrative metadata, preservation metadata is information describing the context of unstructured data such as documents e.g. information to preserve and save a resource.
Examples of process metadat
data stores and data involved
government/regulatory bodies
roles and responsibilities
process dependencies and decomposition
Which level of a metamodel describes the relationships between systems?
High level conceptual model
Business Drivers for metadata management
It helps organisations meet regulatory compliance
It helps improve communication between the IT team and the data consumers
It provides context to the data and data quality measurements thereby creating more trust in the data
It allows for impact analysis hence reducing the risk of project failure
What does a data dictionary capture?
The structure and contents of data sets, e.g. for a single application
Which metadata architecture delivers metadata that is as current and as valid as possible, but cannot support user-defined metadata entries?
Distributed Metadata Architecture
Which Metadata scheme focuses specifically on documents?
Preservation Metadata
What belongs in a metadata repository
- data requirements
- data lineage diagrams
- data models
- data dictionary
What is the difference between an Industry and a Consensus Metadata Standard?
The terms are used interchangeably to describe the same concept
What type of Metadata is used by developers and administrators to understand what’s happening inside systems?
Technical operational metadata
During a business change project, how many artifacts must be searched for in the Metadata repository?
There is no minimum number, but it is highly recommended that the library is examined
Which of the following initiatives relied upon an industry Metadata Standard?
EDI
Master data architechtures
Hybrid, Registry, Virtualised, Repository
A RACI matrix is a useful tool to support the ________ in an outsourced arrangement
Segregation of duties
These are examples of which type of Meta-Data: Data Stores & Data Involved, Government/ Regulatory Bodies; Roles & Responsibilities; Process Dependencies and Decomposition
Process meta data
Who has primary responsibility for data capture and usage design within programs
Software Architects & Developers