Metadata Flashcards
Most important use of metadata
lineage and impact analysis
Tag big data
describe what the data really means
Harvest
collecting data from various sources into a repository
Metadata metrics
coverage, quality, usage, quantity, maturity
Data Modellers both…
produce and consume metadata
Metadata
Data that describes data, gives context
Metadata can contain
definition, business rules, format, abbreviation, required etc
Three types of metadata capture
automatic, manual, mixed
user defined metadata
additional metadata for further data above what the system will store
Metadata should cover…
the six interrogatives of Data: Who, What, Where, Why, When, How
Automatic capture
by product of using the tool e.g., when was it created, when was it last updated, who created the data
Manual capture
Data which isn’t collected automatically E.g., Why are we storing this data? Who owns the data? What are the business rules? Who is the steward of this data?
Metadata Solution
- whats required
- what the sources are
- how will it be harvested
Metadata sources
- BI
- business process models
- data models
- data transformation
- data quality tools
- ERP, CRM
- photos
- IoT
ERP acronym
enterprise resource planning
CRM acronym
customer relations management
Format of metadata
key | metadata1, metadat2, metadata3 …
Application code will…
collect metadata
CMDB
configuration management database
PREMIS
Preservation Metadata: Implementation Strategies Scheme
COBOL copybooks
flat file that describes the layout of records and fields in a COBOL data file.
Three categories of metadata
- business metadata
- technical metadata
- operational metadata
Business Metadata
relates to the business perspective to the metadata user e.g., regulatory constraints, business rules, known issues with data
Technical Metadata
technical details of the data in IT systems e.g, column names, DDL, access rights
Operational Metadata
targeted at IT operations users needs e.g., loading, archiving, purging, error logs
Information Science types of metadata
Descriptive e.g., author
Structural e.g., indexes
Administrative e.g., versions
Other real world types of metadata
process metadata (roles and responsibilities), data stewardship metadata (subject areas, owners)
Major data governance aspects held in metadata include
- business glossary
- data dictionary
- data stewardship
- data standards
- privacy and security
- traceability and audit
Business Glossary
defining the business metadata definitions for business data elements
Data Dictionary
Defining the technical metadata definitions for data objects e.g., fields, files, tables
Data Stewardship
Aligning data stewardship or ownership roles to key data objects
Data standards
provide governance and rules for current and future development based on both business and IT input