Metadata (IBM reading) Flashcards
Aim of Metadata
Metadata should bring as much information about the data sets as is known collectively by the organization.
Why do we need metadata?
Metadata enables data to be used outside of the application that created it.
- Analytics and decision making
- New business applications
- Reporting and compliance
Metadata describes the format and content of data allowing people to judge which data set to use for a new project
- Structure
- Meaning
- Origin
- Valid values and quality
- Usage and ownership
- Regulations and classifications that apply
Metadata describes the business context and classification of data allowing automated governance processes to operate.
Scope of metadata for a data driven organization
Current issues with metadata
- Many data platforms do not have metadata support
- No-one supports everything you need and assumes all tools come from their suite• Each tool starts “empty” requiring effort to populate metadata
- Each tool operates as if it is the only tool
- No integration/interoperability of metadata repositories from different vendors
- Expensive efforts to create an enterprise data catalogue
Vision of metadata
An enterprise data catalogue that lists all of your data, where it is located, its origin (lineage), owner, structure, meaning, classification and quality
- Spanning systems both on premise and within cloud providers
- Hosted locally to your data platforms but integrated to provide the enterprise view
New data tools (from any vendor) connect to your data catalogue out of the box
- No vendor lock-in; nor expensive population of yet another proprietary siloed metadata repository
Metadata is added automatically to the catalogue as new data is created
- Extensible discovery processes characterise and classify the data
- Interested parties and processes are notified