Week 6 Flashcards
Sets or groups of things that we treat the same are called…
equivalence classes or categories.
A sequence of organizing decisions based on a fixed ordering of resource properties is called…
a hierarchy.
An assertion that an individual is a member of a class is called…
classification.
The systematic assignment of resources to a system of intentional categories is called…
classification.
Classifications designed to make it more likely that people or computational agents will organise and interact with resources in the same way are called…
institutional taxonomies.
Precisely defined abstractions needed to ensure that information can be efficiently exchanged and used are called…
institutional semantics.
The justification for the choice of categories and their names is called…
the warrant principle.
The degree to which a classification can accommodate new resources is called…
hospitality / flexibility / extensibility.
What is the name for bias that arises from the limitations and constraints of a system and results in unfairness?
technical bias
Bias that emerges from the interplay of people and systems is called…
emergent bias.
Bias that embodies personal or societal features is called…
pre-existing bias
Linking a word to an ontology entry, rather than just annotating the word, is called…
semantic annotation.
Automatically deriving an ontology from text is called…
ontology learning.
Given an ontology, populating it with concepts that are automatically derived from text is called…
ontology population.
What are the differences between semantic annotation and ontology population?
In semantic annotation, words in a document are linked to terms from an ontology, so the document is changed. In ontology population, the ontology is populated from a text, so the ontology itself is modified.
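A minimal sketch of the difference, assuming a toy ontology stored as a dictionary from concept to known instances. The function names, the sample tokens, and the extracted pair are all hypothetical and stand in for a real annotation pipeline.

```python
# Toy ontology: concept -> set of known instances (hypothetical data).
ontology = {"City": {"Paris"}, "Person": set()}

def semantic_annotation(tokens, ontology):
    """Tag each token with its ontology concept, if any.
    The document gains annotations; the ontology is left untouched."""
    instance_to_concept = {
        inst: concept for concept, insts in ontology.items() for inst in insts
    }
    return [(tok, instance_to_concept.get(tok)) for tok in tokens]

def ontology_population(extracted, ontology):
    """Add (instance, concept) pairs found in text to the ontology.
    The ontology grows; the document is left untouched."""
    for instance, concept in extracted:
        ontology.setdefault(concept, set()).add(instance)
    return ontology

tokens = ["Marie", "visited", "Paris"]

# Semantic annotation: only "Paris" is known, so only it gets a concept.
annotated = semantic_annotation(tokens, ontology)

# Ontology population: assume an extractor decided "Marie" is a Person.
ontology_population([("Marie", "Person")], ontology)
```

After running this, `annotated` pairs "Paris" with "City" while `tokens` is unchanged, and `ontology["Person"]` now contains "Marie".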
What does OAT stand for?
Ontology Annotation Tool
A version of clustering where an example can belong to only one cluster is called…
hard clustering.
A version of clustering where an example can belong to multiple clusters is called…
soft clustering.
The process of discovering classes in a set of documents in an unsupervised way is called…
clustering.
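The hard/soft distinction above can be sketched on toy data. This assumes two fixed cluster centres and 1-D points, and it uses an exponential of the negative distance as the soft membership weight. That weighting is an illustrative choice, not a specific algorithm from the course.

```python
import math

# Hypothetical 1-D examples and two fixed cluster centres.
points = [1.0, 2.0, 5.0, 9.0]
centres = [2.0, 8.0]

def hard_assign(x, centres):
    """Hard clustering: return exactly one cluster index (the nearest centre).
    Ties go to the lower index."""
    return min(range(len(centres)), key=lambda k: abs(x - centres[k]))

def soft_assign(x, centres):
    """Soft clustering: return one membership weight per cluster, summing to 1."""
    scores = [math.exp(-abs(x - c)) for c in centres]
    total = sum(scores)
    return [s / total for s in scores]

hard = [hard_assign(x, centres) for x in points]
soft = [soft_assign(x, centres) for x in points]
```

The point 5.0 is equidistant from both centres: hard clustering still has to put it in a single cluster, while soft clustering gives it equal membership in both.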
The systematic assignment of resources to a system of intentional categories, often institutional ones, is called…
classification.
A classification scheme where multiple resource properties are considered in a fixed sequence, and each property creates another level in the system of categories is called…
hierarchy / taxonomy.
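A small sketch of how a fixed sequence of properties produces levels, assuming resources are dictionaries with hypothetical `type` and `colour` properties. Each property in the ordering adds one level of nesting.

```python
def build_hierarchy(resources, property_order):
    """Group resources by each property in turn; each property adds one level."""
    if not property_order:
        return [r["name"] for r in resources]
    first, rest = property_order[0], property_order[1:]
    groups = {}
    for r in resources:
        groups.setdefault(r[first], []).append(r)
    return {value: build_hierarchy(members, rest) for value, members in groups.items()}

# Hypothetical resources to organise.
resources = [
    {"name": "shirt-1", "type": "shirt", "colour": "blue"},
    {"name": "shirt-2", "type": "shirt", "colour": "white"},
    {"name": "sock-1", "type": "sock", "colour": "blue"},
]

by_type_then_colour = build_hierarchy(resources, ["type", "colour"])
by_colour_then_type = build_hierarchy(resources, ["colour", "type"])
```

Changing the order of the properties changes the shape of the hierarchy even though the resources are the same, which is why the ordering has to be fixed.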
A published and maintained specification that is developed by consensus of all the relevant stakeholders in some domain by following a defined and transparent process is called…
a standard.
The principle that holds that a classification must be based only on the specific resources that are being classified is called…
literary warrant
The principle that requires the categories in a classification scheme to be mutually exclusive is called…
uniqueness principle
What does EIA stand for?
Enterprise Information Architecture
Give an example of a role that clarifies the role of data as an asset
Data steward / data custodian / data owner
Give an example of a role that clarifies the requirements for the intended use of data
data quality manager
Give an example of a role that establishes the semantics of data so that it’s interpretable by the users
data architect
Give an example of a role that specifies access requirements of data
data security officer
The processes, governance, policies, standards and tools that consistently define and manage the critical data of an organization to provide a single point of reference are called…
master data management
A notion that provides the necessary guidance to manage your data as an asset is called…
data governance
A decision support database maintained separately from the operational databases is called a…
data warehouse
Smaller, less ambitious data warehouses, usually defined at the departmental level, are called…
data marts
Even less ambitious stores, where everything is dumped in and figured out later, are called…
data lakes