Data Management - What I missed from his slides Flashcards
What is business intelligence?
- Data driven decision making
- Transforming data into meaningful information/knowledge to support business decision making
What is knowledge?
Processed data or information that is applicabale to a business decision problem
What are descriptive analytics, predictive analytics and prescriptive analytics?
- Using data to understand past and present
- Predict future behaviour based on past performance
- Make decisions or recommendations to achieve the best performance
How is business intelligence a product?
It provides information and knowledge that enables decision making
How is business intelligence a process?
You have to extract information and knowledge from data. This requires a series of actions.
What is a database management system (DBMS)?
The software that controls the data.
What is the difference between a super key and a candidate key?
A superkey is any combination of attributes that identifies each row.
A candidate key is a superkey without any unnecessary attributes
What is cardinality?
The way tables are related. Their relationship.
What is the difference in terms of time between a data warehouse and an operational system?
Operational: Current value data
Data warehouse: More historical perspective.
In a data warehouse, every key structure contains an element of time (whether you know it or not).
What are 4 conceptual characteristics of a data warehouse?
- Subject-oriented
- Integrated
- time-variant
- nonvolatile (no updates)
What is an independent data mart architecture?
Different data marts are seperately designed and built in a nonintegrated fashion. They are loaded with data from the source layer.
Wat is a bus architecture?
Similar to an independent data mart system, except that the data marts have conformed dimensions.
What is a hub-and-bespoke architecture?
You have datamarts that are loaded with data from the reconciled layer.
One of the most used architectures. Much attention to scalability and extensibility.
Reconciled layer: already integrated, consistent and correct data.
+ creates a common reference model
- leads to more redundancy
What is a centralized architecture?
It is an implementation of the hub and bespoke, in which the reconciled layer and the data marts are collapsed into a single physical repository.
What is a federated architecture?
You have datamarts filled by the operational data sources. BUT the data marts are integrated (virtually or physically) after that.
Happens often when there is a merger or acquisition and some datawarehouse needs to continue to exist.