Data Warehousing Flashcards
Data warehouse
A collection of data that helps analysts to make decisions
True or false: Data warehouses store operational databases
False! Data warehouses store historical data.
Features of a data warehouse
Subject-oriented - provides info around a subject rather than operations
Integrated - data from many different sources
Time-variant - data is identified with a particular period (e.g. last 12 months)
Non-volatile – data is not erased when new data is added
Information processing
Processing data via queries or statistical analysis
Analytical processing
Processing data via Online Analytical Processing (OLAP) tools
Data mining
Finding hidden patterns and associations in data
Features of an enterprise data warehouse
o An EDW
o An operational data store
o Data marts
Operational data store
A hybrid data warehouse containing integrated information
Data extraction
Gathering data from a variety of sources
Data cleaning
Finding and correcting errors
Data transformation
Converting data to warehouse format
Data loading
Sorting/summarising/consolidating/checking integrity and building patterns accordingly
Refreshing
Updating data sources to the warehouse
Dimensional modelling
Different individual models (e.g. separate models for sales and inventory)
True or false: Dimensional modelling leads to fewer tables
True! Data is grouped together and includes redundancies.