Data warehousing Flashcards
constructed by integrating data from multiple heterogeneous sources. It supports analytical reporting, structured and/or ad hoc queries and decision making.
data warehouse
Who first coined the term data warehouse in 1990?
Bill Inmon
According to Inmon, a data warehouse is a subject oriented, integrated, time-variant, and non-volatile collection of data. This data helps analysts to take informed decisions in an organization.
TRUE
What is OLAP?
Online Analytical Processing
4 features of Data Warehouse
Subject Oriented
Integrated
Time Variant
Non-volatile
A data warehouse is subject oriented because it provides information around a subject rather than the organization’s ongoing operations. These subjects can be product, customers, suppliers, sales, revenue, etc. A data warehouse does not focus on the ongoing operations, rather it focuses on modelling and analysis of data for decision making.
Subject Oriented
A data warehouse is constructed by integrating data from heterogeneous sources such as relational databases, flat files, etc. This integration enhances the effective analysis of data.
Integrated
The data collected in a data warehouse is identified with a particular time period. The data in a data warehouse provides information from the historical point of view.
Time Variant
This means the previous data is not erased when new data is added to it. A data warehouse is kept separate from the operational database and therefore frequent changes in operational database is not reflected in the data warehouse.
Non-volatile
T or F
A data warehouse does not require transaction processing, recovery, and concurrency controls, because it is physically stored and separate from the operational database.
TRUE
T OR F
data warehouse helps business executives to organize, analyze, and use their data for decision making. A data warehouse serves as a sole part of a plan-execute-assess “closed-loop” feedback system for the enterprise management.
TRUE
Data Warehouse Applications
Financial services
Banking services
Consumer goods
Retail sectors
Controlled manufacturing
3 types of Data Warehouse
Information Processing
Analytical Processing
Data Mining
A data warehouse allows to process the data stored in it. The data can be processed by means of querying, basic statistical analysis, reporting using crosstabs, tables, charts, or graphs.
Information Processing
A data warehouse supports analytical processing of the information stored in it. The data can be analyzed by means of basic OLAP operations, including slice-and-dice, drill down, drill up, and pivoting.
Analytical Processing
supports knowledge discovery by finding hidden patterns and associations, constructing analytical models, performing classification and prediction. These mining results can be presented using the visualization tools.
Data Mining
the process of constructing and using a data warehouse. A data warehouse is constructed by integrating data from multiple heterogeneous sources that support analytical reporting, structured and/or ad hoc queries, and decision making. involves data cleaning, data integration, and data consolidations.
Data Warehousing
Functions of Data Warehouse Tools and Utilities
Data Extraction
Data Cleaning
Data Transformation
Data Loading
Refreshing
Involves gathering data from multiple heterogeneous sources.
Data Extraction
Involves finding and correcting the errors in data.
Data Cleaning
Involves converting the data from legacy format to warehouse format.
Data Transformation
Involves sorting, summarizing, consolidating, checking integrity, and building indices and partitions.
Data Loading