Business Intelligence and Data Warehouse Flashcards
is the practice of working with data to glean useful information, which can then be used to make informed decisions.
Data Analysis
Organizations tend to grow and prosper as they gain a better understanding of their environment.
Need for Data Analysis
a term that describes a comprehensive, cohesive, and integrated set of tools and processes used to capture, collect, integrate, store, and analyse data with the purpose of generating and presenting information to support business decision making.
Business Intelligence (BI)
is a framework that allows a business to transform data into information, information into knowledge, and knowledge into wisdom.
Business Intelligence (BI)
BI’s Framework
the integrated processes of getting data from original sources into the data warehouse.
Extraction, Transformation, and Loading (ETL)
retrieving data from original data sources
Extraction
manipulating the data into an appropriate form
Transformation
storing the data in the data warehouse
Loading
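The three ETL steps above can be sketched as a small pipeline. This is only an illustration under stated assumptions: the source rows, the `sales` table, and the function names are hypothetical, and an in-memory SQLite database stands in for the data warehouse.

```python
import sqlite3

# Hypothetical raw rows, standing in for an operational source system.
SOURCE_ROWS = [
    {"order_id": "1", "amount": " 19.50", "region": "north"},
    {"order_id": "2", "amount": "5.25 ", "region": "South"},
]

def extract(rows):
    """Extraction: retrieve data from the original source."""
    return list(rows)

def transform(rows):
    """Transformation: put the data into an appropriate, consistent form."""
    return [
        (int(r["order_id"]), float(r["amount"].strip()), r["region"].strip().upper())
        for r in rows
    ]

def load(conn, rows):
    """Loading: store the transformed rows in the (stand-in) warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales (order_id INTEGER, amount REAL, region TEXT)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
    conn.commit()

def run_etl(conn, source=SOURCE_ROWS):
    """Run the three steps in order: extract, then transform, then load."""
    load(conn, transform(extract(source)))
```

Calling `run_etl(sqlite3.connect(":memory:"))` fills the `sales` table with typed, trimmed, upper-cased rows.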
a web-based system that presents key business
performance indicators or information in a single, integrated view with clear and concise graphics.
Dashboard
a unified, single point of entry for information distribution.
Portal
is stored in a relational database in which the structures (tables) tend to be highly normalized.
Operational Data
is an arrangement of computerized tools used to assist managerial decision making
Decision Support System
A decision support database is a specialized DBMS tailored to provide fast answers to complex queries
Decision Support Database Requirement
It must support complex data representation.
Database Schema
The decision support database is created largely by extracting data from the operational database and by importing additional data from external sources. Data-filtering capabilities must include the ability to check for inconsistent data and to apply data validation rules.
Data Extraction and Filtering
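A minimal sketch of the filtering step, assuming extracted rows arrive as dictionaries. The validation rules here (a non-null `customer_id` and a non-negative numeric `amount`) are hypothetical; real rules depend on the business.

```python
def is_valid(row):
    """Hypothetical validation rules applied to one extracted row."""
    return (
        row.get("customer_id") is not None
        and isinstance(row.get("amount"), (int, float))
        and row["amount"] >= 0
    )

def filter_rows(rows):
    """Split extracted rows into accepted and rejected lists."""
    accepted, rejected = [], []
    for row in rows:
        (accepted if is_valid(row) else rejected).append(row)
    return accepted, rejected
```

Keeping the rejected rows, rather than silently dropping them, makes it possible to report the inconsistent data back to the source system.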
Decision support databases tend to be very large; gigabyte and terabyte ranges are not unusual
Database Size
can be defined as an integrated, subject-oriented, time-variant, non-volatile collection of data that provides support for decision making.
Data Warehouse
data are stored in a globally accepted fashion with consistent naming conventions, measurements, encoding structures, and physical attributes, even when the underlying operational systems store the data differently.
Integrated
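To illustrate the "consistent encoding structures" part of integration, the sketch below maps two source-specific codes onto one warehouse encoding. The system names (`crm`, `billing`) and the status codes are assumptions made up for the example.

```python
# Hypothetical encodings used by two operational systems for the same attribute.
STATUS_MAPS = {
    "crm": {"A": "active", "I": "inactive"},
    "billing": {"1": "active", "0": "inactive"},
}

def integrate_status(system, raw_value):
    """Translate a source-specific code into the warehouse's single encoding."""
    return STATUS_MAPS[system][str(raw_value).strip()]
```

With this mapping, `integrate_status("crm", "A")` and `integrate_status("billing", 1)` yield the same warehouse value.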
all relevant data about a subject (such as customers, products, or sales) is gathered and stored as a single set in a useful format.
Subject-Oriented
the data are recorded with a time component, which makes it possible to produce reports that track how the data changed over time.
Time-Variant
the data in the data warehouse are read-only: they can be loaded and accessed, but not changed once loaded.
Non-Volatile
A data modelling technique used to map multidimensional decision support data into a relational database.
Star Schema
the measurements (values) that represent a specific business aspect or activity.
Facts
contains facts that are linked through their dimensions.
Fact Table
are qualifying characteristics that provide additional perspectives to a given fact.
Dimensions
are often used to search, filter, or classify facts. Each dimension table contains attributes.
Attributes
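The star schema terms above (facts, fact table, dimensions, attributes) can be sketched with Python's built-in `sqlite3` module. The table names, column names, and sample rows are hypothetical and chosen only for illustration.

```python
import sqlite3

def build_star_schema(conn):
    """Create one fact table linked to two dimension tables (hypothetical names)."""
    conn.executescript(
        """
        CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, name TEXT, category TEXT);
        CREATE TABLE dim_time (time_id INTEGER PRIMARY KEY, year INTEGER, month INTEGER);
        CREATE TABLE fact_sales (
            product_id INTEGER REFERENCES dim_product(product_id),
            time_id INTEGER REFERENCES dim_time(time_id),
            units INTEGER,
            revenue REAL
        );
        INSERT INTO dim_product VALUES (1, 'Pen', 'Stationery'), (2, 'Lamp', 'Home');
        INSERT INTO dim_time VALUES (1, 2023, 1), (2, 2023, 2);
        INSERT INTO fact_sales VALUES (1, 1, 10, 15.0), (1, 2, 4, 6.0), (2, 1, 1, 30.0);
        """
    )

def revenue_by_category(conn):
    """Use a dimension attribute (category) to classify and total the facts."""
    return conn.execute(
        """
        SELECT p.category, SUM(f.revenue)
        FROM fact_sales AS f
        JOIN dim_product AS p ON p.product_id = f.product_id
        GROUP BY p.category
        ORDER BY p.category
        """
    ).fetchall()
```

Here `units` and `revenue` are the facts, `fact_sales` is the fact table, `dim_product` and `dim_time` are the dimensions, and `category`, `year`, and `month` are attributes used to filter or classify the facts.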
provides a top-down data organization that is used for two main purposes: aggregation and drill-down/roll-up data analysis.
Attribute Hierarchy
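As a rough sketch of how an attribute hierarchy supports aggregation, the example below assumes a hypothetical time hierarchy of year, quarter, and month. Moving to a smaller depth corresponds to rolling up; moving to a larger depth corresponds to drilling down.

```python
from collections import defaultdict

# Hypothetical facts: (year, quarter, month, revenue), ordered top-down by hierarchy level.
FACTS = [
    (2023, 1, 1, 100.0),
    (2023, 1, 2, 50.0),
    (2023, 2, 4, 25.0),
    (2024, 1, 1, 80.0),
]

def aggregate(facts, depth):
    """Total revenue at a hierarchy depth: 1 = year, 2 = quarter, 3 = month."""
    totals = defaultdict(float)
    for *levels, revenue in facts:
        # Keep only the first `depth` hierarchy levels as the grouping key.
        totals[tuple(levels[:depth])] += revenue
    return dict(totals)
```

For instance, `aggregate(FACTS, 1)` gives yearly totals (roll-up), while `aggregate(FACTS, 2)` breaks the same revenue down by quarter (drill-down).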