Adbase H2 Midterms Flashcards
warehouse is a database designed to enable and support business intelligence (BI) activities, especially analytics.
intended to perform queries and analysis
optimized for data retrieval, not for transaction processing
centralizes and consolidates large amounts of data from multiple sources
allows organizations to derive valuable business insights from their data to improve decision-making
can be considered an organization’s “single source of truth”
data warehouse
The DW can analyze data about a particular subject or functional area.
Subjects can be products, customers, departments, regions, etc.
The functional area can be sales, marketing, finance, distribution, etc.
Focuses on the data rather than on the processes that modify the data
Subject-Oriented
The DW creates consistency among different data types from different sources.
Integrated
A student’s level in the database might be defined as “freshman”, “sophomore”, “junior”, or “senior” in the accounting department, and “FR”, “SO”, “JR”, “SR” in the computer information systems department.
Integrated
Data in DW represents the flow of data through time. It can be organized weekly, monthly, or annually, etc.
Time-variant
Once data is in a data warehouse, it is stable and does not change.
Non-Volatile
This is a databank that stocks all enterprise data and makes it manageable for reporting.
Data Warehouse Database
always implemented on the relational database management system (RDBMS) technology like SQL
Data Warehouse Database
These tools are used for performing all the conversions, summarizations, and all the changes needed to transform data into a unified format in the data warehouse. These include:
In case of missing data, populating them with defaults
Calculating summaries and derived data
Eliminating unwanted data in operational databases from loading into the data warehouse
Converting to common data names and definitions
- Extraction, Transformation, and Loading Tools (ETL)
is data about data that describes the data warehouse. It provides the source, transformation, integration, storage, usage, relationships, and history of each data element.
Metadata
contains information about the warehouse, which is used by data warehouse designers and administrators.
Technical Metadata
contains details that give end-users an easy way to understand the information stored in the data warehouse.
Business Metadata
Corporate users generally cannot work with databases directly.
- Data Warehouse Access Tools
help users produce corporate reports for analysis that can be in the form of spreadsheets, calculations, or interactive visuals.
Query and reporting tool
In such cases, custom reports are developed using application development tools when built-in graphical and analytical tools do not satisfy the analytical needs of an organization.
Application development tools
a process of discovering meaningful new correlations, patterns, and trends by mining a large amount of data. Data mining tools are used to make this process automatic.
Data mining
allow users to analyze the data using elaborate and complex multi-dimensional views.
OLAP tools
a small, single-subject data warehouse subset that provides decision support for the particular user group.
Data Marts
- Provides consistent information on various cross-functional activities. It is also supporting “blank” reporting and query.
ad-hoc
is a data-modeling technique used to map multi- dimensional decision support data into a relational database.
star schema