Business Intelligence and Data Warehouses Flashcards

1
Q

What is business intelligence

A

BI: A comprehensive, cohesive, and integrated set of tools and processes used to capture, collect, integrate,
store, and analyse data with the purpose of generating and presenting information to support business decision
making.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Differentiate between operational data and decision support data - Time span

A
  1. Time span: Operational data (OD) covers a short time frame, while decision support data (DSD) tends to
    cover a longer time frame.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Differentiate between operational data and decision support data - Granularity

A
  1. Granularity: Describes the levels of aggregation. Decision support data must be presented at various
    levels of aggregation, from highly summarized (DSD) to nearly atomic (OD).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Differentiate between operational data and decision support data - Dimensionality

A
  1. Dimensionality: Operational data focuses on representing individual transactions, rather than the effect of
    those transactions over time (DSD). Decision support data includes multiple dimensions to show how data
    relates across those dimensions.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a data warehouse, and what are its main characteristics?
How does it differ from a data mart?

A

Data warehouse: An integrated, subject-oriented, time-variant, nonvolatile collection of data that provides
support for decision making.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Components of a data warehouse are 1. Integrated

A
  1. Integrated. The data warehouse is a centralized, consolidated database that integrates data derived
    from the entire organization and from multiple sources with diverse formats.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Components of a data warehouse are 2. Subject-oriented

A
  1. Subject-oriented. Data warehouse data is arranged and optimized to provide answers to questions from
    diverse functional areas within a company.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Components of a data warehouse are 3. Time-variant

A
  1. Time-variant. In contrast to operational data, which focuses on current transactions, warehouse data
    represents the flow of data through time.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Components of a data warehouse are 4. Non-Volatile

A
  1. Non-volatile. Once data enters the data warehouse, it is never removed.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is data mart.

A

Data mart: A small, single-subject data warehouse subset that provides decision support to a small group or
function, created from a larger data warehouse for faster data access

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Using a star schema in a data warehouse environment

A

The star schema consists of a central fact table that stores transactional or measured data. Surrounding the
fact table are one or more dimension tables, which are linked to the fact table through ER (Entity-Relationship)
connections. The fact table contains the measurements or numerical data, while the dimension tables
describe these facts by answering key questions such as “Who,” “What,” “Where,” “When,” and “How.”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Performance improvement techniques used in a star schema - 1. Normalizing Dimensional Tables:

A
  1. Normalizing Dimensional Tables: Simplifies semantic representation and user navigation through the
    data by normalizing the dimension tables.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Performance improvement techniques used in a star schema - 2. Maintaining Multiple Fact Tables

A
  1. Maintaining Multiple Fact Tables: Stores fact tables at different aggregation levels (e.g., region, state, city)
    to speed up query operations.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Performance improvement techniques used in a star schema - 3. Denormalizing Fact Tables

A
  1. Denormalizing Fact Tables: Improves performance by storing data more efficiently, reducing the number
    of records needed to be accessed during queries.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Performance improvement techniques used in a star schema - 4. Partitioning and Replicating Tables

A
  1. Partitioning and Replicating Tables: Improves data access by splitting tables into subsets and replicating
    them across different locations.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Describe what OLAP and it’s characteristics

A

Online Analytical Processing (OLAP): Provides tools for multidimensional data analysis, supporting decision
making, business modelling, and operations research.

17
Q

what are the OLAP characteristics: 1. Multidimensional Data Analysis

A
  1. Multidimensional Data Analysis: Advanced data presentation, aggregation, computation, and modelling
    functions for business analysis.
18
Q

what are the OLAP characteristics: 2. Advanced Database Support

A
  1. Advanced Database Support: Provides access to various DBMSs, aggregated data, and rapid query
    responses with advanced navigation.
19
Q

what are the OLAP characteristics: 3. User-Friendly Interfaces

A
  1. User-Friendly Interfaces: Offers an analytical interface to simplify and accelerate decision-making
    processes.