Lecture 2: Introduction to Data Warehousing Flashcards

2
Q

What is a Data Warehouse?

A

A system designed to collect, store, and manage data from multiple sources for analysis and business intelligence.

3
Q

How does a Data Warehouse differ from a regular database?

A

A regular database (OLTP) is optimized for day-to-day transactions; a Data Warehouse (OLAP) focuses on historical data and analytics.
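
The contrast can be sketched with Python's built-in sqlite3 module. This is only an illustration under assumed names: the `orders` table, its columns, and the sample values are hypothetical, and a real warehouse would typically run on a separate analytical engine rather than in the same database.

```python
import sqlite3

# Hypothetical orders table; names and values are illustrative only.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, region TEXT, year INTEGER, amount REAL)"
)
conn.executemany(
    "INSERT INTO orders (region, year, amount) VALUES (?, ?, ?)",
    [("north", 2022, 100.0), ("north", 2023, 150.0), ("south", 2023, 80.0)],
)

# OLTP-style access: change and read a single row for a day-to-day transaction.
conn.execute("UPDATE orders SET amount = 120.0 WHERE id = 1")
single_order = conn.execute("SELECT amount FROM orders WHERE id = 1").fetchone()

# OLAP-style access: scan the history and aggregate it for analysis.
totals_by_year = conn.execute(
    "SELECT year, SUM(amount) FROM orders GROUP BY year ORDER BY year"
).fetchall()

print(single_order)
print(totals_by_year)
```

The first query touches one row by key; the second reads every row and summarizes it, which is the access pattern a warehouse is built for.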

4
Q

What are the four key characteristics of a Data Warehouse?

A

Subject-oriented, Integrated, Time-variant, and Nonvolatile.

5
Q

Explain ‘subject-oriented’ in Data Warehousing.

A

Data is organized by business topics (sales, marketing, finance) rather than daily transactions.

6
Q

What does ‘integrated’ mean in the context of a Data Warehouse?

A

Data from multiple sources is standardized and combined, ensuring consistency in formats, naming, and units of measure.
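
A minimal sketch of integration, assuming two hypothetical source systems (a CRM and an ERP) that name the same fields differently and report revenue in different units. The field names and the conversion are assumptions chosen for illustration.

```python
# Two hypothetical source extracts with different field names and units.
crm_rows = [{"cust_id": "C1", "revenue_usd": 1200.0}]
erp_rows = [{"CustomerID": "c2", "revenue_cents": 34050}]


def from_crm(row):
    """Map a CRM row onto the shared warehouse schema."""
    return {"customer_id": row["cust_id"].upper(), "revenue_usd": row["revenue_usd"]}


def from_erp(row):
    """Map an ERP row onto the same schema, converting cents to dollars."""
    return {
        "customer_id": row["CustomerID"].upper(),
        "revenue_usd": row["revenue_cents"] / 100,
    }


# After mapping, every row shares one naming convention and one unit.
integrated = [from_crm(r) for r in crm_rows] + [from_erp(r) for r in erp_rows]
print(integrated)
```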

7
Q

Define ‘time-variant’ in a Data Warehouse.

A

Data is associated with specific time periods, allowing historical analysis and trend identification.
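
As a rough illustration, each row below carries the period it describes, which is what makes a trend calculation possible. The `monthly_sales` rows and the `month_over_month` helper are hypothetical.

```python
# Hypothetical monthly snapshots: each fact records the period it describes.
monthly_sales = [
    {"period": "2024-01", "product": "widget", "units": 100},
    {"period": "2024-02", "product": "widget", "units": 120},
    {"period": "2024-03", "product": "widget", "units": 90},
]


def month_over_month(rows):
    """Return (period, change in units versus the previous period)."""
    ordered = sorted(rows, key=lambda r: r["period"])
    return [
        (cur["period"], cur["units"] - prev["units"])
        for prev, cur in zip(ordered, ordered[1:])
    ]


trend = month_over_month(monthly_sales)
print(trend)
```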

8
Q

Why is a Data Warehouse considered ‘nonvolatile’?

A

Once data is loaded, it is not updated in real time; it is primarily read-only, so historical snapshots are preserved.
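
One way to picture this is an append-only load, sketched below under assumed names: `warehouse` is a plain list standing in for a warehouse table, and `load_snapshot` is a hypothetical helper. A change in the source system arrives as a new row instead of overwriting the old one.

```python
from datetime import date

warehouse = []  # append-only list standing in for a warehouse table


def load_snapshot(store, customer_id, city, as_of):
    """Append a new snapshot row; rows already loaded are never modified."""
    store.append({"customer_id": customer_id, "city": city, "as_of": as_of})


# The customer moves, but the earlier snapshot stays in place.
load_snapshot(warehouse, "C1", "Boston", date(2024, 1, 1))
load_snapshot(warehouse, "C1", "Denver", date(2024, 6, 1))

history = [row["city"] for row in warehouse if row["customer_id"] == "C1"]
print(history)
```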

9
Q

Name three types of Data Warehouses.

A

Enterprise Data Warehouse (EDW), Operational Data Store (ODS), and Data Mart.

10
Q

What is an Enterprise Data Warehouse (EDW)?

A

A centralized, large-scale data repository that provides a unified approach to organizing and accessing data across the entire organization.

11
Q

Describe an Operational Data Store (ODS).

A

A near real-time data store that helps with immediate reporting needs, bridging the gap between operational databases and the Data Warehouse.

12
Q

What is a Data Mart?

A

A subset of a Data Warehouse focused on a specific department or function (e.g., sales, marketing) for faster, targeted analysis.
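
A data mart can be approximated as a filtered view over warehouse data. The sketch below uses Python's built-in sqlite3 module; the `warehouse_facts` table, the `sales_mart` view, and the sample rows are hypothetical, and real data marts are often separate physical stores rather than views.

```python
import sqlite3

mart_conn = sqlite3.connect(":memory:")
mart_conn.execute(
    "CREATE TABLE warehouse_facts (department TEXT, metric TEXT, value REAL)"
)
mart_conn.executemany(
    "INSERT INTO warehouse_facts VALUES (?, ?, ?)",
    [
        ("sales", "revenue", 500.0),
        ("marketing", "ad_spend", 200.0),
        ("sales", "orders", 42.0),
    ],
)

# The mart exposes only the sales department's slice of the warehouse.
mart_conn.execute(
    "CREATE VIEW sales_mart AS "
    "SELECT metric, value FROM warehouse_facts WHERE department = 'sales'"
)

sales_rows = mart_conn.execute(
    "SELECT metric, value FROM sales_mart ORDER BY metric"
).fetchall()
print(sales_rows)
```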

13
Q

List some common Data Warehousing tools.

A

IBM DataStage, Oracle, Amazon Redshift, SAP, Google BigQuery, and Domo.

14
Q

What are the three main DW architecture types?

A

Single-tier, two-tier, and three-tier. Three-tier is the most widely used: a warehouse database at the bottom, an OLAP server in the middle, and client tools on top.

15
Q

Why is selecting the right DW architecture important?

A

Because the architecture has to fit the organization's needs, its resources, the complexity of its data, and its reporting/analysis requirements.
