data warehousing Flashcards

1
Q

What is a data warehouse?

A

A decision-support database, separate from operational databases, providing consolidated, historical data for analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the four key characteristics of a data warehouse?

A

Subject-Oriented: Organized around subjects like customers, products, or sales.
Integrated: Combines data from multiple sources into a consistent format.
Time-Variant: Stores historical data for long-term analysis.
Non-Volatile: Data is read-only and does not undergo frequent updates.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How does a data warehouse differ from an operational database?

A

Operational Database (OLTP): Focuses on day-to-day transactions, contains current and detailed data.
Data Warehouse (OLAP): Focuses on historical data analysis, contains aggregated and consolidated data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are fact and dimension tables in a data warehouse?

A

Fact Table: Stores measurements or metrics of a business process (e.g., sales amounts).
Dimension Tables: Store descriptive attributes for filtering and labeling query results (e.g., product names, dates).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Why is it necessary to separate a data warehouse from operational databases?

A

Optimized for different purposes: OLTP handles transactions, while OLAP supports analysis.
Consolidation of data from multiple sources.
Maintains historical data not available in operational databases.
Reconciles inconsistencies in data formats and representations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the design approaches for building a data warehouse?

A

Top-Down: Structured and comprehensive design starting with overall planning.
Bottom-Up: Rapid prototyping starting with smaller projects.
Hybrid: Combines both approaches.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the three data warehouse models?

A

Enterprise Warehouse: Unified data for the entire organization.
Data Mart: Subset of a data warehouse for specific business units.
Virtual Warehouse: Logical view of operational databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is ETL in data warehousing?

A

Extract: Gather data from multiple sources.
Transform: Convert data into a consistent format.
Load: Store data into the warehouse.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the two OLAP server architectures?

A

ROLAP (Relational OLAP): Uses relational databases for data storage and analysis.
MOLAP (Multidimensional OLAP): Uses arrays for fast access to summarized data.
HOLAP (Hybrid OLAP): Combines features of ROLAP and MOLAP.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the main uses of a data warehouse?

A

Information Processing: Querying, reporting, and statistical analysis.
Analytical Processing: Multidimensional analysis using slicing, dicing, and pivoting.
Data Mining: Discovering hidden patterns, associations, and predictions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is metadata in a data warehouse?

A

Metadata describes warehouse objects, including schema, data lineage, and performance metrics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly