Data Warehous Flashcards

1
Q

Types of Data Warehousing architecture

A

1) Centralized (default)

2) Component - Based

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Centralized Data Warehouse

A

Single Database

One stop shopping

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

2 types of data marts

A

Dependent - Depend on a datawarehouse

Independent - draws information from a source database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

2 variations of ETL

A

Initial

Incremental

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q
  • Normally one time only
  • Created right before DW goes live
  • All relevant data necessary for BI & analytic
  • Redo if DW “blows up”
A

Initial ETL

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What to bring into the DW

A
  • Data definitely needed for BI & analytics
  • Data probably needed for BI & analytics
  • Historical Data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q
  • periodically “refreshed” the DW
  • Adds new data
  • Modifies Data
  • Special handling for deleted data
A

Incremental ETL

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

4 major Incremental ETL patters

A
  • Append (adds new data to end)
  • In place update (updates current data with new info)
  • Complete Replacement
  • Rolling Append (when you add new data is deleted the oldest data)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

2 Current MOST USED incremental ETL patters

A
  • Append

- In place Update

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

2 Data Transformation overarching goals

A

Uniformity

Restructuring

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

6 common transformation models

A
  • Data Value Unification
  • Data type and size unification
  • De-duplication
  • Dropping columns (vertical slicing)
  • Value-based row filtering (horizontal slicing)
  • Correcting Known Errors
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

In DW
Measurements = ?
Dimensional Context = ?

A

Facts

Dimensions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

4 types of Fact Tables

A
  • transaction
  • periodic snapshot
  • accumulating snapshot
  • factless
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Record facts (measurement) from transactions

A

Transaction Fact Table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Track a given measurement at regular intervals

A

Periodic Snapshot Table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Track the progress of a business process through formally defined stages

A

Accumulating Snapshot fact tables

17
Q

1) Record occurrence of a transaction that has no measurements
2) Record coverage or eligibility relationships

A

Factless Fact table

18
Q

What is it called when you filter down the data coming from the source databases to only bring in the data that you need.

A

Change Data Capture

19
Q

Change Data Capture techniques

A
  • Transactional Data Timestamp
  • Database Logs
  • Last resort: database scan-and-compare