Basic Data Wrehousing and Architectures Flashcards

1
Q

-a central goal you are trying to achieve

A

theme

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

-a group of data elements tat are central to achieving that goal

A

critical success factor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q
  • a specific question that can be tied to data to identifyif the critical success factoe is being met ot not.
A

business questions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q
  1. Select the business process to model
  2. Declare the grain of the business process
  3. Choose the dimensions that apply to each fact table row.
    4.Identify the facts
    –is part science, part art
A

Dimensional Modeling Process

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q
A

select business process to model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

-specify exactly what an individual fact table row represents-
-conveys the level of detail associated with fact table measurements.
-it is highly recommended to choose the most granular or atomic information captured by the business process.

A

declare the grain of the business process

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

-determine the ways the data will be aggregated or filtered.
-identify the level of hierarchy associated with each part of the grain

A

choose the dimensions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

-determine the measurements that are available at the chosen grain
-identify any consolidations, calculations or conversions to be done.

A

identify the facts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

-all measurements must be at the sam grain
-contains two or more foreign keys to dimension tables
-expresses the many-to-many relationships between dimensions in dimensional models.
-primary table which stores the performance measurements of the business.
-each measurement is taken at the intersection of all relevant dimensions

A

fact table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

-look at the OLTP schema (or availbale extracts) to determine identify possible measures.
-determine the lowest grain possible

A

design guidelines

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

-contain the textual dscriptors of the business
-usually low in cardinality, but very wide
-dimensionattributes used as queart constraints, groupings, and report labels.
-the more descriptice the dimension attributes, the better
-often contain hierarchical relationships

A

dimension tables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

(benefits of dimensional model)
-easy for business users to understand.
-improved query performance

A

simplicity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

(benefits of dimensional model)
-easily accommodates change

A

extensibility

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

gross profit/salesdollar amount

A

gross margin

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

sales dollar amount-cost dollar amount

A

gross profit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

a fact is …if we can sum the fact across all dimensions and obtain a valid and correct number

A

additive

17
Q

a fat is … if the summation of the fact across any dimension results in a meaningless, nonsensical number

A

nonadditive

18
Q

-a fact is…if it is additive across some dimensions and nonadditivie across other dimensions

A

semiadditive

19
Q

-it is highly recommended to use this for dimension table keys.
-are simply integers assigned sequentially to a particular dimension row.

A

surrogate keys

20
Q

-are frequently source natural keys and used to determine the surrogate key to use.
-are also retained for analysis purposes

A

operational codes

21
Q

buffer the data warehouse from changes in operational codes,
-can save space due to their small size comapared to operational codes.
-allow recording of conditions which do not have an operational code
-allow handling of changes to diemnsion table attributes

A

benefits of surrogate keys

22
Q

-that extracts data from legacy systems and external sources, consolidates and summarizes them, and loads them into the data warehouse.

A

data acquisition (back-end) software

23
Q

-that allows users to access and analyze data from the warehouse

A

client (front-end) software