Lecture 3: OLAP Business Databases Flashcards
What is a Data Lake?
A repository that holds raw data in its native form. Copies everything that might be relevant.
(Schema on read)
Name one ETL data conversion example?
Date and time conversion.
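A minimal sketch of such a conversion in Python, assuming the source system delivers timestamps as "DD-MM-YYYY HH:MM" strings that are already in UTC. The function name to_iso_utc and the input format are illustrative assumptions, not part of the lecture.

```python
from datetime import datetime, timezone

def to_iso_utc(raw: str, fmt: str = "%d-%m-%Y %H:%M") -> str:
    """Parse a source-system timestamp and return it as an ISO 8601 string."""
    # Assumption: the source values carry no time zone and are meant as UTC.
    parsed = datetime.strptime(raw, fmt).replace(tzinfo=timezone.utc)
    return parsed.isoformat()

print(to_iso_utc("03-02-2021 14:30"))  # 2021-02-03T14:30:00+00:00
```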
Name one ETL cleaning example?
Garbage in = garbage out (GIGO)
Which two ETL load options are there?
1: Integral load (all records are taken and compared with the records in the DWH; delete and load).
2: Delta/incremental load (only changed records are taken and processed).
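The difference between the two load options can be sketched in Python with plain dictionaries standing in for the source system and the DWH. The function names and the sample records are hypothetical. The delta variant here only detects new or changed records; it does not handle records deleted at the source.

```python
def integral_load(source: dict, warehouse: dict) -> None:
    """Integral load: delete what is in the warehouse and load everything again."""
    warehouse.clear()
    warehouse.update(source)

def delta_load(source: dict, warehouse: dict) -> list:
    """Delta load: process only new or changed records; return their keys."""
    changed = [key for key, row in source.items() if warehouse.get(key) != row]
    for key in changed:
        warehouse[key] = source[key]
    return changed

source = {1: ("Alice", "NL"), 2: ("Bob", "BE"), 3: ("Carol", "DE")}
warehouse = {1: ("Alice", "NL"), 2: ("Bob", "NL")}

print(delta_load(source, warehouse))  # [2, 3]: record 2 changed, record 3 is new
```

An integral load is simpler but touches every record on every run; a delta load does less work but needs a reliable way to detect changes.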
Why use OLAP?
Powerful and intuitive way to explore large amounts of data.
Questions are multi-dimensional.
Answers simple and complex business questions.
What does multi-dimensional data consist of?
Measures
Dimensions
What is a data measure?
Summable information concerning a business process.
What is a data dimension?
A perspective from which the measures can be viewed.
Examples: year, month
Which four OLAP operators are there?
Roll-up
Drill-down
Slice & Dice
Pivot (Rotate)
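The four operators can be illustrated on a tiny, made-up data set in plain Python. The fact rows, the aggregate helper and the choice of "sales" as the measure are assumptions for illustration only; a real OLAP tool performs these operations on a cube.

```python
from collections import defaultdict

# Hypothetical fact rows: (year, month, region, sales)
FACTS = [
    (2023, 1, "North", 100),
    (2023, 1, "South", 80),
    (2023, 2, "North", 120),
    (2024, 1, "North", 90),
    (2024, 1, "South", 110),
]

def aggregate(rows, key):
    """Sum the sales measure, grouped by the dimension values that `key` extracts."""
    totals = defaultdict(int)
    for row in rows:
        totals[key(row)] += row[3]
    return dict(totals)

# Roll-up: climb the time hierarchy from month to year (less detail).
by_year = aggregate(FACTS, lambda r: r[0])
# Drill-down: go back down from year to (year, month) (more detail).
by_month = aggregate(FACTS, lambda r: (r[0], r[1]))
# Slice: fix one dimension to a single value (region = "North").
north = [r for r in FACTS if r[2] == "North"]
# Dice: restrict two or more dimensions at once (year 2023 and region "North").
diced = [r for r in FACTS if r[0] == 2023 and r[2] == "North"]
# Pivot: same totals, with the axes rotated from (year, region) to (region, year).
year_region = aggregate(FACTS, lambda r: (r[0], r[2]))
region_year = {(region, year): total for (year, region), total in year_region.items()}

print(by_year)  # {2023: 300, 2024: 200}
```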
For multi-dimensional modeling you can use two types of schemas. Which ones?
Star schema
Snowflake schema
What is a star schema?
A fact table in the middle, connected to a set of dimension tables.
(Not normalized)
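A small star schema can be sketched with Python's built-in sqlite3 module. The table names (fact_sales, dim_date, dim_product), the columns and the sample rows are invented for illustration. Note that the product's category sits directly in the dimension table, which is what makes it not normalized.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Dimension tables: one flat table per perspective.
CREATE TABLE dim_date    (date_id INTEGER PRIMARY KEY, year INTEGER, month INTEGER);
CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, name TEXT, category TEXT);
-- Fact table in the middle: foreign keys to each dimension plus the measure.
CREATE TABLE fact_sales (
    date_id    INTEGER REFERENCES dim_date(date_id),
    product_id INTEGER REFERENCES dim_product(product_id),
    amount     REAL
);
INSERT INTO dim_date VALUES (1, 2023, 1), (2, 2023, 2);
INSERT INTO dim_product VALUES (1, 'Laptop', 'Hardware'), (2, 'Mouse', 'Hardware');
INSERT INTO fact_sales VALUES (1, 1, 1000.0), (1, 2, 25.0), (2, 1, 1200.0);
""")

# Every dimension is exactly one join away from the fact table.
rows = conn.execute("""
    SELECT d.year, p.category, SUM(f.amount)
    FROM fact_sales f
    JOIN dim_date d    ON d.date_id = f.date_id
    JOIN dim_product p ON p.product_id = f.product_id
    GROUP BY d.year, p.category
""").fetchall()

print(rows)  # [(2023, 'Hardware', 2225.0)]
```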
What is a snowflake schema?
A refinement of the star schema in which some dimensional hierarchies are normalized into a set of smaller dimension tables, forming a shape similar to a snowflake.
(Normalized)
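A snowflake schema can be sketched with Python's built-in sqlite3 module. All table names, columns and rows are hypothetical. The product hierarchy (product, then category) is normalized: the category lives in its own small table and the product dimension only holds a reference to it.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- The category level of the hierarchy is split off into its own table.
CREATE TABLE dim_category (category_id INTEGER PRIMARY KEY, category TEXT);
CREATE TABLE dim_product (
    product_id  INTEGER PRIMARY KEY,
    name        TEXT,
    category_id INTEGER REFERENCES dim_category(category_id)
);
CREATE TABLE fact_sales (
    product_id INTEGER REFERENCES dim_product(product_id),
    amount     REAL
);
INSERT INTO dim_category VALUES (1, 'Hardware'), (2, 'Software');
INSERT INTO dim_product VALUES (1, 'Laptop', 1), (2, 'Mouse', 1), (3, 'Editor', 2);
INSERT INTO fact_sales VALUES (1, 1000.0), (2, 25.0), (3, 50.0);
""")

# Reaching the category now takes two joins: fact -> product -> category.
rows = conn.execute("""
    SELECT c.category, SUM(f.amount)
    FROM fact_sales f
    JOIN dim_product p  ON p.product_id = f.product_id
    JOIN dim_category c ON c.category_id = p.category_id
    GROUP BY c.category
    ORDER BY c.category
""").fetchall()

print(rows)  # [('Hardware', 1025.0), ('Software', 50.0)]
```

The trade-off is less redundancy in the dimension data at the cost of extra joins when querying.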