10 Data warehouse Flashcards
1
Q
what is a data warehouse
A
- central repository of integrated data
- for data analytics and reporting
- architecture
- os
- integration layer
- data warehouse
2
Q
what is ETL
A
- extract
- transform
- load
3
Q
data loading
A
- ETL
- staging layer
- store raw data - integration layer
- integrate data from staging layer - access layer
- help user retrieve data
4
Q
data loading challenges
A
- data volume
- dirty data
- data synchronisation
5
Q
data warehouse schemas
A
- star
2. snowflake
6
Q
star schema pro/cons
A
pro
- denormalised
- simple
- query performance
cons
- not enforced data integrity
- not flexible
- not suitable for complex analysis
7
Q
snowflake schema pro/cons
A
pro
- storage saving
cons
- complex query
8
Q
OLAP
A
- online analytical processing
- availability/ speed are not big concerns
- OLAP cube structure
9
Q
OLAP cube
A
slice
- keep same dimension constant and show changes for others
measure
- quantifiable eg (sum of people)
dimension
- non quantifiable eg (yr 2022)
cells
- number that represents measure
10
Q
OLAP cube operations
A
- slice
- dice
- drill down
- from summarised to detailed - roll up
- from detailed to summarised - pivot
- rotate cube