10 Data warehouse Flashcards

1
Q

what is a data warehouse

A
  1. central repository of integrated data
  2. for data analytics and reporting
  3. architecture
    - os
    - integration layer
    - data warehouse
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is ETL

A
  1. extract
  2. transform
  3. load
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

data loading

A
  1. ETL
  2. staging layer
    - store raw data
  3. integration layer
    - integrate data from staging layer
  4. access layer
    - help user retrieve data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

data loading challenges

A
  1. data volume
  2. dirty data
  3. data synchronisation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

data warehouse schemas

A
  1. star

2. snowflake

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

star schema pro/cons

A

pro

  • denormalised
  • simple
  • query performance

cons

  • not enforced data integrity
  • not flexible
  • not suitable for complex analysis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

snowflake schema pro/cons

A

pro
- storage saving

cons
- complex query

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

OLAP

A
  1. online analytical processing
  2. availability/ speed are not big concerns
  3. OLAP cube structure
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

OLAP cube

A

slice
- keep same dimension constant and show changes for others

measure
- quantifiable eg (sum of people)

dimension
- non quantifiable eg (yr 2022)

cells
- number that represents measure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

OLAP cube operations

A
  1. slice
  2. dice
  3. drill down
    - from summarised to detailed
  4. roll up
    - from detailed to summarised
  5. pivot
    - rotate cube
How well did you know this?
1
Not at all
2
3
4
5
Perfectly