Database Data Engineering Fundamentals - Performance Optimisation Flashcards
1
Q
_________________ Avoid Full table scan. Enforce data uniqueness and integrity
a) indexing
b)partitioning
A
Indexing
2
Q
________________ reduce amount of data scanned. Helps with data lifecycle management. Enables parallel processing.
a) indexing
b)partitioning
A
Partitioning
i.e by month without having to a full scan of the entire month. Then be able to move off old partitions over time.
3
Q
___________ speeds up data transfer, reduce storage and disk reads. Columnar compression.
a) Indexing
b) partitioning
c) Compression
A
Compression
Format for Redshift:
GZIP, LZOP, BZIP2, ZSTD
4
Q
A