Overview Flashcards
1
Q
What are the three V's of Big Data Processing (BDP)?
A
Volume, Variety, Velocity
2
Q
What does ETL stand for?
A
Extract, Transform, Load
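A minimal sketch of the three ETL steps, assuming a small in-memory CSV string as the source and a plain Python list as the target. The function names, the column names `name` and `amount`, and the cleaning rules are hypothetical, chosen only for illustration.

```python
import csv
import io

def extract(csv_text):
    """Extract: read raw rows from the source (here, a CSV string)."""
    return list(csv.DictReader(io.StringIO(csv_text)))

def transform(rows):
    """Transform: drop rows with no amount, tidy names, convert types."""
    return [
        {"name": r["name"].strip().title(), "amount": float(r["amount"])}
        for r in rows
        if r["amount"]
    ]

def load(rows, target):
    """Load: write the cleaned rows into the target store (here, a list)."""
    target.extend(rows)
    return len(rows)

# Hypothetical input: one valid row and one row with a missing amount.
raw = "name,amount\n alice ,10.5\nbob,\n"
store = []
loaded = load(transform(extract(raw)), store)
```

In a real pipeline the source and target would typically be files, databases, or message queues rather than in-memory objects.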
3
Q
What are the two types of processing?
A
Stream and Batch
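One way to picture the difference, as a sketch only: batch processing sees the whole dataset before producing a result, while stream processing updates its result as each record arrives. The function names and the choice of a running average are assumptions made for illustration.

```python
def batch_average(values):
    """Batch: the full dataset is available, so compute once at the end."""
    return sum(values) / len(values)

def stream_average(values):
    """Stream: emit an updated average after every incoming record."""
    count = 0
    total = 0.0
    for v in values:
        count += 1
        total += v
        yield total / count

readings = [2, 4, 6]
batch_result = batch_average(readings)
stream_results = list(stream_average(iter(readings)))
```

The final streamed value matches the batch result; the stream version simply makes intermediate answers available earlier.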
4
Q
What is Data Parallelism?
A
You split the data into chunks and apply the same algorithm to every chunk in parallel.
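A small sketch of data parallelism, assuming a thread pool stands in for the workers of a cluster. The helper names, chunk size, and the sum-of-squares algorithm are hypothetical; the point is that every chunk goes through the same function.

```python
from concurrent.futures import ThreadPoolExecutor

def chunk(data, size):
    """Split data into consecutive chunks of at most `size` items."""
    return [data[i:i + size] for i in range(0, len(data), size)]

def sum_of_squares(part):
    """The single algorithm that is applied to every chunk."""
    return sum(x * x for x in part)

def parallel_sum_of_squares(data, chunk_size=3, workers=4):
    """Run the same function on each chunk, then combine partial results."""
    parts = chunk(data, chunk_size)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = list(pool.map(sum_of_squares, parts))
    return sum(partials)

total = parallel_sum_of_squares(list(range(10)))
```

Threads are used here only to keep the example self-contained; for CPU-bound work in Python, separate processes or machines would usually be needed to get a real speedup.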
5
Q
What is Task Parallelism?
A
You split the work into separate tasks and run them concurrently on a cluster of machines.
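A sketch of task parallelism for contrast: here the data stays the same and the tasks differ. A thread pool is assumed as a stand-in for a cluster of machines, and the three task functions are hypothetical examples.

```python
from concurrent.futures import ThreadPoolExecutor

def count_words(text):
    """Task 1: count the words."""
    return len(text.split())

def longest_word(text):
    """Task 2: find the longest word."""
    return max(text.split(), key=len)

def to_upper(text):
    """Task 3: upper-case the text."""
    return text.upper()

def run_tasks(text):
    """Submit different tasks on the same input and collect their results."""
    tasks = {"count": count_words, "longest": longest_word, "upper": to_upper}
    with ThreadPoolExecutor(max_workers=len(tasks)) as pool:
        futures = {name: pool.submit(fn, text) for name, fn in tasks.items()}
        return {name: future.result() for name, future in futures.items()}

results = run_tasks("big data processing")
```

Each task runs independently, so on a real cluster they could be scheduled on different machines.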
6
Q
What are the properties of BDP?
A
Robustness, low-latency reads and writes, scalability, minimal maintenance
7
Q
What is the revolution of BDP?
A
Large-scale computing and processing on distributed commodity computers, enabled by advanced software that uses elastic resource allocation.