Overview Flashcards

1
Q

BDP V’s?

A

Volume, Variety, Velocity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

ETL?

A

Extract, Transform, Load

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Types of processing?

A

Stream and Batch

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Data Parallelism?

A

You split the data into chunks and apply the same algorithm to all pieces.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Task Parallelism?

A

You split the tasks into chunks, and run it on a cluster of machines.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Properties of BDP?

A

Robustness, Low latency read/write, Scalability, Minimal maintenance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Revolution of BDP?

A

large scale computing processing on distributed, commodity computers, enabled by advanced software using elastic resource allocation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly