02 Distributed Storage Flashcards

1
Q

What is fragmentation?

A

Splitting data intro fragments to distribute them to different nodes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is replication?

A

Put same data fragment on multiple nodes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Horizontal Fragmenation

A

Split data by rows

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Vertical fragmentation

A

Split data by columns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Shared Data Advantages (2)

A
  • Load Balancing
  • Fault-tolerant
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Shared Data Disadvantages

A

High Latency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Shared-nothing advantages (2)

A
  • Low latency
  • simple to implement
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Shared-nothing disadvantage

A

Prone to load imbalance and node failures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is allocation of data?

A

Decide on which server to put fragment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

3 Fragmentation Schemes

A
  • Round-Robin
  • Range partitioning
  • Hash-Partitioning
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is co-partitioned data?

A

Same partitioning function used for both tables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is pruning of partitions?

A
  • skip unnecessary partitions during query execution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is data-distribution skew?

A

Some nodes have more data than others

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly