Storage Devices Flashcards

1
Q

What is a Block Device?

A

A peripheral storage device, such as an HDD or SSD.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

TRUE/FALSE: Hard Disk Drives are the last remaining mechanical component of modern computers.

A

TRUE.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Describe wear-leveling.

A

The spreading of writing operations across all memory cells to ensure no cell wears out immediately from overuse.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is erasure coding?

A

A data-protection technique to ensure bits can be recovered even if one memory cell degrades. Often simply done through repeated memory fragments and storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is bad-block tracking?

A

The process of keeping track of blocks that have gone bad (usually from overuse), so we can avoid them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

TRUE/FALSE: Optimal scheduling optimizes on throughput.

A

FALSE: Optimal scheduling is modular; It can be defined by the user to optimize for latency or fairness instead, for example.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

TRUE/FALSE: Optimal scheduling uses Dynamic Programming to achieve efficient and complete scheduling.

A

FALSE: Optimal scheduling is NP-complete, since we don’t know what comes in, and to achieve a decent approximation, a Greedy approach is used.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is I/O Q-Depth?

A

The Queue-Depth for currently waiting I/O operations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

TRUE/FALSE: Random writes can be made sequential using a log-structured strategy.

A

TRUE.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is Direct I/O?

A

The movement of data directly from the source buffer to a memory device (we did this in project 4!).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Scatter/Gather I/O?

A

A single read/write request being able to collect data blocks from many disjoint locations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

TRUE/FALSE: We perform Read/Write amplification to improve our I/O efficiency.

A

FALSE: Read-write amplification is bad! We over-perform due to block nature of read/write.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What’s the major tradeoff with compression?

A

Performance for capacity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is sharding?

A

When a logical chunk of data is split into smaller chunks. Used for when a database contains too much data for a single machine to handle efficiently.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is replication?

A

Data replication stored for recovery purposes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is RAID?

A

Shadow Legends. Also the Redundant Array of Independent Disks. Data storage technology that combines many disks into a single logical unit.

17
Q

What are the three types of Kernel Abstractions?

A

A device driver, a kernel buffer, and a file system.

18
Q

What did we implement in Project #4?

A

A file system.

19
Q

TRUE/FALSE: A file system is subject to block and alignment constraints.

A

FALSE: While direct writing to memory is subject to these constraints, the abstraction performed makes it unconstrained in this manner.

20
Q

What does a Volume Manager do?

A

It abstracts access to storage devices for user interaction.

21
Q

TRUE/FALSE: A Key/Value store is an accurate general descriptor of many databases.

A

TRUE, though not for all databases.

22
Q

What is a database?

A

A format of data organization that defines relations and operations to query data.

23
Q

What is Computational Storage?

A

Devices that integrate processing and computation on storage devices to reduce CPU load.

24
Q

What is a CSD?

A

Computational Storage Device. Device where memory is stored (starts with an SSD) but can have embedded CPU cores to boot. Effective for training large models.