Mass-Storage Flashcards

1
Q

What is the structure of a disk?

A

One-dimensional arrays of logical blocks (logic_blocks[])

Data addresses with logical block number (index)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What makes a good disk?

A
  • Speed (bandwidth)
  • Speed (Access time, or latency)
  • Reliability
  • Power
  • Cost
  • Capacity
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is disk scheduling?

A

Queue of pending requests created when multiple processes use the disk at once.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What affects the performance of disks when it comes to disk scheduling?

A

Order in which requests are serviced.

Minimisation of head movement.

Fairness

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the advantages of FCFS Scheduling?

A

Simple

Fair - no chance of starvation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the disadvantages of FCFS scheduling?

A

May involve excessive head movement as it doesn’t take into account other requests in the queue.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is SSTF Scheduling?

A

Shortest Seek-Time First Scheduling.

Next request processed is the one with the shortest seek time.

Try to service all requests in local area before moving, pick request with closest cylinder number.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the disadvantages of SSTF Scheduling?

A

Starvation can occur. A request can be held up indefinitely if it far from local area being processed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is SCAN scheduling?

A

Head continually scans disk from one end to the other and back (Elevator algorithm)

Requests are serviced as the head passes. Fair.

Non-uniform delays

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is C-SCAN scheduling?

A

Variant on SCAN, returns to start immediately on reaching end (circular).

Area with most requests is start.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is LOOK / C-LOOK Scheduling?

A

Variants on SCAN / C-SCAN, only goes as far as final requests in each direction (IE Bounded)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do SSDs deal with requests?

A

FCFS policy, can’t use other scheduling policies as it doesn’t have a moving head.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

In what time are Reads and Writes on SSDs?

A

Read: Uniform
Write: Non-uniform
Some SSD schedulers merge only adjacent write requests.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How do Block interfaces to SSD’s work?

A

Logical only.

SSDs run software that manage wear on components. Requesting a particular block doesn’t guarantee a hardware block address.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is deduplication?

A

Don’t store things twice on the drive.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How do we check for duplication?

A

We use hashing, on both a per-file basis and block basis.

17
Q

What are the issues with per-file hashing?

A

Adding anything to a file breaks the hash similarity

18
Q

Is block hashing better than file hashing?

A

Yes but not in all cases.

Inserting any data not the size of a block breaks the similarity.

Requires more computational power.

19
Q

How does variable length hashing work?

A

Uses a rolling hash.

Resyncs the hash boundaries because the data is the same.

Shift resistant hashing.

20
Q

What issue exists with variables length hashing?

A

Hard to parallelise.

21
Q

What is RAID?

A

Many disks attached to a computer system.

Improves Read/Write performance and reliability.

Redundant Array of Inexpensive Disks

22
Q

What is the effect of Redundancy in raid?

A

Improves reliability as >1 copy of data.

However, this causes a storage overhead

23
Q

What is mirroring?

A

Duplicate each disk, single logical disk consists of 2 mirrored physical disks. Write to both, read from one.

24
Q

How does RAID affect parallelism?

A

Read request service rate doubled with mirroring, request can be serviced from either copy.

Transfer rate increased through striping.

25
Q

What is data striping

A

Data fragmented and each fragment written o a different disk.

Increases transfer rate, can read from both.

26
Q

What do each of the different RAID levels mean?

A

Different combinations of mirroring, striping and parity techniques

Cost/performance trade offs different for each

27
Q

What are the attributes of RAID 0?

A

Block-level striping, no redundancy but improved performance. Min 2 disks.

28
Q

What are the attributes of RAID 1?

A

Disk mirroring.
Min 2 disks.
- High storage overhead.

29
Q

What are the attributes of Raid 2?

A

Error Correcting Code organisation, correct single-bit errors and detect double-bit errors.

EEC stored on additional disks.

30
Q

What happens on read with a Raid 2 organisation?

A

Ever disk takes part in every I/O request. Data and ECC bits delivered to controller on read.

31
Q

What are the attributes of Raid 3?

A

Bit-interleaved parity.

Raid 2 but only single parity disk. Can detect read errors

32
Q

What are the attributes of Raid 4?

A

Block interleaved parity. Parity block stored on additional disk.

Block access involves a single disk, multiple I/O requests serviced in parallel.

33
Q

What are the attributes of Raid 5?

A

Block-interleaved distributed parity. Parity blocks distributed among disks.

Avoid bottleneck of single parity disk.

34
Q

What are the attributes of Raid 6?

A

Raid 5 but with two parity calculations, meaning it can handle two disk falilures.