Lecture 10- Physical Storage Flashcards

1
Q

What are each of these levels?

A
  • External level = the way data is seen by individual users
  • Conceptual/Logical level = views database as a collection of tables in the relational model
  • Internal/Physical = the way data is stored inside the system
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Storage classification based on?

A
  • Speed of data access
  • Cost per unit of data
  • Reliability of the medium
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is primary, secondary and tertiary storage?

A
  • Primary = cache and main memory
  • Secondary = flash memory (SSD) and magnetic disk (HDD)
  • Tertiary = magnetic tapes and optical disks
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

As we move down the hierarchy, what happens?

A

As we move down the hierarchy, the cost per bit decreases, whereas the access time increases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are some parts of physical disk structure?

A
  • Disk surface = divided into tracks
  • Sectors = tracks are subdivided into sectors (the smallest unit of information that can be read of written to the disk
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Sectors are typically what size?

A

512 bytes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the key to lower I/O cost?

A

Reduce seek/rotation delays

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are some software solutions to reducing I/O costs?

A
  • Arrange blocks of a file sequentially on disk = store pages containing related information close together on disk, page size trade off
  • Read/Write in bigger chunks
  • Buffering = keep cache of recently accessed pages in main memory
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the page size trade-off?

A
  • Large page size = data related to x stored in same page hence additional page transfer can be avoided
  • Small page size = reduce transfer time, reduce buffer size in main memory
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Describe the hardware solution to reduce I/O costs

A
  • Arrange disk arrays = several disks that give abstraction of a single, large disk
  • Partition data into stripping units and distribute them over several disks
  • A disk array refers to a collection of multiple physical hard drives that are combined ot work as a single, larger logical drive
  • The idea is to aggregate the storage capacity and performance of several disks
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

More disks == ?

A

More failures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the main measure of disk quality?

A
  • Capacity
  • Access time
  • Data transfer rate
  • Reliability
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is access, transfer and seek time?

A
  • Access time = seek time + rotational delay + transfer time
  • Transfer time = time to actually read or write the data in the block
  • Seek time = time for repositioning the arm over the correct track
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is rotational delay?

A

Time spent waiting for the sector to be accessed to appear under the head

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the difference between sequential access and random access?

A
  • Sequential access = successive requests are for successive block numbers on the same track/adjacent tracks (seek may be required for the first block, successive requests would either not require a seek or require a seek to adjacent tacks)
  • Random access = successive requests are for tracks that are randomly located on disk (each request requires a seek)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

The arrangement in the hardware solution helps with?

A
  • Increased storage capacity
  • Improved performance
  • Redundancy and fault tolerance (in some configurations)
17
Q

____ is a common way to implement disk arrays

A

RAID

18
Q

What is RAID 0, RAID 1, RAID 5?

A
  • RAID 0 (stripping) = for better performance by distributing data across disks
  • RAID 1 (mirroring) = for redundancy by copying data to multiple disks
  • RAID 5 = combination of performance and fault tolerance by stripping data with parity information