Data Storage & Organization Flashcards

1
Q

What is transfer size?

A

the unit of memory that can be individually accessed, read and written

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is latency?

A

the time it takes for info to be delivered after the initial request is made

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is bandwidth?

A

The rate at which info can be delivered.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Processor cache?

A

Faster memory storing recently used data that reduces the average memory access time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a Solid State Drive?

A
  • uses flash memory for storage
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is RAID (Redundant Arrays of Independent Disks) ?

A

a disk organization technique that utilize a large number of inexpensive, mass-market disks to provide increased reliability, performance and storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What do RAIDs do?

A

store extra data incase of disk failure.

  • Mirror or shadow
  • duplicates entires disks on multiple disks
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is mean time to failure (MTTF)?

A

the average time the device is expected to run continuously without any failure.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Explain RAID Level 0

and what is its capacity?

A

Striping at the block level (non-redundant)

  • used for high performance where data loss is not crucial (parallesism)

Capacity: N

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Explain RAID Level 1 and what is its capacity?

A

Mirrored disks (redundancy)

  • for apps that require redundancy (protection from disk failure)

Capacity: N/2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Explain RAID Level 2

A

Memory-Style-Error-Correcting-Codes with bit stripping

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Explain RAID Level 5 and what is its capacity?

A
  • offers both reliability & increased performance

Capacity: N-1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Explain RAID Level 6 and what is its capacity?

A
  • offers extra redundancy compared to Level 5
  • used to deal with multiple drive failures

Capacity: N - X
X = # of parity drives such as 2)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a block-level interface

A

Allows the program to read & write a chunk of memory called a block (or page) from the device

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a byte-level interface

A

allows the program to read & write individually addressable bytes from the device

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a file-level interface

A

abstracts away the device addressable characteristics & provides a standard byte-level interface for files to programs running on the OS

17
Q

Hierarchy of a database

A

Database is made up of files

  • each file contains blocks
  • each block contains records
  • each record contains fields
  • each field is a representation of a data item in a record
18
Q

What does a record consist of?

A

One or more fields grouped together

19
Q

What are the two main types of records? And what are they?

A
  1. Variable-length records: the size of the record varies

2. Fixed-length records: all records have the same size

20
Q

4 situations where variable formats are useful

A
  1. The data doesn’t have a regular structure in most cases
  2. The data values are sparse in the records
  3. There are repeating fields in the records
  4. The data evolves quickly so schema evolution is challenging
21
Q

3 disadvantages of variable formats

A
  1. Waste space by repeating schema info for every record
  2. Allocating variable-sized records efficiently is challenging
  3. Query processing is more difficult and less efficient when the structure of the data varies
22
Q

What are the 6 issues related to storing records in blocks? Describe each one.

A
  1. Separation
    - how do we separate adjacent records
  2. Spanning
    - can a record cross a block boundary?
  3. Clustering
    - can a block cross a block boundary?
  4. Splitting
    - are records allocated in multiple blocks?
  5. Ordering
    - are the records sorted in any way?
  6. Addressing
    - how do we reference a given record?
23
Q

What are the 2 options when records do not fit in a block?

Describe them

A

Unspanned
- waste space at the end of the block

Spanned
- Start a records at the end of a block and continue on the next

24
Q

What is clustering?

6 storing issues

A

allocating records of different types together on the same block (or same file) cause they are frequently accessed together.

25
Q

What is split record?

6 storing issues

A

a records where portions of the record are allocated on multiple blocks for reasons other than spanning

26
Q

What is ordering records

6 storing issues

A

when the records in a file (block) are sorted based on the value of one or more fields

27
Q

What are the 2 methods of ordering records, describe them.

6 storing issues

A
  1. Physically ordered
    - the records are allocated in blocks in sorted order
  2. Logical ordered
    - the records are not physical sorted, but each records contains a pointer to the next in the sorted order
28
Q

What is addressing records?

6 storing issues

A

A method for defining a unique value or address to reference a particular record

29
Q

What are the 2 methods of addressing records, describe them.
(6 storing issues)
(Think ordering records)

A
  1. Physically addressed
    - a record has a physical address based on the device where its located
  2. Logically addressed
    - A record has a key value or some other identifier that can be used to lookup its physical address
30
Q

What is pointer swizzling?

A

the process for converting disk pointers to memory pointers and vice versa when blocks move between memory and disk

31
Q

What is a buffer and a buffer manager?

A

A buffer is a portion of main memory available to store copies of disk blocks

A buffer manager is a subsystem responsible for allocating buffer space in main memory

32
Q

What is buffer replacement strategy ?

A

determines which block should be removed from the buffer when space is required