Review - Section 1 Flashcards

Memorization of some of the key terms/ideas from Chapters 1-4

1
Q

Data

A

A collection of raw facts from which conclusions might be drawn.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Data Explosion

A

Accelerated data growth due to inexpensive and easier ways to create, collect and store all types of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Types of Data

A

Structured and Unstructured

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Big Data

A

Data sets whose sizes are beyond the capability of commonly used software tools to capture, store, manage, and process within acceptable time limits.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Information

A

The intelligence and knowledge derived from data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Storage devices (storage)

A

Devices designed for storing data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Server-Centric storage architecture

A

The approach in which storage is internal to the server and could not be shared with any other servers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Information-centric architecture

A

The approach in which storage devices are managed centrally and independent of servers, and are shared with multiple servers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Virtualization

A

A technique of abstracting physical resources and making them appear as logical resources.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Cloud computing

A

Enables individuals or businesses to use IT resources as a service over the network.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Application

A

A computer program that provides the logic for computing operations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Database Management System (DBMS)

A

Controls the creation, maintenance, and use of a database.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Hosts / Compute Systems

A

The computers on which applications run.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Memory

A

Determines both the size and number of applications that can run on a host.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Logical Volume Managers (LVMs)

A

Software that runs on the compute system and manages logical and physical storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Partitioning

A

Partition a larger-capacity disk into virtual, smaller-capacity volumes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Concatenation

A

Aggregating several smaller disks to form a larger virtual volume which is presented to the host as one big logical volume.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Physical volume identifier (PVID)

A

Unique, assigned to each volume when initialized for use by the LVM.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

File

A

A collection of related records or data stored as a unit with a name.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

File system

A

A hierarchical structure of files.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Directories

A

Containers for storing pointers to multiple files.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

File system block

A

The smallest “unit” allocated for storing data.

23
Q

HBA

A

Host Bus Adapter

24
Q

Disk Service Time

A

The time taken by a disk to complete an I/O request.

25
Q

Seek Time/Access Time

A

The time taken to position the R/W heads across the platter with a radial movement.

26
Q

Rotational Latency

A

the time taken by the platter to rotate and position the data under the R/W head.

27
Q

Data Transfer Rate

A

The average amount of data per unit time that the drive can deliver to the HBA.

28
Q

Direct-attached Storage (DAS)

A

An architecture in which storage is connected directly to the hosts.

29
Q

Command Queuing

A

A technique that determines the execution order of received I/Os and reduces unnecessary drive-head movements to improve disk performance.

30
Q

Seek Time Optimization

A

The commonly used algorithm for command queuing.

31
Q

RAID

A

An enabling technology that leverages multiple drives as a part of a set that provides data protection against drive failures.

32
Q

RAID Array

A

An enclosure that contains a number of disk drives and supporting hardware to implement RAID.

33
Q

RAID Set/RAID Group

A

A grouped subset of disks within a RAID array.

34
Q

Striping

A

A technique to spread data across multiple drives to use the drives in parallel.

35
Q

Strip

A

A predefined number of contiguously addressable disk blocks.

36
Q

Stripe

A

The set of aligned strips that spans across all the disks within the RAID set.

37
Q

Strip size/Stripe depth

A

The number of blocks in a strip.

38
Q

Mirroring

A

A technique in whereby the same data is stored on two different disk drives, yielding two copies of the data.

39
Q

Parity

A

A method to protect striped data from disk drive failure without the cost of mirroring.

40
Q

Hot Spare

A

A spare drive in a RAID array that temporarily replaces a failed disk drive by taking the identity of the failed disk drive.

41
Q

Cache

A

Semiconductor memory where data is placed temporarily to reduce the time required to service I/O requests from the host. Consists of the data store and tag RAM.

42
Q

Prefetch/Read-Ahead algorithm

A

Used when read requests are sequential. A contiguous set of associated blocks is retrieved and several other blocks that have not yet been requested can be read from the disk and placed into cache in advance.

43
Q

Write-back cache

A

Data is placed in cache and an ack is sent to the host immediately. Data is committed to the disk later.

44
Q

Write-through cache

A

Data is placed in cache and immediately written to the disk, and an ack is sent to the host.

45
Q

Idle Flushing

A

Occurs continuously, at a modest rate, when the cache utilization level is between the high and low watermark.

46
Q

High watermark flushing

A

Activated when cache utilization hits the high watermark. Additional resources are dedicated to flushing and has some impact on I/O processing.

47
Q

Forced flushing

A

Occurs in the event of a large I/O burst when cache reaches 100% capacity and significantly affects the I/O response time. The system flushes the cache on priority by allocating more resources.

48
Q

Cache mirroring

A

Each write to cache is held in two different memory locations on two independent memory cards.

49
Q

Cache vaulting

A

A set of physical disks is used to dump the contents of cache during power failure. When power is restored, data is written back to write cache and then written to the intended disks.

50
Q

Storage Provisioning

A

The process of assigning storage resources to hosts based on capacity, availability, and performance requirements of applications running on the hosts.

51
Q

Logical Unit Number (LUN)

A

A unique ID assigned to each logical unit created from a RAID set.

52
Q

MetaLUN

A

A method for expanding LUNs; created by combining two or more LUNs. Consists of a base LUN and one or more component LUNs.

53
Q

Virtual Storage Provisioning

A

Creating and presenting a LUN with more capacity than is physically allocated to it on a storage array, called a thin LUN.

54
Q

LUN Masking

A

A process that provides data access control by defining which LUNs a host can access and is implemented on the storage array.