Module 1: Data Center Infrastructure Flashcards

Question

True or false. In erasure coding, if a data fragment or a parity fragment become corrupt or lost, you require the original data for data recovery?

Answer 1

A

A location holding Compute, network and storage.
Contains secure power supply, security and environment controls.
Usually for Mission-Critical and Business-Critical applications.

Answer 2

A

The server systems or storage on which applications are hosted.

Answer 3

A

The network component such as switches, routers and cables.

Answer 4

A

The persistent systems on which all data is stored

Answer 5

A

Converged is prepackaged system of server network storage, components are discrete, preconfigured, and separate. Deployment options with reference architecture and pre-racked. Used for Private Cloud and Machine learning.

Hyperconverged provides components that are tightly coupled and abstracted through virtualisation, with greater gains through automation. Often deployed on commodity components. Ideal for remote offices or VDI.

Answer 6

A

Magnetic disk - large capacity, cheap
SSD - fast, more expensive
Optical Disk drive - polycarbonate disk, uses laser beams, CD, DVD, etc. WORM
Magnetic tape drives - large capacity, cheap used for backups, sequential reads and write

Answer 7

A

Read cache
Write cache
SCM sits between the disks and a cache to give a bigger cache system

Higher performance and lower cost
used for random read-intensive workloads
Reduces latency and improves throughput
Data can be address at the byte or block level

Answer 8

A

Block level

Answer 9

A

Data storage device that uses SSD memory also known as flash

Key components are flash controller and NAND flash memory chips

Commonly used SSD interfaces SAS, SATA, FC, PCIe and USB drives

Answer 10

A

Higher reliability
Higher performance
Faster access to data
Weighs less
More power-efficient
Less cooling required

Answer 11

A

In flash cells combined into pages and then into blocks

There are 3 primary types of SDD are;

Single-Level Cell (SLC) - 1 bit/flash cell - most expensive and lowest capacity.
Multi-Level Cell (MLC) - 2 bits/flash cell - Cheaper than SLC but higher error rates than SLC.
Triple-Level Cell(TLC) - 3 bits/flash cell - High capacity but low performance.

Answer 12

A

There are 3 types

Page-Read
Page-Write
Block-Erase

Page status is either free, written or Invalid.

Answer 13

A

When the number of writes to the media is higher than the writes issued by the host.

You must copy the written data to a new location. The controller reads the content into cache and erases the block.

Answer 14

A

Garbage Collection - SSD controller cleans up blocks for deletion and defrags the blocks
Wear Levelling - distribute the workload across the Cells (there are two types inflight and background).
Overprovisioning - the flash drive will have hidden blocks available that can be used if required when space need.
Caching - write coalescing (hold in write cache until a whole block is written), write combining (when a block is written to repeatedly in a short amount of time. It is held in cache until the final resulting IO is then written to the disk.

Answer 15

Study These Flashcards

A

It extends the life of an SSD

Answer 16

Study These Flashcards

A

Direct Attached storage.
This is where the data storage resides in the same frame as the compute.
E.g. an internal hard disk on a PC or server

Answer 17

Study These Flashcards

A

Either

SAN ( storage device connected via FC switches to hosts to provide LUNs )
NAS ( storage device connected via IP switches to host providing network shares in CIFS or NFS)

Answer 18

Study These Flashcards

A

SATA - common interface on consumer desktops and laptops. Cheap, low performance, high capacity. Direct attached
SCSI - enterprise storage interface providing parallel transmission and used for high performance and mission critical workloads
SAS - Serial SCSI is serial point to point variant of SCSI. SAS is used for high-end computing.
NLSAS - same as SAS but in the back end uses SATA drives which are cheaper and slower but offer large capacites.
FC - Fibre Channel the protocol for sending data on a SAN network with very high throughput support up to 65 Gbits/sec
NVMe is a flash storage based interface that reduces I/O overhead and includes performance improvements.

Answer 19

Study These Flashcards

A

RAID means “Redundant array of independent disks”
RAID technology combines multiple physical drives to create one logical drive.

Function is to protect data and improve performance (through more available spindles).

Answer 20

Study These Flashcards

A

Software RAID -= implemented at the OS level, performance dependant on the host
Hardware RAID - implemented at the host or storage level, dedicated hardware so faster.

Answer 21

Study These Flashcards

A

Striping - RAID 0 - all disks combined to make one logical drive, no protection
Mirroring - RAID 1 - two or more disks hold duplicate copies of the data best for protection
Parity - RAID 3 - one disk holds parity, rest hold data. Good for large sequential writes and reads. RAID 5 splits parity across all disks for better performance.

RAID DP - RAID6 - this uses parity across the disks in horizontal and vertical slices two disks lost to parity but better protection

RAID-TEC (tripe erasure coding). Add a triple-parity disk to a RAID DP group. Protects against 3 disk failures (where there are long rebuild times). Default for NL-SAS and SATA.

Answer 22

Study These Flashcards

A

Five

Answer 23

Study These Flashcards

A

Forward error correction. Parity based technology uses Reed-Solomon encoding algorithm.
This is used to re-create any corruption
Benefits are reliability and availability with more storage efficiency.
Best used for sequential rights.
Best used for archival storage (ideally in cloud).

Data is split into data and parity fragments. D+P=N (Storage node).

Module 1: Data Center Infrastructure Flashcards

(25 cards)