Midrange Performance Fundamentals Flashcards
What is Cache Dirty Size?
capacity of mirrored cache holding data yet to be committed to drive
What is a Vault?
internal area within the storage system that is required for write cache dumping
What does it mean to coalesce IO?
to combine multiple smaller IO into a larger IO
What is locality?
active data regions relative to logical block address range or aggregate of ranges
What parts of RAID affect performance the most?
data copies incur additional drive IO (RAID 1/0)
parity generation will take CPU utilization (RAID 5 and 6)
parity operations will take drive utilization (RAID 5 and 6)
Why are SSDs capable of much higher IOPS?
multiple internal channels allow multiple operations in parrallel
What can affect the performance on an SSD?
wear leveling
garbage collection
endurance levels
What is an example of availability?
10 drives have higher performance potential than 5 drives but a lower availability due to the higher probability of multiple drive failures
What are the ways to choose a RAID type?
fast - for a given number of spindles how good is the relative performance
cheap - how much usable capacity is there - compared to the raw capacity
available - how unlikely is it that the next drive failure will cause data loss
What is the write penalty for each RAID group?
RAID 5 - 4 - old/new data, old/new parity 1
RAID 6 - 6 - old/new data, old/new parity 1, old/new parity 2
RAID 1/0 - 2 - old/new data
What causes more drive load on parity RAID types?
random writes
What is almost always true about midrange storage and performance?
random reads almost always a cache miss
What does a smaller IO help with?
transactional capability and faster service times
What is the benefit of larger IO?
affords higher bandwidth - but large random can result in additional drive IO if spanning multiple stripe elements
What does random IO benefit from?
lots of drives
What is the relationship between concurrency and storage consolidation?
traditionally leads to more concurrency (multiple hosts pointing to one storage system)
Why does server virtualization typically reduce concurrency?
consolidating servers means consolidating paths to the storage
What benefit does read caching have on performance?
previously read data is stored in cache - LRU data is discarded when cache pages are needed
much shorter response time to service a cached read
What benefit does write caching have on performance?
hitting write cache typically faster than going to HDD (as long as cache isn’t saturated)
for a small IO under 1ms vs 5-10ms going to disk
insulates host IO from drive response (RAID overhead, drive type, etc.)
can coalesce chunks of data up to entire stripes
What may not benefit from write cache?
a storage system that has a lot of non-parity writes and uses SSDs
What benefit does IO ordering/processing have on performance?
while data is stored in cache controller can optimize how it will be sent to the drives
order blocks to minimize drive seeking during reads and writes
coalesces multiple contiguous blocks into a single large IO