Memory Hierarchy Flashcards

1
Q

Draw a diagram of a machine’s memory hierarchy

A

(diagram)

2
Q

Where is SRAM found? What are its characteristics?

A

On processor chip
Low latency, high bandwidth, size limited by chip area, read/write size can easily be byte level

3
Q

Where is DRAM found? What are its characteristics?

A

Off chip
Higher latency
High bandwidth
Reads/writes are in small blocks, transferred as bursts

4
Q

What type of hardware does page-based virtual memory use? What are its characteristics?

A

Magnetic hard disk or SSD
High latency, lower bandwidth; reads/writes are in larger blocks, which are needed for the extensive error detection and correction codes

5
Q

Describe static multiple issue

A
  • Compiler groups instructions to be executed together into “issue slots”
  • Compiler detects and avoids hazards
6
Q

Describe dynamic multiple issue

A
  • CPU chooses instructions to issue each cycle
  • Compiler can help reorder instructions
  • CPU resolves hazards at runtime
7
Q

What is speculation?

A

Guess what to do with an instruction, start the operation as soon as possible, then check whether the guess was right: either complete the sequence or roll back and do the right thing

8
Q

Give an example of speculation

A

Speculate on branch outcome - roll back if different path is taken

9
Q

What makes pipelining harder?

A

Poor ISA design:
1. Complex instruction sets
2. Complex addressing modes
3. Delayed branches

10
Q

What is the principle of locality?

A

Programs access a small proportion of their address space at any time, giving rise to spatial and temporal locality

11
Q

What is temporal locality?

A

Items accessed recently are likely to be accessed again soon

12
Q

Give 2 examples of temporal locality

A
  1. Instructions in a loop
  2. Loop counters
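
As an illustration (a hedged sketch, not part of the original deck), the loop below reuses the counter i and the same few loop instructions on every iteration, which is exactly the temporal locality described above:

```c
#include <stdio.h>

int main(void)
{
    int sum = 0;
    /* The loop's handful of instructions and the counter i are accessed
       again on every iteration, within a very short time: temporal locality. */
    for (int i = 0; i < 1000; i++) {
        sum += i;
    }
    printf("%d\n", sum);
    return 0;
}
```
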
13
Q

What is spatial locality?

A

Items near those accessed recently are likely to be accessed soon

14
Q

Give 2 examples of spatial locality

A
  1. Sequential instruction access
  2. Array data
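
As an illustration (hedged sketch, details assumed), the loop below walks an array whose elements sit at neighbouring addresses, so each access is close to the last one, and the loop's instructions are fetched sequentially - spatial locality:

```c
#include <stdio.h>

#define N 1024

int main(void)
{
    static int a[N];   /* zero-initialised array */
    int sum = 0;
    /* a[0], a[1], a[2], ... live at neighbouring addresses, so each access
       is next to the previous one, and a cache line fetched for a[i] also
       brings in a[i+1], a[i+2], ...: spatial locality. */
    for (int i = 0; i < N; i++) {
        sum += a[i];
    }
    printf("%d\n", sum);
    return 0;
}
```
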
15
Q

What is a cache?

A

Memory that holds recently used data so that it can be accessed with lower latency next time, alleviating the von Neumann bottleneck

16
Q

What is a cache hit? What happens?

A

The data is found in the cache, so it is read from there

17
Q

What is a cache miss? What happens?

A

The data is not in the cache, so it is fetched from the next level of the memory hierarchy

18
Q

Write the equation for miss rate

A

Miss rate = number of cache misses / number of memory accesses
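For example, with hypothetical numbers: 50 misses out of 1,000 memory accesses gives a miss rate of 50 / 1000 = 0.05, i.e. 5%.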

19
Q

Draw a diagram of a crude direct-mapped cache

A

(diagram)

20
Q

In a crude direct-mapped cache, for a given address, how many places are there to look?

A

1

21
Q

Give a real life example of a direct-mapped cache

A

Order all books by the first two letters of the title.
Go to the right place and read the tag of the book there to check it is the correct one - there can only be one entry for each tag

22
Q

Which property must each item in a crude direct-mapped cache have?

A

A unique hash-function value, i.e. a cache address/tag

23
Q

What does the hash function do in a crude direct-mapped cache?

A

Takes the load address and converts it to the cache address, where the data is stored
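
As a rough software model only (the entry count and word size are assumed, not taken from the deck), the hash can be as simple as the word address modulo the number of cache entries:

```c
#include <stdint.h>
#include <stdio.h>

#define NUM_ENTRIES 256      /* assumed number of cache entries */

/* Map a load address to the single cache entry it can occupy: the word
   address reduced modulo the number of entries. */
static uint32_t cache_index(uint32_t load_address)
{
    return (uint32_t)((load_address / sizeof(uint32_t)) % NUM_ENTRIES);
}

int main(void)
{
    printf("address 0x80 maps to entry %u\n", (unsigned)cache_index(0x80));
    return 0;
}
```
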

24
Q

Give a disadvantage of a crude direct-mapped cache

A

Storing the address as a tag for every data item is a big overhead

25
Q

What is a simple direct-mapped cache?

A

It amortises the cost of a crude direct-mapped cache by storing several data items per tag

26
Q

What is a cache line?

A

Some number of contiguous words in memory. Cache lines link with spatial locality, and with DRAM bursts when filling cache lines from main memory

27
Q

What is a fully associative cache?

A

There is no hash-function ordering, so the whole cache must be searched every time

28
Q

Draw a diagram of a simple direct-mapped cache

A

(diagram)

29
Q

What is the effect of larger cache lines in a variable-sized cache? Why? What is a disadvantage that can override this benefit?

A

Reduced miss rate, due to spatial locality. However, the miss penalty is larger, which can override the benefit of the reduced miss rate

30
Q

What is the effect of larger cache lines in a fixed-size cache? Why?

A

There are fewer cache lines, so more competition, so an increased miss rate. There is also pollution: more data is read in that may never be used

31
Q

How can we avoid the larger miss penalty that comes with larger cache lines?

A

Use early restart: give a word to the CPU as soon as we get it, rather than waiting for the rest of the words

32
Q

How does the CPU proceed on a cache hit?

A

Normally, i.e. carry on and fetch the next instruction

33
Q

How does the CPU proceed on a cache miss? How does this differ between instruction and data cache misses?

A
  1. Stall the CPU pipeline
  2. Fetch the block from the next level of the hierarchy
  3. If it is an instruction cache miss, restart the instruction fetch; if it is a data cache miss, complete the data access

34
Q

What are the two ways we can handle a data-write hit (i.e. the block of data to update is in the cache)?

A
  1. Write-back - just update the cache line
  2. Write-through - also update memory for consistency
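
A hedged software sketch of the two policies (the types, names and simulated memory are invented for illustration): write-back only updates the line and marks it dirty, while write-through also updates the next level straight away.

```c
#include <stdbool.h>
#include <stdint.h>

#define WORDS_PER_LINE 8

struct cache_line {
    uint32_t data[WORDS_PER_LINE];
    bool     dirty;
};

/* Tiny simulated next level of the hierarchy, for illustration only. */
static uint32_t next_level[1 << 16];

static void memory_write(uint32_t address, uint32_t value)
{
    next_level[address % (1 << 16)] = value;
}

/* Write-back: just update the cache line and remember that it is dirty;
   memory is only updated when the line is eventually evicted. */
void write_hit_write_back(struct cache_line *line, unsigned word, uint32_t value)
{
    line->data[word] = value;
    line->dirty = true;
}

/* Write-through: update the cache line and also memory, for consistency;
   the memory write is the slow part that a write buffer can hide. */
void write_hit_write_through(struct cache_line *line, unsigned word,
                             uint32_t address, uint32_t value)
{
    line->data[word] = value;
    memory_write(address, value);
}
```
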
35
Q

What is a disadvantage of write-through?

A

It makes writes take longer

36
Q

What is a solution to make write-through more efficient?

A

Use a write buffer

37
Q

What is a write buffer?

A

It holds data waiting to be written to memory. The CPU continues immediately and only stalls if the write buffer is already full

38
Q

What do we need to keep track of in write-back? Why?

A

Whether each cache line is dirty, i.e. holds new data. When a dirty block is replaced, it must be written back to memory

39
Q

How can a write buffer be used in write-back too?

A

It allows the replacement cache line to be read first, with the evicted dirty line written back from the buffer afterwards

40
Q

List how we can handle write misses for write-through and write-back

A

For write-through:
  1. Allocate on miss
  2. Write around
For write-back:
  1. Allocate on miss

41
Q

What is allocate on miss?

A

Fetch the cache line (with write-back, this avoids having a cache line where some words or bytes are invalid)

42
Q

What is write around?

A

Don't fetch the cache line; instead, write the data to the next level in the hierarchy

43
Q

Draw a diagram of a fully associative cache

A

(diagram)

44
Q

How much restriction is there on where data can be stored in a fully associative cache?

A

No restrictions

45
Q

What happens to the upper address bits (tag) of a memory address in a fully associative cache?

A

They are broadcast to every cache location, each of which has its own comparator. The hardware is parallel, so all comparisons are done simultaneously when the CPU is trying to access a data block
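
A software model of the lookup (types and sizes assumed for illustration): note that this loop checks entries one at a time, whereas the real hardware compares every tag in parallel, one comparator per entry.

```c
#include <stdbool.h>
#include <stdint.h>

#define NUM_ENTRIES 64       /* assumed cache size */
#define WORDS_PER_LINE 8

struct fa_entry {
    bool     valid;
    uint32_t tag;            /* upper address bits of the stored block */
    uint32_t data[WORDS_PER_LINE];
};

static struct fa_entry cache[NUM_ENTRIES];

/* Return the index of the matching entry, or -1 on a miss. Hardware gives
   every entry its own comparator, so all NUM_ENTRIES comparisons happen
   at once; this software loop has to check them one by one. */
int fully_associative_lookup(uint32_t tag)
{
    for (int i = 0; i < NUM_ENTRIES; i++) {
        if (cache[i].valid && cache[i].tag == tag)
            return i;
    }
    return -1;
}
```
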
46
Q

How does FPGA circuit area compare between direct-mapped and fully-associative caches?

A
  • Direct-mapped caches can be efficiently implemented using embedded block RAM
  • Fully-associative caches have to be made out of registers and logic, so they are huge

47
Q

How does efficiency compare between direct-mapped and fully-associative caches?

A

Fully-associative caches tend to have a higher hit rate than a direct-mapped cache that stores the same amount of data - there are no cache location conflicts, and every cache location can be checked at once on a data access

48
Q

What is the advantage of a set-associative cache?

A

It combines the efficiency of a direct-mapped cache with the higher hit rate of a more associative cache

49
Q

What is a set-associative cache?

A

It has N direct-mapped caches; reads look in all N caches for the data, so the cache has N-way associativity
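
A sketch of an N-way lookup under assumed parameters (4 ways, 128 sets, names invented for illustration): the index selects a set and the tag is compared against all N ways of that set.

```c
#include <stdbool.h>
#include <stdint.h>

#define NUM_WAYS 4           /* N: assumed associativity */
#define NUM_SETS 128         /* assumed number of sets   */
#define WORDS_PER_LINE 8

struct way {
    bool     valid;
    uint32_t tag;
    uint32_t data[WORDS_PER_LINE];
};

static struct way cache[NUM_SETS][NUM_WAYS];

/* Look in all N ways of the selected set; return the matching way,
   or -1 on a miss. */
int set_associative_lookup(uint32_t set_index, uint32_t tag)
{
    for (int w = 0; w < NUM_WAYS; w++) {
        if (cache[set_index][w].valid && cache[set_index][w].tag == tag)
            return w;
    }
    return -1;
}
```
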
50
Q

How many items can be stored with the same tag in a set-associative cache?

A

N

51
Q

What is the replacement policy for a direct-mapped cache?

A

No choice

52
Q

What is the replacement policy for a set-associative or fully-associative cache?

A

Replace a non-valid entry if there is one, otherwise:
  1. Least recently used (LRU) - choose the one unused for the longest time
  2. Not last used - approximates LRU and is simpler to implement
  3. Random - simple to implement, gives approximately the same performance as LRU for high associativity
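
A hedged sketch of victim selection within one set (the timestamp-based LRU bookkeeping is one possible implementation, assumed for illustration): prefer an invalid way, otherwise take the least recently used one.

```c
#include <stdbool.h>
#include <stdint.h>

#define NUM_WAYS 4           /* assumed associativity */

struct way_state {
    bool     valid;
    uint64_t last_used;      /* time of the most recent access to this way */
};

/* Choose the way to replace within one set: use an invalid way if there
   is one, otherwise the least recently used way (oldest last_used). */
int choose_victim(const struct way_state set[NUM_WAYS])
{
    int victim = 0;
    for (int w = 0; w < NUM_WAYS; w++) {
        if (!set[w].valid)
            return w;
        if (set[w].last_used < set[victim].last_used)
            victim = w;
    }
    return victim;
}
```
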
53
Q

What is a victim cache? How is it used?

A

Augment a direct-mapped cache with a tiny fully-associative cache, then give evicted cache lines a second chance in the victim cache. This eliminates some pathological misses of a direct-mapped cache, but is not as effective as a set-associative cache

54
Q

List the 3 sources of cache misses

A
  1. Compulsory misses
  2. Capacity misses
  3. Conflict misses

55
Q

What is a compulsory miss?

A

The first access to a block

56
Q

What is a capacity miss?

A

A miss due to the finite cache size: a replaced block is accessed again

57
Q

What is a conflict miss?

A

A miss in a non-fully-associative cache due to competition for entries in a set

58
Q

What prevents the construction of a large, low-latency memory, requiring us to use a hierarchy of memories to approximate one?

A

Physical constraints

59
Q

There is no hash function in a simple direct-mapped cache. Where does the cache address come from?

A

The index bits. The upper address bits are used to tag the data
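
A sketch of the address split under assumed parameters (32-byte cache lines, 256 lines; the field widths are illustrative, not taken from the deck): the low bits are the offset within the line, the middle bits are the index, and the remaining upper bits are stored as the tag.

```c
#include <stdint.h>
#include <stdio.h>

#define OFFSET_BITS 5        /* assumed 32-byte cache line */
#define INDEX_BITS  8        /* assumed 256 cache lines    */

int main(void)
{
    uint32_t address = 0x12345678u;   /* arbitrary example address */

    uint32_t offset = address & ((1u << OFFSET_BITS) - 1);
    uint32_t index  = (address >> OFFSET_BITS) & ((1u << INDEX_BITS) - 1);
    uint32_t tag    = address >> (OFFSET_BITS + INDEX_BITS);

    /* The index bits select the cache line; the tag (the remaining upper
       bits) is stored alongside the line so a hit can be confirmed. */
    printf("tag=0x%x index=0x%x offset=0x%x\n",
           (unsigned)tag, (unsigned)index, (unsigned)offset);
    return 0;
}
```
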
60
Q

What is the tag in a crude direct-mapped cache?

A

The data load address