Advanced Caches Flashcards

1
Q

Ways to improve cache performance

A
  1. reduce hit time
  2. reduce miss rate
  3. reduce miss penalty
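
These three levers combine in the standard average memory access time formula (the cycle counts below are made-up example numbers):

\[ \text{AMAT} = \text{hit time} + \text{miss rate} \times \text{miss penalty} \]

e.g., with a 1-cycle hit, a 5% miss rate, and a 100-cycle miss penalty: AMAT = 1 + 0.05 × 100 = 6 cycles.
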
2
Q

Ways to reduce hit time

A
  1. pipelined caches
  2. Improve TLB time
  3. Way prediction
  4. Improve replacement policy time
3
Q

What are pipelined caches?

A

A cache access can take multiple cycles, and accesses are normally queued, so one instruction cannot access the cache while another is still using it. A pipelined cache instead splits the lookup into stages (e.g., index the set, compare tags, select the data), so a new access can start each cycle.

4
Q

What is a PIPT cache?

A

Physically indexed, physically tagged cache: both the index and the tag come from the physical address. The processor must first translate the virtual address through the TLB, then use the resulting physical address to look up the data in the cache, which puts the TLB on the critical path of every access.

5
Q

What are some potential problems with virtually accessed caches?

A
  • Must flush the cache on a context switch, because virtual addresses are specific to a process
  • Still need other information from the TLB on cache hits (such as whether this process has permission to access the page)
6
Q

What is a VIPT cache?

A

Virtually indexed, physically tagged cache.
The cache uses the virtual index bits to read out the set and its stored physical tags.
At the same time, the TLB translates the virtual address to produce the physical tag.
If the tags match, it is a hit.
Does not need to be flushed on a context switch.
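
A minimal C sketch of the parallel VIPT lookup, assuming made-up parameters (64 B blocks, 64 sets, 8 ways, 4 KB pages) and stub helpers standing in for the TLB and tag array:

#include <stdint.h>
#include <stdbool.h>

#define BLOCK_OFFSET_BITS 6   /* 64 B blocks (assumed) */
#define INDEX_BITS        6   /* 64 sets (assumed) */
#define WAYS              8   /* assumed */

/* Placeholder stubs for the hardware structures. */
static uint64_t tags[1 << INDEX_BITS][WAYS];              /* stored physical tags */
static uint64_t tlb_translate(uint64_t va) { return va; } /* identity map, for the sketch only */

static bool vipt_hit(uint64_t vaddr)
{
    /* The index comes straight from the virtual address, so the set
       can be read in parallel with the TLB lookup... */
    unsigned set = (vaddr >> BLOCK_OFFSET_BITS) & ((1u << INDEX_BITS) - 1);

    /* ...meanwhile the TLB produces the physical address... */
    uint64_t paddr = tlb_translate(vaddr);
    uint64_t ptag  = paddr >> (BLOCK_OFFSET_BITS + INDEX_BITS);

    /* ...and the hit check compares physical tags. */
    for (unsigned w = 0; w < WAYS; w++)
        if (tags[set][w] == ptag)
            return true;
    return false;
}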

7
Q

What is cache aliasing?

A

In a VIPT cache, multiple virtual addresses might map to the same physical address but get stored in different cache locations (because they are placed according to their virtual-address index bits)

8
Q

Why is cache aliasing bad?

A

On a write, only one of the aliased copies is updated, so a later read through the other virtual address can return stale data

9
Q

How can we ensure that we do not have aliasing?

A

All of the index bits must come from the page offset, i.e. (block offset bits) + (index bits) <= (page offset bits). Then the index is identical whether computed from the virtual or the physical address, so aliases always land in the same set and match the same physical tag.
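
A worked instance of that constraint, assuming 4 KB pages (12 offset bits) and 64 B blocks (6 block offset bits):

\[ 6 + \text{index bits} \le 12 \;\Rightarrow\; \text{index bits} \le 6 \;\Rightarrow\; \text{at most } 2^6 = 64 \text{ sets} \]

So the cache holds at most 64 sets × 64 B × ways; an 8-way cache tops out at 32 KB. This is why growing an alias-free VIPT L1 usually means adding associativity rather than sets.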

10
Q

What is way prediction?

A

We guess which way (line) in the set holds the matching tag and check that one first, giving a direct-mapped-like hit time when the guess is right. If the guess is wrong, we fall back to a normal check of the rest of the lines in the set.

11
Q

What is the relationship between associativity and hit time?

A

As associativity goes up, the hit rate goes up, but so does the hit time (more tags to compare), so associativity must be balanced against hit latency

12
Q

What is wrong with LRU and random as replacement policies?

A
  • Random - nothing to update on cache hit, but has higher miss rate
  • LRU - need to update lots of counters on cache hit, but lower miss rate
13
Q

What are some alternatives for LRU?

A

NMRU (not most recently used) and PLRU (pseudo-LRU)

14
Q

What is PLRU?

A

Every time a line is accessed, set its bit to 1. When we need to evict, pick a line whose bit is still 0. Once all the bits in the set have been set, zero out every bit except the most recently accessed line's. Better than NMRU, but not as good as true LRU.
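
A minimal C sketch of this bit-based PLRU for a single set (the 4-way size is a made-up parameter):

#define WAYS 4

typedef struct {
    unsigned char mru_bit[WAYS];   /* 1 = accessed since the last reset */
} plru_set_t;

/* On an access, set the way's bit; if every bit is now 1,
   clear all bits except the one just accessed. */
static void plru_access(plru_set_t *s, int way)
{
    s->mru_bit[way] = 1;
    for (int w = 0; w < WAYS; w++)
        if (!s->mru_bit[w])
            return;               /* some bit is still 0: nothing to reset */
    for (int w = 0; w < WAYS; w++)
        s->mru_bit[w] = 0;
    s->mru_bit[way] = 1;
}

/* On a miss, evict any way whose bit is still 0. */
static int plru_victim(const plru_set_t *s)
{
    for (int w = 0; w < WAYS; w++)
        if (!s->mru_bit[w])
            return w;
    return 0;   /* unreachable: the reset above always leaves a 0 bit */
}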

15
Q

What are the three types of misses?

A

Compulsory (first-ever access to a block), capacity (because of limited cache size), and conflict (because of limited associativity)

16
Q

How to reduce the miss rate?

A
  • larger cache blocks
  • prefetching (software prefetch instructions)
  • hardware prefetching
  • loop interchange
17
Q

What is the relationship between cache block size and the miss rate?

A

Larger cache blocks reduce the miss rate, but only while the program has good spatial locality.
The graph of miss rate vs. block size is U-shaped (a "smiley face"): past some block size the program no longer has enough spatial locality, so bigger blocks waste capacity and the miss rate climbs again

18
Q

What is prefetching?

A

Trying to guess which blocks will be needed next and loading them into the cache ahead of time

19
Q

What is cache pollution?

A

When the cache is filled with junk blocks that we don’t need. Can be caused by bad prefetching. Can also cause an extra cache miss, because a block we still needed was evicted to make room for the junk

20
Q

What are prefetch instructions and what are the downsides to using them?

A

An ISA instruction, inserted by the programmer or compiler, that tells the cache to prefetch data before it is used. Hard to use correctly because it is hard to guess how far ahead to prefetch: too early and the block may be evicted before use, too late and the miss latency is not hidden.
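
As a concrete example, GCC and Clang expose prefetch instructions through __builtin_prefetch; the 16-element prefetch distance here is a guess that would need tuning per machine:

/* Sum an array, prefetching a guessed 16 elements ahead. */
long sum(const long *a, int n)
{
    long total = 0;
    for (int i = 0; i < n; i++) {
        if (i + 16 < n)
            __builtin_prefetch(&a[i + 16]);  /* only a hint: no fault, no stall */
        total += a[i];
    }
    return total;
}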

21
Q

What are the types of hardware prefetching?

A

Stream buffer, stride prefetcher, correlating prefetcher

22
Q

What is a stream buffer?

A

Hardware prefetching that tries to fetch the next sequential physical block after the one we just accessed (good for streaming access patterns)

23
Q

What is stride prefetching?

A

Guesses the next block based on the address difference (stride) between previous accesses, i.e. it looks for a fixed pattern such as "each access is 3 blocks past the last" and prefetches the next block in that pattern
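
A toy C sketch of the stride-detection logic for one access stream (the structure and field names are illustrative, not a real design):

#include <stdint.h>

typedef struct {
    uint64_t last_addr;   /* previous address seen */
    int64_t  stride;      /* last observed address difference */
    int      confident;   /* same stride seen twice in a row? */
} stride_entry_t;

/* Feed in each new address; returns an address to prefetch,
   or 0 if no stable stride has been observed yet. */
static uint64_t stride_observe(stride_entry_t *e, uint64_t addr)
{
    int64_t delta = (int64_t)(addr - e->last_addr);
    e->confident  = (delta == e->stride);
    e->stride     = delta;
    e->last_addr  = addr;
    return e->confident ? addr + delta : 0;
}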

24
Q

What is correlating prefetching and what is it good for?

A

Remembers arbitrary repeating patterns rather than fixed strides or streams (if we fetched A and then B in the past, then the next time we fetch A, prefetch B). Good for pointer-chasing structures such as linked lists.

25
Q

What is loop interchange?

A
  • Compiler optimization to increase locality
  • In a nested loop, swap the inner and outer loops so that the inner loop iterates over a contiguous block of memory
  • Not always legal: the interchanged loops must compute the same result (proved by showing there are no dependences between the iterations); see the sketch below
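
A classic C illustration, assuming a row-major array: the first version jumps a whole row between consecutive accesses, while the interchanged version walks contiguous memory. The interchange is legal here because every iteration is independent.

#define N 1024
static double a[N][N];

/* Before: the inner loop walks down a column, touching a new
   cache block (a new row) on every iteration. */
void scale_bad(void)
{
    for (int j = 0; j < N; j++)
        for (int i = 0; i < N; i++)
            a[i][j] *= 2.0;
}

/* After interchange: the inner loop walks along a row, so
   consecutive accesses fall in the same cache block. */
void scale_good(void)
{
    for (int i = 0; i < N; i++)
        for (int j = 0; j < N; j++)
            a[i][j] *= 2.0;
}
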
26
Q

How to reduce the miss penalty?

A

Overlap misses (with a non-blocking cache) and use a multi-level cache hierarchy

27
Q

How to overlap misses?

A

Have a non-blocking cache that supports:
  • hit under miss: allow other hits to proceed while we are fetching a miss
  • miss under miss: while we are fetching one miss, allow misses to other blocks to be fetched in parallel

28
Q

How to implement miss under miss?

A

Requires hardware support: Miss Status Handling Registers (MSHRs), which keep track of which block fetches are currently in progress and which requests are waiting on them
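
A rough C sketch of what one MSHR entry might track (the fields and sizes are illustrative assumptions, not a real design):

#include <stdbool.h>
#include <stdint.h>

#define MAX_TARGETS 4   /* requests that can merge into one outstanding fetch */

typedef struct {
    bool     valid;        /* is a fetch in flight for this entry? */
    uint64_t block_addr;   /* which block is being fetched */
    int      num_targets;  /* requests waiting on this block */
    struct {
        bool     is_load;
        uint8_t  dest_reg;       /* where to deliver a load's data */
        uint16_t block_offset;   /* which bytes within the block */
    } targets[MAX_TARGETS];
} mshr_t;

/* A new miss searches all MSHRs: a matching block_addr means the block
   is already being fetched (a "half miss"), so the request merges in as
   another target; no match allocates a free entry; if all entries are
   busy, the cache stalls. */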

29
Q

What is a miss vs half miss?

A

A (full) miss would occur even in a blocking cache. A half miss can only occur in a non-blocking cache: we miss on a block that is already being fetched, so instead of issuing a new fetch we just wait for the outstanding one.

30
Q

What is the cache hierarchy?

A

L1, L2, … LLC (the last-level cache)

31
Q

What is the local hit rate?

A

The hit rate that a single cache level observes (i.e., considering only the requests that reach this level, what fraction hit?)

32
Q

What is the global hit rate?

A

The hit rate measured against all processor accesses, not just the ones that reach this level; it describes the cache hierarchy as a whole
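
A worked example with made-up numbers: if L1's hit rate is 90% and L2's local hit rate is 75%, only 10% of all accesses ever reach L2, so

\[ \text{global L2 hit rate} = 0.10 \times 0.75 = 7.5\% \]

and 97.5% of all accesses hit somewhere in the two-level hierarchy.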

33
Q

What is MPKI?

A

Misses per 1000 instructions; another metric for the overall miss behavior of the entire cache hierarchy
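
The formula, with a made-up example:

\[ \text{MPKI} = \frac{\text{misses}}{\text{instructions} / 1000} \]

e.g., 2,000,000 misses over 100,000,000 instructions gives 2,000,000 / 100,000 = 20 MPKI.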

34
Q

What is inclusion?

A

If there is a block in L1, then it must be in L2

35
Q

What is exclusion?

A

If there is a block in L1, then it must NOT be in L2

36
Q

Do we prefer inclusion or exclusion?

A

Inclusion; it is better for write-backs, since a dirty block evicted from L1 is guaranteed to find its block present in L2