Caches Flashcards
The memory wall
processor speed is improving at a faster rate than memory speed, so memory access is an increasingly dominant cost
temporal locality
recently referenced data is likely to be referenced again soon. Reactive
spatial locality
more likely to reference data near recently referenced data. Proactive
Are temporal and spatial locality used for both data and instructions?
Yes
How to find average memory access time
latency_avg = latency_hit + %_miss*latency_miss
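Worked example (numbers assumed for illustration): with latency_hit = 1ns, %_miss = 5%, and latency_miss = 20ns, latency_avg = 1 + 0.05 × 20 = 2ns.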
Primary caches
split into instruction (I$) and data (D$) caches; on chip (with CPU), made of SRAM (same circuit type as CPU)
2nd level caches
on chip (with CPU), made of SRAM; unified (holds both instructions and data)
How large are primary caches?
8KB to 64KB
How large are second level caches?
typically 512KB to 16MB
4th level cache = main memory
Made of DRAM
How large is fourth level cache?
1GB to 4GB for desktops; servers can have much more
5th level cache
disk/SSD (swap and files)
How much of a processor's area is cache?
30-70%
Static RAM (SRAM)
6 transistors per bit
optimized for speed first, density second
fast (sub-nanosecond latency for small SRAM)
speed proportional to area
integrates well with standard processor logic
Dynamic RAM (DRAM)
1 transistor + 1 capacitor per bit
optimized for density
slow (>40ns internal access, ~100ns pin-to-pin)
requires different fabrication steps (doesn't integrate as well with processor logic)
Nonvolatile storage
magnetic disk, flash, STT, Re-RAM, PCM
Cache Lookup Algorithm
Read frame indicated by index bits
“Hit” if tag matches and valid bit is set, otherwise miss
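A minimal C sketch of this lookup for a direct-mapped cache (the frame count, block size, and struct layout are assumptions for illustration):

    #include <stdbool.h>
    #include <stdint.h>

    #define NUM_FRAMES 256                 /* assumed: 256 frames     */
    #define BLOCK_SIZE 64                  /* assumed: 64-byte blocks */

    typedef struct {
        bool     valid;
        uint32_t tag;
        uint8_t  data[BLOCK_SIZE];
    } frame_t;

    frame_t cache[NUM_FRAMES];

    /* Index bits pick one frame; hit iff the tag matches and valid is set. */
    bool lookup(uint32_t addr) {
        uint32_t index = (addr / BLOCK_SIZE) % NUM_FRAMES;  /* index bits */
        uint32_t tag   = addr / (BLOCK_SIZE * NUM_FRAMES);  /* tag bits   */
        frame_t *f = &cache[index];
        return f->valid && f->tag == tag;
    }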
Fill path also called what?
backside
Cache controller
finite state machine - remembers miss address, accesses next level, waits for response, writes data and tag in proper locations
%miss (miss rate)
# misses / # accesses
t_hit (hit time)
time to read data from (write data to) cache
t_miss (miss penalty)
time to read data into cache
Average access time: t_avg
t_hit + %miss * t_miss
what roughly determines t_hit
cache capacity and circuits
what roughly determines t_miss
lower level memory structures
How to measure %_miss?
hardware performance counters, simulation, paper simulation
how to find offset
log_2(block size)
how to find index
log_2(number of sets)
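A small C sketch of splitting an address into offset, index, and tag bits (the 64-byte block / 128-set geometry is an assumption):

    #include <stdint.h>

    #define OFFSET_BITS 6   /* log_2(block size)     = log_2(64)  */
    #define INDEX_BITS  7   /* log_2(number of sets) = log_2(128) */

    uint32_t addr_offset(uint32_t addr) { return addr & ((1u << OFFSET_BITS) - 1); }
    uint32_t addr_index(uint32_t addr)  { return (addr >> OFFSET_BITS) & ((1u << INDEX_BITS) - 1); }
    uint32_t addr_tag(uint32_t addr)    { return addr >> (OFFSET_BITS + INDEX_BITS); }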
How to reduce %miss?
increase capacity
increase block size
What happens if you increase cache capacity?
reduce % miss, but t_hit increases
What is t_hit latency proportional to?
sqrt(capacity)
What are the advantages of increasing block size?
reduce %miss
reduce tag overhead
What are the disadvantages of increasing block size?
potentially useless data transfer
premature replacement of useful data
For same size cache, will increasing the block size increase or reduce the tag overhead?
Increasing the block size will reduce the tag overhead
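Worked example (geometry assumed): a 32KB direct-mapped cache with 32-bit addresses. With 32B blocks there are 1024 frames, so tag = 32 − 10 (index) − 5 (offset) = 17 bits, for 1024 × 17 = 17,408 tag bits total. With 64B blocks there are 512 frames, tag = 32 − 9 − 6 = 17 bits, for 512 × 17 = 8,704 tag bits: half the overhead for the same capacity.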
Effects of block size on miss rate
spatial prefetching
interference
Spatial prefetching
good: for blocks with adjacent addresses, turns miss/miss into miss/hit
Interference
bad: for blocks with non-adjacent addresses but adjacent frames; turns hits into misses by disallowing simultaneous residence
What offsets the time to read/transfer/fill a larger block?
critical word first/early restart
Can critical word first/early restart help with a cluster of misses?
No. Reads/transfers/fills of two misses can’t happen simultaneously
Name for a frame group
set
Each frame in set
way
Pros and cons of increasing set associativity
pro: reduces conflicts
con: increases t_hit (additional tag match and muxing)
Lookup algorithm for multi-way set associative cache
Use index bits to find the set
read data/tags in all frames in that set in parallel
if match and valid bit, hit
NMRU and miss handling
Add MRU bits to each set, hit will update MRU, miss will replace any way but MRU
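A C sketch of set-associative lookup with NMRU replacement (the way count and the rand()-based victim choice are assumptions; hardware checks all ways in parallel rather than looping):

    #include <stdbool.h>
    #include <stdint.h>
    #include <stdlib.h>

    #define NUM_WAYS 4                     /* assumed 4-way associativity */

    typedef struct {
        bool     valid;
        uint32_t tag;
    } way_t;

    typedef struct {
        way_t ways[NUM_WAYS];
        int   mru;                         /* most recently used way */
    } set_t;

    /* Hit iff some way's tag matches and its valid bit is set; a hit
       updates the MRU bits. */
    bool lookup(set_t *set, uint32_t tag) {
        for (int w = 0; w < NUM_WAYS; w++) {
            if (set->ways[w].valid && set->ways[w].tag == tag) {
                set->mru = w;
                return true;
            }
        }
        return false;
    }

    /* On a miss, NMRU replaces any way except the MRU one. */
    int nmru_victim(const set_t *set) {
        int victim;
        do { victim = rand() % NUM_WAYS; } while (victim == set->mru);
        return victim;
    }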
Can split data and tags into two different arrays, so can access in parallel. Why are multi-way associative caches still slower than direct mapped caches?
Still more logic in the critical path than a direct-mapped cache (an additional multiplexor), so t_hit is slower
Pros and cons of higher associative caches
Pro: have better (lower) % miss
Con: T_hit increases - the more associative, the slower
Why are instruction caches smaller/simpler?
they never have to handle writes/stores (instructions are read-only)
Why are writes slower than reads?
For reads, can read tag and data in parallel
Stages of write pipeline
1) match tag
2) write to matching way
bypass to avoid load stalls, may introduce structural hazards
Two options for when to propagate new value to lower level memory
1) write through
2) write back
Write Through
on hit, update cache
immediately send the write to the next level
Write Back
write to lower level when block is replaced
requires an extra dirty bit per block
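A C sketch contrasting the two policies on a store hit (the frame layout and the write_next_level stand-in are assumptions):

    #include <stdbool.h>
    #include <stdint.h>
    #include <stdio.h>

    #define BLOCK_SIZE 64

    typedef struct {
        bool     valid, dirty;   /* dirty bit only needed for write-back */
        uint32_t tag;
        uint8_t  data[BLOCK_SIZE];
    } frame_t;

    /* Illustrative stand-in for the next level of the hierarchy. */
    static void write_next_level(uint32_t addr, uint8_t v) {
        printf("write 0x%02x to next level at 0x%08x\n", v, addr);
    }

    /* Write-through: update the cache and immediately send the write down. */
    void store_write_through(frame_t *f, uint32_t addr, uint8_t v) {
        f->data[addr % BLOCK_SIZE] = v;
        write_next_level(addr, v);
    }

    /* Write-back: update the cache and set the dirty bit; the write reaches
       the next level only when the block is replaced. */
    void store_write_back(frame_t *f, uint32_t addr, uint8_t v) {
        f->data[addr % BLOCK_SIZE] = v;
        f->dirty = true;
    }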
Writeback buffer (WBB)
keeps writes off the critical path
1) send fill request to next level
2) while waiting, write dirty block to buffer
3) when new block arrives, put it into cache
4) write buffer sends contents to next level
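The same four steps as a C sketch (all helper names are illustrative stand-ins, not a real interface):

    #include <stdbool.h>
    #include <stdint.h>
    #include <stdio.h>

    static void request_fill(uint32_t addr)  { printf("fill request 0x%x\n", addr); }
    static void wbb_insert(uint32_t addr)    { printf("dirty block 0x%x -> WBB\n", addr); }
    static void install_block(uint32_t addr) { printf("install 0x%x in cache\n", addr); }
    static void wbb_drain(void)              { printf("WBB contents -> next level\n"); }

    /* Miss handling with a WBB: the dirty victim is parked in the buffer
       so the writeback stays off the fill's critical path. */
    void handle_miss(uint32_t miss_addr, uint32_t victim_addr, bool victim_dirty) {
        request_fill(miss_addr);        /* 1) send fill request to next level    */
        if (victim_dirty)
            wbb_insert(victim_addr);    /* 2) while waiting, dirty block -> WBB  */
        install_block(miss_addr);       /* 3) when new block arrives, fill cache */
        wbb_drain();                    /* 4) WBB sends contents to next level   */
    }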
Disadvantages of write through
requires additional bus bandwidth
without a write buffer, must wait for writes to complete to memory
Advantages of write through
Easier to implement, no need for dirty bits in cache
Don’t have to deal with coherence traffic at this cache level
Simplifies miss handling (no write back buffer step)
Advantage of Write back
Uses less bandwidth since some writes don’t go to memory (also saves power)
Read vs Write Miss
Read miss: load can’t go on w/o data, must stall
Write miss: no instruction waiting for data, so don’t need to stall
Store buffer
writes to D$ in background
eliminates stalls on write misses
loads must search store buffer in addition to D$
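A C sketch of the load-side check (the buffer depth, entry layout, and newest-first search order are assumptions):

    #include <stdbool.h>
    #include <stdint.h>

    #define SB_ENTRIES 8                   /* assumed store buffer depth */

    typedef struct {
        bool     valid;
        uint32_t addr;
        uint32_t value;
    } sb_entry_t;

    sb_entry_t store_buffer[SB_ENTRIES];   /* highest index = newest entry */

    /* A load must search the store buffer in addition to the D$, since an
       older store to the same address may not have reached the D$ yet. */
    bool load_check_store_buffer(uint32_t addr, uint32_t *value) {
        for (int i = SB_ENTRIES - 1; i >= 0; i--) {
            if (store_buffer[i].valid && store_buffer[i].addr == addr) {
                *value = store_buffer[i].value;   /* forward store data  */
                return true;
            }
        }
        return false;                             /* fall back to the D$ */
    }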
Store vs. writeback buffer
store buffer: in front of D$, hides store misses
writeback buffer: behind D$, hides write backs
Write-allocate is used with what type of write (write back or write through)?
write back
Write-allocate
when a write miss occurs, allocate a frame in the cache for the miss data
Advantage of write allocate
decreases read misses
No write allocate
when a write miss occurs, just write to next level, no need to allocate a cache frame for the miss data
Pros/cons of no-write-allocate
potentially more read misses, but doesn’t use a frame in the cache
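A C sketch contrasting the two write-miss policies (all helper names are illustrative stand-ins):

    #include <stdint.h>
    #include <stdio.h>

    static void allocate_and_fill(uint32_t addr)          { printf("fill frame for 0x%x\n", addr); }
    static void write_into_cache(uint32_t addr, uint8_t v) { printf("D$[0x%x] = %u\n", addr, v); }
    static void write_next_level(uint32_t addr, uint8_t v) { printf("next level 0x%x = %u\n", addr, v); }

    /* Write-allocate: a write miss first brings the block into the cache. */
    void write_miss_allocate(uint32_t addr, uint8_t v) {
        allocate_and_fill(addr);      /* allocate a frame, fetch the block */
        write_into_cache(addr, v);    /* then perform the write locally    */
    }

    /* No-write-allocate: a write miss just writes around the cache. */
    void write_miss_no_allocate(uint32_t addr, uint8_t v) {
        write_next_level(addr, v);    /* no frame is allocated             */
    }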
4 types of cache miss
compulsory, capacity, conflict, coherence
compulsory cache miss
never seen this address before, would miss in infinite cache
capacity cache miss
miss caused because cache is too small (would miss in fully associative cache)
conflict cache miss
miss caused because cache associativity is too low
coherence cache miss
miss due to external invalidations in shared memory multiprocessors and multicores
How does larger block size affect the 3 C's and hit rate?
decreases compulsory misses (spatial locality)
increases conflict misses (fewer frames)
can increase t_miss - reading more bytes from next level
no significant effect on t_hit
How does larger cache affect the 3 C's and hit rate?
decreases capacity miss
increases t_hit
How does higher associativity affect the 3 C's and hit rate?
decreases conflict misses
increases t_hit
local hit/miss rate
percent of references to this cache that hit: # hits / total accesses to this cache
local miss rate = # misses / total accesses to this cache = 100% − local hit rate
global hit/miss rate
# misses / total # of memory references
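Worked example (numbers assumed): if a program makes 1000 memory references, 100 miss in the L1, and 20 of those also miss in the L2, the L2's local miss rate is 20/100 = 20%, while its global miss rate is 20/1000 = 2%.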
inclusive caches
a block in the L1 is always in the L2
works well with a write-through L1
coherence traffic only needs to check L2
exclusive caches
block is either in L1 or L2 (never both)
holds more data
coherence traffic must check both L1 and L2
Give reads priority over writes
read must check contents of the WBB since it could hold the read value
reduces write costs in a writeback cache: if a read miss will replace a dirty block, write the dirty block to the WBB, read memory, then write the WBB to memory
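A C sketch of that read-miss sequence (the WBB interface here is an illustrative stand-in):

    #include <stdbool.h>
    #include <stdint.h>
    #include <stdio.h>

    static bool wbb_lookup(uint32_t addr, uint32_t *v) { (void)addr; (void)v; return false; }
    static void wbb_insert(uint32_t addr)       { printf("dirty 0x%x -> WBB\n", addr); }
    static void wbb_drain(void)                 { printf("WBB -> memory\n"); }
    static uint32_t read_next_level(uint32_t a) { printf("read 0x%x\n", a); return 0; }

    /* Read miss replacing a dirty block, with the read given priority:
       search the WBB first (it may still hold the value), park the dirty
       victim in the WBB, read memory, and only then let the WBB write. */
    uint32_t read_miss(uint32_t addr, uint32_t victim_addr, bool victim_dirty) {
        uint32_t v;
        if (wbb_lookup(addr, &v))
            return v;                   /* value found in the WBB itself     */
        if (victim_dirty)
            wbb_insert(victim_addr);    /* 1) dirty block -> WBB, not memory */
        v = read_next_level(addr);      /* 2) service the read first         */
        wbb_drain();                    /* 3) then the WBB writes to memory  */
        return v;
    }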