Branch Prediction Flashcards

Question 1

Q

What is Not Taken prediction?

Answer

A

Always predict that the branch is not taken - every processor uses at lease this since it is an easy performance gain

Question 2

Q

What is the BTB?

Answer

A

Branch Target Buffer - keeps track of where we went to last time we took this branch. Update this table after we’ve determined if we’ve taken the branch or not. Can be used as a predictor or in coordination with one.

Question 3

Q

What is the structure of the BTB?

Answer

A

Want to keep it small, so maintaining an entry for each instruction is unrealistic. Use least significant bits to index into BTB

Question 4

Q

What is a 1-bit predictor?

Answer

A

Has a branch history table that keeps track of whether we took this branch last time or not. If taken, has BTB to check where we are going.

Question 5

Q

What is a 1-bit predictor good and bad for?

Answer

A

Good for when we always (or almost always) are T or NT. Terrible at random N/NT

Question 6

Q

What is a 2-bit predictor (2BP, 2BC)?

Answer

A

Have a prediction bit and hysteresis (conviction bit). Improvement from 1-bit predictor. Initialization state doesn’t matter much

Question 7

Q

3+ Bit Predictors

Answer

A

More costly than 2BP, and only improve things when ‘anomalous’ decisions come in streaks. Usually better to stay with 2BP

Question 8

Q

What aspect of bit-predictors do history predictors aim to improve upon?

Answer

A

Having a higher performance on pattern based behavior

Question 9

Q

What is a 1-Bit History Predictor?

Answer

A

BHT has 1 bit for the history and 2-2BC. If the history is 0, we look at the first BC, if it’s one, we look at the second. When updating, we update the BC for the history, then change the history to what we observed

Question 10

Q

What is a 2-Bit History Predictor?

Answer

A

BHT has 2 bits for the history and 4-2BC.

Question 11

Q

What is an N-Bit History Predictor? What will is predict correctly? What is the cost?

Answer

A

N Bits of history, 2^N-2BC. Will correctly predict all patterns with length <= N+1. Most counters are not used. Cost is (N+2*2^N) bits for each address

Question 12

Q

What is a PShared predictor?

Answer

A

History with Shared Counters Predictor. P == private history, shared == shared counters. Aims to solve the waste of the n-bit predictor

Question 13

Q

How is a PShared predictor structured?

Answer

A

Pattern history table (PHT) - contains just the history bits for each address
Use PC to index into PHT, XOR address offset with PHT history -> index into BHT (single 2BP)

Question 14

Q

What are the pros/cons for a PShared predictor?

Answer

A

(+) Allows for many bits in the history table without a counter for each one
(-) possible collisions in BHT
Good for patterns and 8-iteration loop

Question 15

Q

What is a GShared Predictor?

Answer

A

A predictor with global history and shared predictors. Use a single history to predict all branches (XOR PC with global history)

Question 16

Q

What is a GShared predictor good for?

Answer

A

Correlated branches (when one branch depends on the outcome of another)

Question 17

Q

What is a tournament predictor?

Answer

A

Contains two good predictors and a 2BC (meta-predictor). 2BC keeps track of which prediction we should use. Outcome is updated for each predictor, and meta is updated based on which one was right (no change if both were right/wrong)

Question 18

Q

What is a hierarchical predictor?

Answer

A

Contains a good and okay predictors. Update okay predictor for every branch, update good predictor only when the okay one does poorly (don’t want to use the good one if you don’t have to). Better than tournament predictor

Question 19

Q

What is a RAS Predictor?

Answer

A

Return Address Stack. Has a small stack (Return Address Table, “RAT”). When a function is called, its return address is pushed onto the stack. If it becomes full, wrap around to the beginning.

Question 20

Q

How do we know if an instruction is a return instruction (and therefore we should use the RAS predictor)?

Answer

A

can use a simple predictor
Can use precoding - when processor fetches instruction from memory, it stores it in the cache as well as whether or not it’s a RET