Datapath And Pipelining Flashcards

Question

Factors Limiting Performance

Answer 1

Theoretical speedup of ‘n’ (the number of pipeline stages) is not delivered in practice:

* Not all instructions require all pipeline stages.
* Overheads of filling/emptying the pipeline.
* Dependencies between instructions.
* The clock has to be set for the slowest stage in the pipeline.

Note: instructions cannot ‘skip ahead’ in the pipeline! They must go through every stage.
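The fill overhead and the slowest-stage clock can be put into numbers with a small sketch. The stage latencies below are illustrative assumptions, not values from these cards, and the function names are hypothetical.

```python
def single_cycle_time(stage_times_ps, num_instructions):
    # Without pipelining, each instruction takes the sum of all stage latencies.
    return sum(stage_times_ps) * num_instructions


def pipelined_time(stage_times_ps, num_instructions):
    # The clock period is set by the slowest stage. The first instruction
    # needs n cycles to fill the pipeline; after that, one completes per cycle.
    n = len(stage_times_ps)
    clock = max(stage_times_ps)
    return (n + num_instructions - 1) * clock


# Assumed latencies for IF, ID, EX, MEM, WB in picoseconds (illustrative only).
stage_times_ps = [200, 100, 200, 200, 100]

for k in (3, 1_000_000):
    speedup = single_cycle_time(stage_times_ps, k) / pipelined_time(stage_times_ps, k)
    print(k, round(speedup, 2))
```

With these assumed latencies the speedup approaches 800/200 = 4 rather than the stage count of 5, because the unbalanced stages force a 200 ps clock, and for short instruction sequences the fill time lowers it further.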

Answer 2

Control signals are derived from the instruction:

* As in the single-cycle implementation, but
* Pass the control signals along just like the data.

Answer 3

A structural hazard occurs when a required resource is busy.

Answer 4

Situations that prevent starting the next instruction in the next cycle.

Answer 5

Need to wait for previous instruction to complete its data read/write.

Answer 6

Deciding on control action depends on previous instruction…

Answer 7

Hazards may require us to stall the pipeline, effectively wasting a cycle. The wasted slot is known as a “bubble”.

Answer 8

**Force control values in ID/EX register to 0:**

* EX, MEM and WB do nop (no-operation).

**Prevent update of PC and IF/ID register:**

* The current instruction is decoded again;
* The following instruction is fetched again;
* The 1-cycle stall allows MEM to read data for lw;
* Can subsequently forward to the EX stage.

Answer 9

A structural hazard is a conflict for use of a resource.

* In a constrained MIPS pipeline with a single memory for both data and instructions:
  * Load/store requires data access;
  * Instruction fetch would have to stall for that cycle…
  * Would cause a pipeline “bubble”.
* Hence, pipelined datapaths require separate instruction/data memories.
* Or, at least, separate instruction/data caches.

Answer 10

A data hazard occurs when an instruction depends on the completion of a data access by a previous instruction. One method of solving this is forwarding:

* Use the result as soon as it is computed;
* Don’t wait for it to be stored in a register;
* Requires extra connections in the datapath.

Answer 11

* If the value is not computed when needed…
* We can’t forward backwards in time!

Answer 12

**Result available at end of EX phase (in pipeline register)…**

* No need to wait until WB phase;
* Add extra logic to feed this result back to ALU input;
* If hazard detected, logic selects forwarded ALU input, discarding erroneous register value.

**Can also forward result from MEM stage:**

* Either value fetched from memory, or
* ALU result passed on from previous clock cycle.

Answer 13

**MEM hazard**

```
if (MEM/WB.RegWrite and (MEM/WB.RegisterRd ≠ 0)
    and not (EX/MEM.RegWrite and (EX/MEM.RegisterRd ≠ 0)
             and (EX/MEM.RegisterRd = ID/EX.RegisterRs))
    and (MEM/WB.RegisterRd = ID/EX.RegisterRs))
  ForwardA = 01

if (MEM/WB.RegWrite and (MEM/WB.RegisterRd ≠ 0)
    and not (EX/MEM.RegWrite and (EX/MEM.RegisterRd ≠ 0)
             and (EX/MEM.RegisterRd = ID/EX.RegisterRt))
    and (MEM/WB.RegisterRd = ID/EX.RegisterRt))
  ForwardB = 01
```

**EX hazard**

* if (EX/MEM.RegWrite and (EX/MEM.RegisterRd ≠ 0) and (EX/MEM.RegisterRd = ID/EX.RegisterRs)) ForwardA = 10
* if (EX/MEM.RegWrite and (EX/MEM.RegisterRd ≠ 0) and (EX/MEM.RegisterRd = ID/EX.RegisterRt)) ForwardB = 10
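The EX and MEM hazard conditions can be sketched as a small forwarding-unit model. This is a minimal illustration, not a hardware description: the `PipelineRegs` class and its field names are hypothetical stand-ins for the pipeline register fields named in the conditions.

```python
from dataclasses import dataclass


@dataclass
class PipelineRegs:
    # Hypothetical snapshot of the pipeline register fields used for forwarding.
    id_ex_rs: int
    id_ex_rt: int
    ex_mem_reg_write: bool
    ex_mem_rd: int
    mem_wb_reg_write: bool
    mem_wb_rd: int


def forward_select(regs, src):
    """Return the 2-bit forwarding mux select for one ALU source register."""
    ex_hazard = regs.ex_mem_reg_write and regs.ex_mem_rd != 0 and regs.ex_mem_rd == src
    mem_hazard = regs.mem_wb_reg_write and regs.mem_wb_rd != 0 and regs.mem_wb_rd == src
    if ex_hazard:
        return 0b10  # take the most recent result, from EX/MEM
    if mem_hazard:
        return 0b01  # only when there is no EX hazard for this source
    return 0b00      # no hazard: use the register file value


def forwarding_unit(regs):
    # ForwardA covers Rs, ForwardB covers Rt.
    return forward_select(regs, regs.id_ex_rs), forward_select(regs, regs.id_ex_rt)


# Third instruction of: add $1,$1,$2 ; add $1,$1,$3 ; add $1,$1,$4
regs = PipelineRegs(id_ex_rs=1, id_ex_rt=4,
                    ex_mem_reg_write=True, ex_mem_rd=1,
                    mem_wb_reg_write=True, mem_wb_rd=1)
forward_a, forward_b = forwarding_unit(regs)
print(format(forward_a, "02b"), format(forward_b, "02b"))
```

Checking the EX hazard first plays the role of the `not (EX/MEM...)` guard in the MEM hazard condition: when both earlier instructions write the same register, the newer value in EX/MEM wins.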

Answer 14

Check when the using instruction is decoded in the ID stage. ALU operand register numbers in the ID stage are given by:

* IF/ID.RegisterRs, IF/ID.RegisterRt

**Load-use hazard when**

* ID/EX.MemRead and ((ID/EX.RegisterRt = IF/ID.RegisterRs) or (ID/EX.RegisterRt = IF/ID.RegisterRt))

**If detected, stall and insert bubble**
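The load-use check can be written as a short predicate. The sketch below is illustrative only: the function names and the `stall_signals` output fields are hypothetical, chosen to mirror the stall actions of zeroing the ID/EX control values and holding the PC and IF/ID register.

```python
def load_use_hazard(id_ex_mem_read, id_ex_rt, if_id_rs, if_id_rt):
    # The instruction in EX is a load whose destination (Rt) is a source
    # of the instruction currently being decoded in ID.
    return bool(id_ex_mem_read) and id_ex_rt in (if_id_rs, if_id_rt)


def stall_signals(hazard):
    # Hypothetical control outputs: hold PC and IF/ID, and zero the
    # control values written into ID/EX so that a bubble is inserted.
    return {
        "pc_write": not hazard,
        "if_id_write": not hazard,
        "zero_id_ex_control": hazard,
    }


# lw $2, 20($1) followed by and $4, $2, $5: the loaded register is used at once.
hazard = load_use_hazard(id_ex_mem_read=True, id_ex_rt=2, if_id_rs=2, if_id_rt=5)
print(hazard, stall_signals(hazard))
```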

Answer 15

Stalls reduce performance, but they are sometimes required to get correct results. The compiler can arrange code to avoid hazards and the stalls they cause.

Answer 16

Reorder code to avoid using a load result in the instruction immediately after the load.
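The effect of reordering can be checked with a tiny stall counter. This is a simplified sketch: instructions are modelled as hypothetical `(op, dest, sources)` tuples, forwarding is assumed to resolve everything except a load followed immediately by a use, and the instruction sequence is an illustrative example for computing two sums from three loaded values.

```python
def count_load_use_stalls(program):
    """Count one stall for every load whose result is used by the next instruction."""
    stalls = 0
    for prev, curr in zip(program, program[1:]):
        prev_op, prev_dest, _ = prev
        _, _, curr_sources = curr
        if prev_op == "lw" and prev_dest in curr_sources:
            stalls += 1
    return stalls


original = [
    ("lw", "t1", ("t0",)),
    ("lw", "t2", ("t0",)),
    ("add", "t3", ("t1", "t2")),  # uses t2 right after its load
    ("sw", None, ("t3", "t0")),
    ("lw", "t4", ("t0",)),
    ("add", "t5", ("t1", "t4")),  # uses t4 right after its load
    ("sw", None, ("t5", "t0")),
]

# Same instructions, with the third load moved up so no load is used immediately.
reordered = [original[0], original[1], original[4],
             original[2], original[3], original[5], original[6]]

print(count_load_use_stalls(original), count_load_use_stalls(reordered))
```

Under these assumptions the original order stalls twice and the reordered version not at all, so on a 5-stage pipeline the seven instructions take 13 cycles before reordering and 11 after.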

Answer 17

**Branch determines the flow of control**

* Fetching the next instruction depends on the branch outcome;
* Pipeline won’t always fetch the correct instruction:
  * Still working on the ID stage of the branch…

**In MIPS pipeline**

* Need to compare registers and compute target early in the pipeline;
* Add hardware to do it in the ID stage.

Answer 18

* Remove branch hazard by redefining semantics of branch instructions:
  * The instruction immediately following the branch instruction (in the “branch delay slot”) is always executed;
  * Rely on the compiler to fill the slot with something useful (or nop if it can’t).

**Branch Prediction**

Alternatively, the outcome of the branch can be predicted.

* Longer pipelines can’t readily determine branch outcome early:
  * Stall penalty becomes unacceptable.

**Predict outcome of branch:**

* Only stall if the prediction is wrong.

In MIPS pipeline:

* Can predict branches not taken;
* Fetch instruction after branch, with no delay.

Answer 19

**Static branch prediction**

* Based on typical branch behaviour
* Example: loop and if-statement branches
  * Predict backward branches taken
  * Predict forward branches not taken

**Dynamic branch prediction**

* Hardware measures actual branch behaviour
  * e.g., record recent history of each branch
* Assume future behaviour will continue the trend
  * When wrong, stall while re-fetching, and update history
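The static rule of predicting backward branches taken and forward branches not taken fits in one comparison. A minimal sketch, with a hypothetical function name and made-up addresses:

```python
def predict_taken_static(branch_pc, target_pc):
    # A backward branch (target below the branch) is typically a loop branch,
    # so predict taken. A forward branch is predicted not taken.
    return target_pc < branch_pc


print(predict_taken_static(branch_pc=0x100, target_pc=0x0F0))  # backward branch
print(predict_taken_static(branch_pc=0x100, target_pc=0x120))  # forward branch
```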

Answer 20

In deeper pipelines, the branch penalty is more significant. Use dynamic prediction:

* Branch prediction buffer (aka branch history table);
* Indexed by recent branch instruction addresses;
* Stores outcome (taken/not taken).
* To execute a branch:
  * Check table, expect the same outcome;
  * Start fetching from fall-through or target;
  * If wrong, flush pipeline and flip prediction.
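A 1-bit branch prediction buffer of this kind can be sketched as a small table. The class name, the table size and the indexing by low-order word-address bits are assumptions made for illustration.

```python
class BranchPredictionBuffer:
    """1-bit predictor: each entry remembers the last outcome seen (assumed design)."""

    def __init__(self, num_entries=16):
        self.num_entries = num_entries
        self.table = [False] * num_entries  # False means predict not taken

    def _index(self, branch_pc):
        # Assumes 4-byte instructions: drop the byte offset, keep the low bits.
        return (branch_pc >> 2) % self.num_entries

    def predict(self, branch_pc):
        return self.table[self._index(branch_pc)]

    def update(self, branch_pc, taken):
        # With one bit, flipping on a misprediction equals storing the outcome.
        self.table[self._index(branch_pc)] = taken


buffer = BranchPredictionBuffer()
mispredictions = 0
for taken in [True, True, False]:
    if buffer.predict(0x40) != taken:
        mispredictions += 1  # a real pipeline would flush here
    buffer.update(0x40, taken)
print(mispredictions)
```

Because the table is indexed by only part of the address, two different branches can share an entry. This sketch ignores that aliasing.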

Answer 21

* Inner loop branches mispredicted twice!
  * Mispredict as taken on the last iteration of the inner loop
  * Then mispredict as not taken on the first iteration of the inner loop next time around
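The double misprediction can be reproduced by replaying an inner-loop branch through a 1-bit predictor. The loop shape (three taken outcomes followed by one not taken, repeated for five outer iterations) and the initial prediction are illustrative assumptions.

```python
def mispredictions_one_bit(outcomes, initial_prediction=True):
    # A 1-bit predictor always predicts the previous outcome of the branch.
    prediction = initial_prediction
    wrong = 0
    for taken in outcomes:
        if prediction != taken:
            wrong += 1
            prediction = taken  # flip the single history bit
    return wrong


inner_loop = [True, True, True, False]  # taken three times, then fall out
outcomes = inner_loop * 5               # five passes of the outer loop
print(mispredictions_one_bit(outcomes))
```

Starting from a taken prediction, the first pass mispredicts once (on loop exit) and each later pass mispredicts twice, giving 9 mispredictions in 20 branches.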

Answer 22

Only change prediction on two successive mispredictions
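One common way to realise this is a 2-bit saturating counter, sketched below as an assumption rather than as the exact state machine the card has in mind. From either strong state (0 or 3) it takes two successive mispredictions to change the prediction. The branch pattern and starting state are illustrative.

```python
def mispredictions_two_bit(outcomes, counter=3):
    # Counter values 0-1 predict not taken, 2-3 predict taken.
    wrong = 0
    for taken in outcomes:
        predicted_taken = counter >= 2
        if predicted_taken != taken:
            wrong += 1
        # Move towards the actual outcome, saturating at 0 and 3.
        counter = min(counter + 1, 3) if taken else max(counter - 1, 0)
    return wrong


inner_loop = [True, True, True, False]  # taken three times, then fall out
outcomes = inner_loop * 5               # five passes of the outer loop
print(mispredictions_two_bit(outcomes))
```

On this pattern only the loop exit is mispredicted, once per pass, so the count is 5 out of 20 branches. The single not-taken outcome moves the counter from 3 to 2, which still predicts taken when the loop is re-entered.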

Answer 23

A superscalar processor is a CPU that implements a form of parallelism called instruction-level parallelism within a single processor.

(49 cards)