Chapter 4 - The Processor Flashcards

Question

What is load-use hazard detection and what does it do?

Answer 1

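The check is made when the using instruction (the one that reads the loaded register) is decoded in the ID stage. If the instruction ahead of it, now in EX, is a load whose destination register matches one of the using instruction's source registers, the pipeline stalls for one cycle and a bubble is inserted.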
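
The check described above can be sketched in a few lines. This is a minimal illustration, assuming MIPS-style pipeline register fields; the names IDEX, IFID, mem_read, rs and rt are chosen here for illustration and do not come from the card.

```python
from dataclasses import dataclass


@dataclass
class IDEX:
    mem_read: bool  # True when the instruction now in EX is a load
    rt: int         # destination register of that load


@dataclass
class IFID:
    rs: int  # first source register of the instruction being decoded
    rt: int  # second source register of the instruction being decoded


def load_use_hazard(id_ex: IDEX, if_id: IFID) -> bool:
    """Return True when the instruction in ID must stall for one cycle."""
    return id_ex.mem_read and id_ex.rt in (if_id.rs, if_id.rt)


# lw $1, 0($2) in EX, add $3, $1, $4 in ID: the add reads $1, so it stalls.
print(load_use_hazard(IDEX(mem_read=True, rt=1), IFID(rs=1, rt=4)))
```

When the function returns True, the hardware would hold the PC and the IF/ID register and send a bubble down the pipeline in place of the using instruction.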

Answer 2

Stalls reduce performance but are required to get correct results

Answer 3

A 2-bit predictor uses a 2-bit saturating counter to track the history of branch outcomes, providing four states: Strongly Taken (11), Weakly Taken (10), Weakly Not Taken (01), and Strongly Not Taken (00). States 11 and 10 predict taken; states 01 and 00 predict not taken. Starting from a strong state, the prediction flips only after two consecutive mispredictions, so a single atypical outcome such as a loop exit does not change the prediction. This improves accuracy in loop constructs.

State diagram:

Strongly Taken (11) <-> Weakly Taken (10) <-> Weakly Not Taken (01) <-> Strongly Not Taken (00)

Transitions:
When the branch is not taken, the counter moves one state toward Strongly Not Taken (11 -> 10 -> 01 -> 00) and stays at 00 once there.
When the branch is taken, the counter moves one state toward Strongly Taken (00 -> 01 -> 10 -> 11) and stays at 11 once there.
Transitions follow the actual branch outcome, not whether the prediction was correct: moving from 01 to 00, for example, happens after a correct not-taken prediction.
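
The four states can be modeled as a saturating counter. The sketch below is illustrative only: the class and function names are made up for this example, and the loop pattern (nine taken outcomes followed by one not taken) is an assumed workload, not one from the card.

```python
class TwoBitPredictor:
    """Saturating counter: 3 = Strongly Taken (11), 2 = Weakly Taken (10),
    1 = Weakly Not Taken (01), 0 = Strongly Not Taken (00)."""

    def __init__(self, state=3):
        self.state = state

    def predict(self):
        # States 11 and 10 predict taken.
        return self.state >= 2

    def update(self, taken):
        # Move one step toward the observed outcome, saturating at 0 and 3.
        if taken:
            self.state = min(3, self.state + 1)
        else:
            self.state = max(0, self.state - 1)


def count_mispredictions(predictor, outcomes):
    misses = 0
    for taken in outcomes:
        if predictor.predict() != taken:
            misses += 1
        predictor.update(taken)
    return misses


# Assumed workload: a loop branch taken 9 times, then not taken, run 3 times.
outcomes = ([True] * 9 + [False]) * 3
print(count_mispredictions(TwoBitPredictor(), outcomes))  # prints 3
```

Each loop exit costs one misprediction and moves the counter to Weakly Taken, so the branch is still predicted taken when the loop is entered again.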

Answer 4

The 1-bit predictor has a significant shortcoming with nested loops because it remembers only the last outcome of a branch. In the given code, the inner loop branch is taken on every iteration except the last. On the last iteration the predictor still predicts taken, so the loop exit is mispredicted and the stored bit flips to not taken. When the inner loop starts again, the first iteration's branch is then mispredicted as not taken. The result is two mispredictions each time the inner loop runs: one at the end of the inner loop (predicted taken, actually not taken) and one at the start of its next execution (predicted not taken, actually taken).
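
A short simulation makes the double misprediction visible. This is a sketch under assumed parameters: an inner loop of ten iterations run three times, with the predictor bit initially set to taken. The function name is hypothetical.

```python
def one_bit_mispredictions(outcomes, initial=True):
    """Count mispredictions for a predictor that repeats the last outcome."""
    last = initial
    misses = 0
    for taken in outcomes:
        if last != taken:
            misses += 1
        last = taken  # a 1-bit predictor remembers only the latest outcome
    return misses


# Assumed workload: inner loop branch taken 9 times, then not taken, 3 runs.
inner_loop = [True] * 9 + [False]
print(one_bit_mispredictions(inner_loop * 3))  # prints 5
```

The first run of the inner loop mispredicts only its exit, because the bit starts as taken; every later run mispredicts both its first and last iteration, giving 1 + 2 + 2 = 5.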

Answer 5

Exception: An exception is an unexpected event arising within the CPU, such as an undefined opcode, overflow, or syscall.
Example: An attempt to execute an undefined opcode triggers an exception.
Handling: In MIPS, exceptions are managed by the System Control Coprocessor (CP0). The processor saves the PC of the offending instruction in the Exception Program Counter (EPC) and the cause of the exception in the Cause register. The processor then jumps to a predefined handler address (e.g., 8000 0180 hex).

Interrupt: An interrupt is an unexpected event from an external I/O controller.
Example: An external device signaling the completion of an I/O operation triggers an interrupt.
Handling: Interrupts are often handled with vectored interrupts, where the handler address is determined by the cause. The same scheme can serve exceptions: different causes such as undefined opcode or overflow each have their own handler address (e.g., C000 0000 for undefined opcode, C000 0020 for overflow).

Answer 6

In the MIPS architecture, exceptions are managed by the System Control Coprocessor (CP0). When an exception occurs:
1. The PC of the offending or interrupted instruction is saved in the Exception Program Counter (EPC).
2. The Cause register saves an indication of the problem (e.g., 0 for undefined opcode, 1 for overflow).
3. The processor jumps to the exception handler located at a fixed address (e.g., 8000 0180).
The handler can then take appropriate action based on the type of exception and resume normal execution.
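
The sequence above can be sketched as a toy model. The CPU class, its field names, and the starting PC value are assumptions made for illustration; the cause codes 0 and 1 follow the simplified encoding used on this card, not the full MIPS Cause register layout.

```python
from dataclasses import dataclass

HANDLER_ADDRESS = 0x8000_0180  # fixed handler entry point from the card
CAUSE_UNDEFINED_OPCODE = 0     # simplified cause codes, as on the card
CAUSE_OVERFLOW = 1


@dataclass
class CPU:
    pc: int = 0
    epc: int = 0
    cause: int = 0


def raise_exception(cpu: CPU, cause: int) -> None:
    cpu.epc = cpu.pc          # save the PC of the offending instruction
    cpu.cause = cause         # record why the exception happened
    cpu.pc = HANDLER_ADDRESS  # transfer control to the handler


cpu = CPU(pc=0x0040_0020)  # hypothetical address of the faulting instruction
raise_exception(cpu, CAUSE_OVERFLOW)
print(hex(cpu.epc), cpu.cause, hex(cpu.pc))  # 0x400020 1 0x80000180
```

With a single entry point like this, the handler software has to read the Cause register before it can decide what to do.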

Answer 7

Vectored interrupts determine the handler address from the cause of the interrupt, allowing a faster response. Instead of jumping to a single handler address and then determining the cause, the processor jumps directly to the appropriate handler.
Example:
Undefined opcode: handler address C000 0000.
Overflow: handler address C000 0020.
Advantage: This reduces the overhead and latency of handling interrupts, because the handler does not first have to read the cause and dispatch to the correct routine.
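
One way to picture the vectored scheme is as a simple address computation. The sketch below assumes that causes are numbered 0 (undefined opcode) and 1 (overflow) and that handlers sit 0x20 bytes apart; that spacing is inferred from the two example addresses only and is not stated on the card.

```python
VECTOR_BASE = 0xC000_0000
VECTOR_SPACING = 0x20  # assumption: inferred from C000 0000 and C000 0020

CAUSE_UNDEFINED_OPCODE = 0  # hypothetical cause numbering
CAUSE_OVERFLOW = 1


def vectored_handler_address(cause: int) -> int:
    """Map a cause number straight to its handler's entry address."""
    return VECTOR_BASE + cause * VECTOR_SPACING


print(hex(vectored_handler_address(CAUSE_UNDEFINED_OPCODE)))  # 0xc0000000
print(hex(vectored_handler_address(CAUSE_OVERFLOW)))          # 0xc0000020
```

Under this assumption each handler has room for 0x20 bytes of code, so a longer handler would have to start with a jump to the rest of its routine.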

Answer 8

The forwarding unit mitigates data hazards by letting an instruction use a result before it has been written back to the register file. This bypassing technique reduces stalls and improves performance.
In the given example:
The ADD instruction writes its result to R1.
The SUB instruction needs the value of R1 in the very next cycle.
Without forwarding, the SUB instruction would stall until R1 has been updated in the register file.
With forwarding, the ADD result held in the EX/MEM pipeline register (the ALU output) is routed directly to the ALU input for the SUB instruction, preventing the stall.
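
The selection logic of a forwarding unit can be sketched as follows. This is a minimal model, not a full datapath: StageReg, forward_select and the returned labels are illustrative names, and the check that register 0 is never forwarded assumes a MIPS-style hardwired zero register.

```python
from dataclasses import dataclass


@dataclass
class StageReg:
    reg_write: bool  # True if the instruction in this stage writes a register
    rd: int          # its destination register number


def forward_select(src: int, ex_mem: StageReg, mem_wb: StageReg) -> str:
    """Pick where the ALU should take operand register `src` from."""
    # The most recent result wins, so EX/MEM is checked first.
    if ex_mem.reg_write and ex_mem.rd != 0 and ex_mem.rd == src:
        return "EX/MEM"
    if mem_wb.reg_write and mem_wb.rd != 0 and mem_wb.rd == src:
        return "MEM/WB"
    return "REGFILE"


# ADD R1, R2, R3 is in MEM while SUB R4, R1, R5 is in EX.
ex_mem = StageReg(reg_write=True, rd=1)
mem_wb = StageReg(reg_write=False, rd=0)
print(forward_select(1, ex_mem, mem_wb))  # EX/MEM
print(forward_select(5, ex_mem, mem_wb))  # REGFILE
```

Checking EX/MEM before MEM/WB matters when both stages hold a write to the same register: the younger instruction's value is the one the program order requires.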

Answer 9

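The hazard detection unit's primary role is to identify and manage data hazards in the pipeline that forwarding alone cannot resolve, so that instructions still execute correctly. Its functions include:
Detecting read-after-write (RAW) hazards, in particular load-use hazards.
Introducing stalls by holding the PC and the IF/ID register for one cycle.
Inserting a bubble (NOP, no operation) into the pipeline in place of the stalled instruction.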

Answer 10

Instruction 1: ADD R1, R2, R3
No forwarding needed as it is the first instruction.

Instruction 2: SUB R4, R1, R5
Forwarding needed from ADD's ALU result to SUB's ALU input for R1.
Forwarding path: EX/MEM -> ALU input (ADD is one instruction ahead).

Instruction 3: AND R6, R1, R7
Forwarding needed from ADD's ALU result to AND's ALU input for R1.
Forwarding path: MEM/WB -> ALU input (ADD is two instructions ahead).

Instruction 4: OR R8, R4, R9
Forwarding needed from SUB's ALU result to OR's ALU input for R4.
Forwarding path: MEM/WB -> ALU input (SUB is two instructions ahead, so its result is already in MEM/WB when OR reaches EX).

Answer 11

Instruction 1: LW R1, 0(R2)
Loads a value into R1.

Instruction 2: ADD R3, R1, R4
Data hazard: Needs R1, which is loaded by the LW instruction.
Resolution: Stall for one cycle, then forward the loaded value from the MEM/WB pipeline register of the LW instruction.

Instruction 3: SUB R5, R3, R6
Data hazard: Needs R3, which is computed by the ADD instruction.
Resolution: Forward from the EX/MEM pipeline register (ADD's result) to SUB's ALU input.

Instruction 4: AND R7, R5, R8
Data hazard: Needs R5, which is computed by the SUB instruction.
Resolution: Forward from the EX/MEM pipeline register (SUB's result) to AND's ALU input.
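
The analysis above can be reproduced with a small scheduler. This is a rough sketch for a classic five-stage pipeline with full forwarding; the schedule function and the tuple format (op, dest, sources) are invented for this example, and stores or branches are not modeled.

```python
def schedule(program):
    """program: list of (op, dest, sources). Returns (stalls, forwards)."""
    ex_cycle = []  # cycle in which each instruction occupies EX
    stalls = 0
    forwards = []  # (instruction number, register, forwarding source)
    for i, (op, dest, sources) in enumerate(program):
        cycle = 3 if i == 0 else ex_cycle[-1] + 1
        if i > 0:
            prev_op, prev_dest, _ = program[i - 1]
            if prev_op == "LW" and prev_dest in sources:
                cycle += 1  # load-use hazard: one bubble
                stalls += 1
        ex_cycle.append(cycle)
        for src in sources:
            # Find the most recent earlier instruction that writes src.
            for j in range(i - 1, -1, -1):
                if program[j][1] == src:
                    gap = cycle - ex_cycle[j]
                    if gap == 1:
                        forwards.append((i + 1, src, "EX/MEM"))
                    elif gap == 2:
                        forwards.append((i + 1, src, "MEM/WB"))
                    break  # a larger gap reads the register file
    return stalls, forwards


program = [
    ("LW", "R1", ["R2"]),
    ("ADD", "R3", ["R1", "R4"]),
    ("SUB", "R5", ["R3", "R6"]),
    ("AND", "R7", ["R5", "R8"]),
]
print(schedule(program))
# (1, [(2, 'R1', 'MEM/WB'), (3, 'R3', 'EX/MEM'), (4, 'R5', 'EX/MEM')])
```

The rule of thumb it encodes: a producer one EX cycle ahead is forwarded from EX/MEM, a producer two EX cycles ahead from MEM/WB, and anything older is read from the register file.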

Answer 12

Instruction 1: LW R2, 0(R3)
Loads a value into R2.

Instruction 2: ADD R4, R2, R5
Data hazard: Needs R2, which is loaded by the LW instruction.
Calculation: LW produces R2 at the end of its MEM stage. ADD needs R2 at the beginning of its EX stage. Insert one stall cycle, then forward the loaded value from the MEM/WB pipeline register to ADD's ALU input.

Instruction 3: SW R4, 4(R6)
Needs R4, which is produced by the ADD instruction, as the store data.
ADD is one instruction ahead, so its result is in the EX/MEM pipeline register while SW is in EX. Forward it from there so the correct store data is carried into SW's MEM stage.

Instruction 4: SUB R7, R4, R8
Needs R4, which is produced by the ADD instruction.
ADD is two instructions ahead, so its result is in the MEM/WB pipeline register while SUB is in EX. Forward from MEM/WB to SUB's ALU input.

(36 cards)