Lecture 4B: Modern CPU Hardware Architectures Flashcards
Define the term: Superscalar Processor.
A CPU that implements a form of parallelism called instruction-level parallelism within a single processor.
A superscalar processor can execute more than one instruction during a ____ ____ by simultaneously dispatching multiple ________ to different ________ units on the processor.
A superscalar processor can execute more than one instruction during a [clock cycle] by simultaneously dispatching multiple [instructions] to different [execution units] on the processor
What extra hardware is required to support Superscalar Processors?
Extra Functional Units are required to issue and execute multiple instructions in parallel.
Define the term: Superpipelining.
A technique for improving the performance of a processor by splitting instructions into many separate “pipelines” that can be executed more or less in parallel.
Superscalar Processors can execute only _______ instructions in parallel whereas _______ instructions are often _______.
Superscalar Processors can execute only [independent] instructions in parallel whereas [adjacent] instructions are often [dependent].
Name the limitations of Superscalar Processors
Can only process independent instructions, require extra hardware.
Define the term: Out-of-order execution.
Execute independent instructions in a different, more efficient order.
What happens if a hazard causes stall cycles?
All instructions up to the offending instruction are stalled.
What can help to prevent stall cycles?
Forwarding, branch prediction, and other techniques can reduce the number of stall cycles, but sometimes it is unavoidable.
What are speculative instructions?
All execution within the instruction window is speculative (i.e., side-effects are not applied outside the CPU until it is committed.
Advantages of Out-of-order execution.
Better performance.
> Exploit instruction-level parallelism (ILP)
> Hide latencies (e.g. L1 data cache miss, divide)
Disadvantages of Out-of-order execution.
Hardware is much more complex, expensive, requires a larger chip area and has higher power consumption.
Define the term: Dispatch. (out-of-order execution)
New instructions are added to the instruction window - reservation stations (RS)
What happens in a Reservation Station? (out-of-order execution)
Instructions wait for their inputs. If instruction operands are ready they are sent to the FU. Otherwise, check on the bypass network and wait for operands.
Functional units (FU) > ALUs, AGUs, FPU
What does a Bypass Network do? (out-of-order execution)
Broadcast computes values back to the reservation stations.
What does a Reorder Buffer do? (out-of-order execution)
It allows instructions to be committed in order.
De-speculates execution, mostly by committing instructions in order.
Flushes the speculative instructions when a misprediction is discovered.
What happens during the Commit process? (out-of-order execution)
- All execution within the instruction window is speculative (i.e., side-effects are not applied outside the CPU) until it is committed.
- Instructions can write to memory only when it is certain they should have been executed.
- Instructions must not affect the machine state while they are speculative.
- Instructions enter and leave the instruction window in program order, and instruction can only leave the window when it is the oldest instruction in the window and it has been completed.
- The instruction window is instantiated as RS & ROB.
What is required in order for the Store Buffer dispatch a store? (Out-of-Order execution)
The store buffer dispatches a store only when the store has both it’s address and its data ready and there are no older stores awaiting dispatch.
What are the features of a ‘Brainiac Design’ Processor?
Extra Hardware to achieve more Instruction Level Parallelism (ILP) out of code, millions of extra transistors, more design effort, consume more power.
What are the features of a ‘Speed Demon’ Processor?
Run at higher clock speeds because they are simpler, have simple hardware design, have Less chip area, less power consumption.
Define the term: ISA.
ISA provides commands to the processor, to tell it what to do, e.g., add, load, store.
Define the term: CISC.
Complex instruction set computer (CISC)
Note: Complex instructions can execute several low-level operations such as load from memory, arithmetic, and a memory store.
Define the term: RISC.
Reduced instruction set computer (RISC)
Note: Simple and 1 cycle instructions.
What are the differences between RISC and CISC?
CISC:
• CISC puts emphasis on hardware.
• CISC was developed to make compiler development simpler.
• CISC is typically used for general-purpose computers.
RISC:
• RISC puts emphasis on software.
• RISC was developed to make hardware simpler.
• Typically used in smartphones.