Architectures Flashcards
Difference between von Neumann architecture and Harvard architecture?
Harvard has two separate memory spaces for programs and data, and therefore two separate buses
vN only has one memory space for both, with only one bus (this is the litmus test for vN)
What are accumulators?
General purpose registers for holding data, interim and final results of calculations, holding data ready to be transferred between I/O and main memory or between different memory locations
What is the Program Counter?
Holds the address of the next instruction to be executed
What is the Instruction Register?
Holds the actual instruction being executed
What are flags?
1 bit registers used to keep track of special conditions, such as:arithmetic carry, overflow, power failure, internal computer error
They are grouped together in one or more Status Registers
What is the Memory Address Register?
Register that holds the address of a memory location to be accessed, connected to the memory with a one-way bus
What is the Memory Buffer/Data Register?
Register that holds the data being stored in/retrieved from memory location specified in the MAR, connected to the memory with a two-way bus
What is a bus?
A physical connection (group of electrical conductors) from one part of the system to another, used for transferring data. They are used to transfer data inside the CPU, between CPU and RAM, and between CPU and components
What are the four categories of buses?
Data, Address, Control, Power
Point-to-point buses carry the signal from a specific source to a specific destination
Broadcast buses carry the signal from a specific source to multiple destinations
Three types of MIPS instructions
R-type: register operands
I-type: immediate operand
J-type: for jumping
R-type instruction format
Human format: Opcode, Dest, Source1, Source2, ShiftAmount, Function
e.g. add, $s0, $s1, $s2
Internal format: Opcode, Source1, Source2, Dest, ShiftAmount, Function
6, 5, 5, 5, 5, 6
e.g. 0, 17, 18, 16, 0, 32
opcode is always 0 for R-type instructions
add, sub, etc. are different functions for R-type
I-type instruction format
Human format: Opcode, Dest, Source, Argument
Example: addi $s0, $s1, 5
Internal format: Opcode, Source, Dest, Argument
6, 5, 5, 16
Example: 8, 17, 16, 5
J-type instruction format
Opcode, Address
6, 26
Jumps to the address specified in the 26-bit address operand
What are the four parts of the 4GB MIPS address space?
Text, global data, dynamic data, reserved
Text segment
Stores the program in memory, maximum capacity 256 MB, first four significant bits of any address in this area are 0000
Global data segment
Stores global variables, capacity of 64 KB
Global variables are accessed using the pointer $gp
Dynamic data segment
The dynamic data segment stores datat hat are dynamically allocated and deallocated throughout the execution of the program.
Data in this segment are stored in a stack and a heap.
Register only addressing
Uses registers for all source and destination operands
R-type instructions use Register Only adddressing
Immediate addressing
Uses registers and a 16-bit immediate
Some of the I-type instructions use this
Base addressing
The base address in the register rs is added to the contents of the immediate
Used in memory access instructions (subset of I-type instructions)
Memory access instructions
lw destRegister offset source = load word
lb = load byte
sw sourceRegsiter offset dest = store word
sb = store byte
word = 32 bits = 4 bytes
Example: sw $s3, 4($0) means write $s3 to Word 1
PC-relative addressing
0x40 loop: __________
…
0x54 bne $t1, $0, loop
0x58 _____________
…
bne is an I-type instruction. To calculate the imm, subtract the target address from the PC value immediately after the branching instruction and divide by 4. (0x58 - 0x40)/4 = -6
Therefore imm = -6 = 1111 1111 1111 1010
Pseudo-direct addressing
This is used to calculate the new value of the Program Counter from a J-type instruction
2 least significant bits and 4 most significant bits are left to 0
j 0x004000A0 = j 0000 0000 0100 0000 0000 0000 1010 0000 = j 0000 0100 0000 0000 0000 1010 00
Call and return in MIPS
0x00400200 main: jal simple
0x00400204 …
…
0x00401020 simple: jr $ra
“jal simple” jumps to “simple” like j would do, but also stores in $ra where the program should return after executing “simple”
“jr $ra” jumps to the address stored in $ra
This is R-type, not J-type
Arguments and return value in MIPS
The caller places arguments into registers $a0 to $a3
Makes a call using jal
The return value is placed in registers $v0 or $v1
Input in MIPS
li $v0, 5 # load 5 into $v0
syscall # make system call with code 5
move $t0, $v0 # move result (in $v0) to $t0
syscall services
$v0 = 1, print integer $a0
$v0 = 4, print string $a0
$v0 = 5, read integer and store in $v0
$v0 = 10, exit
Process of translating and starting a program
The compiler translates high-level code into assembly
The assembler turns assembly code into machine language code (object file)
The linker combines the object file with other machine language code (e.g. from already compiled and assembled libraries)
The loader puts the executable in memory
Assembler process
The assembler makes two passes through the assembly code. First it assigns instruction addresses and puts all symbols in a table. Then it generates the object file.
Microcontrollers and microprocessors
Microcontrollers are integrated devices (computers on a chip), and consist of:
- A microprocessor
- Memory
- I/O
They are used in embedded systems (special purpose computer systems designed to perform dedicated functions)
Programs are usually stored in flash memory, often forever
AVR registers
32 general purpose 8-bit registers as part of the ALU (R0-R31)
The six 8-bit registers R26-R31 should be treated as three 16-bit registers used for addressing
AVR memory
Data memory = SRAM
Program memory = Flash
Data storage = EEPROM
AVR flash memory
Divided into two sections: Boot Loader Section and Application Program Section
Program Counter is 14 bits wide, AVR instructions are 16 or 32 bits wide
AVR EEPROM
1KB of EPPROM, organised as a separate space for storing data
Organised into pages consisting of 4 bytes
AVR SRAM
Divided into three sections:
Mapping space of ALU registers
I/O registers and extended I/O registers
Internal SRAM
AVR types of addressing
Direct addressing reaches the entire data space
Registers X, Y and Z (R26 to R31) are the indirect addressing pointer registers
Indirect with displacement mode reaches 63 address locations from the address in the pointer register
In register indirect addressing mode with pre-decrement or post-increment, the address registers X, Y, and Z are decremented or incremented
AVR single register addressing
Opcode Operand1 (11, 5)
Operand1 is contained in register d (0 <= d <= 31)
Example: NEG R7
AVR two register addressing
Opcode Operand1 Operand2 (6, 5, 5)
Result stored in Operand2
Example: ADD R3, R4 (R3 := R3 + R4)
AVR direct data addressing
Instructions are two words long (32 bits), meaning the program counter will be incremented by two
Operand DestOrSourceReg DataAddress
0 <= DataAddress <= 65535
Example: LDS, R3, 2000 (loads a byte from data address $2000 to register $r3)
Indirect addressing
The operand address is the contents of the X, Y, or Z register
Pre-decrement: X, Y, Z register decremented before the operation
Post-increment: X, Y, Z register incremented after the operation
Indirect addressing with displacement
Operand address is the result of the contents of the Y or Z register, added to the address contained in six bits of the instruction word (q)
Rd/Rr specifies the destination or source register
Example of indirect addressing
LD loads one byte from the address in the Y or Z register, LDD does the same but with displacement.
LD Rd, Z [Rd <= (Z)]
LD Rd, Z+ [Rd <= (Z), Z <= Z+1]
LD Rd, -Z [Z <= Z-1, Rd <= (Z)]
LDD Rd, Z+q [Rd <= Z+q]
Direct program memory
Program execution continues at the address immediate in the instruction words
Example: JMP <label>
<label> is the label of the instruction at address k
PC <= k (0 <= k <= 4,000,000)</label></label>
Relative program memory
Program execution continues at address PC+k+1 (-2048 <= k <= 2047)
Example:
RJMP <label>
PC <= PC + k + 1</label>
Directives
.equ assigns a value to a constant
Example: .equ delayVal, 10000
.global deeclares a function as global
Example: .global start
SBI instruction
Sets bit in I/O register to 1
Syntax: SBI A,b (0 <= A <= 31, 0 <= b <= 7)
CBI does the same thing except it sets the bit to 0
Status register
An 8-bit register part of the CPU. Its bits are used as flags.
Reading values from the input pin
btnLED:
SBIC PIND, 2
RJMP blink
RJMP btnLED
The status of the input pin is stored in a bit named (PIND,2)
The bit DDRD,2 controls whether D02 is used as an input or output pin
The SBIC instruction checks if a bit in an I/O register is 0, and skips an instruction if it is
Controlling the output pin
SBI DDRB,4 sets pin D12 as the output pin (bit 4 in the I/O register <= 1)
Its behaviour is now controlled PORTB,4
When PORTB,4 is 1, the LED is on
SUBI instruction in AVR
SUBI Rd, x (Rd -= x)
If the result of the subtraction is zero, the value in the Z bit of the status register will be 1
Conditional branching in AVR
BRNE label: branches to label if Z is equal to 1
CISC architecture
Complex Instruction Set Computers:
Large number of specialised instructions
Wide variety of addressing modes
Instruction length varies
Comparatively few general purpose registers
Examples: IBM mainframe, x86
RISC architecture
Reduced Instruction Set Computers (RISC)
Limited and simplified instruction set, but very efficient
Register-oriented instructions with limited memory access
Fixed length and format of instructions
Limited addressing modes
Large bank of registers
CISC vs RISC comparison
RISC used to be faster but produce bigger programs
Nowadays RISC and CISC have become more similar, and have similar performance
Key things to consider when approaching a new architecture
Data word length
Registers
How memory is organised
Instructions
Pipelining
Separate execution units for different types of instruction to allow parallel execution of unrelated instructions
Branch problem solutions
When we reach a branch statement we don’t know what the next instruction will be
Solutions: Work on both outcomes until the branch is resolved, predict the branch outcome based on previous visits or programmer hint, requiring the following instruction to not be dependent on the branch, analyse upcoming instructions and do the ones that do not depend on the current pipeline first
Superscalar processing
Specialist units for different types of instruction, several for the most common instructions
Each unit has a pipeline, if they’re all filled then the overall processor can execute more than one instruction per clock cycle
Superscalar processing complications
Conflicts for CPU resources, dependencies on prior results may be lost, instructions may complete in the wrong order, branch problems