Comp 1313 Systems 1 Flashcards

Question

What can ruin pipelining?

Answer 1

Multiple streams Prefetch Branch Target Loop buffer Branch Prediction

Answer 2

Prefetch the branch target and a few instructions after before the actual branch

Answer 3

Very fast memory that store the last n instructions, maintained in the fetch stage of the pipeline. Check the buffer before fetching, and then load the buffer instead. Good for small loops

Answer 4

Predict where you're going to branch. This involves predict that you either will always branch here or will never branch here. If so we either prefetch the next instruction or the branch instructions.

Answer 5

Predicting by Opcode. As some are more likely to jump than others (75% success rate) Taken/Not taken switch, use previous execution history to detirmine if it's going to jump.

Answer 6

Operators have different meanings in different algebras.

Answer 7

Eliminates negative values Simplifies comparison operations Symmetrical range about zero Facilitates floating point Simpler conversion between systems

Answer 8

Avoids the double representation of zero

Answer 9

Flip every bit and then add one

Answer 10

Add the bias to the original number and then convert to an unsigned binary number

Answer 11

Shifting by a bias so that 0000... would be -128 and 1111... would be 128

Answer 12

Doing simple commands at the same time using multiple arithmetic units. Basically not vector or array logic

Answer 13

When one instruction is dependant on another instruction happening previously and thus cannot execute together.

Answer 14

Can't execute instructions after a branch with instructions before, otherwise we waste processor time.

Answer 15

Two or more instructions request the same resource at the same time.

Answer 16

By duplicating the resource

Answer 17

Instructions in a sequence are all independent of each other, so their execution can be overlapped. Governed by data and procedural dependency.

Answer 18

Ability to take advantage of instruction parallelism.

Answer 19

Where the order of instructions, fetched, executed, memory and registers changed.

Answer 20

Issuing instructions in the order the occur This is inefficient Instructions could stall if required

Answer 21

Decouple, decode pipeline from the execution pipeline. Continue to fetch and decode until the pipeline is full, when a unit becomes available then we use it.

Answer 22

We need additional logic to ensure that our code isn't destroyed by being executed out of order.

Answer 23

One instruction cannot happen before another, as the first instruction modifies the other instruction's operands.

Answer 24

Avoid antidependancy by dynamically allocating registers as they are needed to have copies of the original values before have the code reference that version of the registers. These are stored in register specifically for this. Avoiding pipeline stalls.

Answer 25

Duplication of resources with out-of-order issues. Need instruction window length large enough to "see" instructions incoming.

Answer 26

If there is a unit free we can do instruction that may be needed. And dispose of their results if they are not. Out of order execution can provide this, but, leads to the meltdown vulnerability.

Answer 27

Microsoft Windows and UNIX-like

Answer 28

An easy and convenient interface with the system. It is essentially a mediator between programs and hardware.

Answer 29

To protect users and programmers from having to work with horrendous complexity. (IP, Compilers, Drivers) To protect users and programmers from the details of the hardware through an interface.

Answer 30

Memory management Task management File management Device management

Answer 31

Allocation of memory and management of virtual memory Also restricts access helping with programming errors and malware.

Answer 32

Launching processes Maintaining the process table in memory Performing time slicing and context switching Handling interrupts

Answer 33

Respond to program request to open files Set and check permissions Handle buffering

Answer 34

Use drivers to respond to request to use devices.

Answer 35

Memory protection Timer Privileged instructions Interrupts

Answer 36

Making effective use of the processor because the processor is much faster than memory or I/O devices, so scheduling is required so that other task can be completed while one waits.

Answer 37

The state of a running process is saved, and another process given processor resources

Answer 38

Process Control Block Comes with an: Identifier State Volatile environment Priority I/O status Accounting information.

Answer 39

Long-term Medium-term Short-term I/O

Answer 40

When space becomes available processes are loaded from disk and when they are stalled or finished they are removed.

Answer 41

How do we distribute a set of processes (of unknown and varying sizes) onto fixed memory.

Answer 42

Partitions of memory of a fixed size

Answer 43

Partitions allocated as required, may lead to fragmentation

Answer 44

Logical Address, location relative to beginning of the program Physical address, the actual location in memory Base address, current starting location of the process

Answer 45

We divide memory into lots of small, equal chunks (frames) Divide processes into chunks (pages) of the same size as these frames Then we can map pages to frames efficiently (in terms of memory)

Answer 46

Through the page table

Answer 47

Page tables show exactly what pages belong to what process, by translating base address to physical address.

Answer 48

Each page of a process is swapped in only when it is needed. This makes it possible for a program to be larger than memory as only pages required are loaded, a page fault is triggered to inform the OS that a new page is required.

Answer 49

Advantages: More processes can be maintained Time is saved Disadvantages: This uses swapping, so one page in and one out. Thrashing can occur - Where the processor spends most of it's time swapping pages.

Answer 50

Translation lookaside buffer Every logical access requires two physical access, page table entry and the actual access Most systems have cache reserved for TLB

Answer 51

Allowing the programmer to view memory as a series of address spaces or segments. Good for handling growing data structures Recompilation independently without requiring an entire set of programs to be recompiled Sharing among processes Protection

Answer 52

One core can look like two with extra instructions but sharing execution units allowing separate threads to run.

Answer 53

Using a separate microcontroller to monitor power Can shut-off cores and boost cores when required.

Answer 54

We can turbo-boost a core for short burst or if we are only running a single core. Used widely. 1 or 2 cores active at 5Ghz or 3-4 active at 4.8 or 5-8 active at 4.7

Answer 55

Directly to the CPU.

Answer 56

Non-Uniform Memory Access Multiprocessor systems with separate blocks of RAM to reduce bottlenecks and is good with large numbers of cores

Answer 57

Neural Processing Unit specialist unit in a CPU for neural net calculations

Answer 58

Using a mix of performance and efficency cores. As processes using less resources can run on E cores E cores are much smaller than P cores

Answer 59

Multiple computer that are connected together and can share information/resources

Answer 60

Wide Area Network Local Area Network Metropolitan Area Network Personal Area Network

Answer 61

OSI TCP/IP Abstraction, so we don't need to know about the hardware.

Answer 62

Application Transport Internet Network Access (Link)

Answer 63

The network is responsible for best-effort connections End-hosts are responsible for reliability and security

Answer 64

Deals with local link With a unique MAC address Ethernet

Answer 65

Handles next-hop routing provides unique addressing Passes to the correct devices, transport layer.

Answer 66

32 bit IP address Variable length header with a minimum of 20-bytes We have run out of IPv4 addresses

Answer 67

Sharing one IPv4 address between multiple computers Breaks the end-end principle Doesn't solve the exhaustion problem

Answer 68

128-bit address with a 40-bytes header Written in hex

Answer 69

A logical subdivision of a network

Answer 70

When there is a change in IP spaces, at the internet layer, packets are changed to have new IP addresses and converted between networks.

Answer 71

Provides host-to-host communication using TCP and UDP

Answer 72

Acknowledgements Guanteed arrival in correct order 20-bytes header

Answer 73

No acknowledgements required No guarrentee on order 8-bytes header

Answer 74

Preventing a fast sender overwhelming a slow responder

Answer 75

Reduces send rate to cope with network congestion.

Answer 76

Sliding window protocol The sender should only send if the receiver indicates that it has suitable buffer space

Answer 77

Sender sends a small packet and increases size until a packet is lost. Sender restarts the cycle of sending with a lower threshold

Answer 78

Software that uses the networks, generally using pre-made libraries themselves.

Answer 79

A protocol used for diagnostic and control purposes or generated in response to error in IP

Answer 80

Port number

Answer 81

Address Resolution Protocol Neighbour Discovery Protocol Operates at the link layer Translates IP addresses to MAC addresses

Answer 82

Domain Name Service provides a way to map symbolic domain names to an IP address Reliable and resilient distributed service At the application layer

Answer 83

Hubs A multi-port repeater All packets sent to all connected devices Switches connect multiple devices on one network segment Switches are at the Link layer They forward only to the specific required port

Answer 84

So network owners need to know what is happening Government want to know what people are doing

Answer 85

VPNS create a logs anyway and their exit points can be monitored ToR- The Onion Router Routing traffic through a random series of hops, all encrypted.

Answer 86

List of all commands that a processor can execute It is per processor

Answer 87

Operation Code Source operand reference Result reference Next instruction reference (usually implicit)

Answer 88

Data processing Data movement Program flow

Answer 89

For shifting: Bit masks Unpacking data Fast integer arithmetic For rotating: cryptography

Answer 90

May be specific instructions May be done with data movement instructions May be done by a separate controller

Answer 91

More addresses: More complex instructions More egisters Reg-Reg operations are quiker Fewer instruction per program Fewer addresses: Less complex instructions More instructions per program Faster fetch/execution of instructions Comprimise!

Answer 92

Complex Instruction Set Computing Multiple cycles per instruction Reduced Instructions Set Computing One cycle per instruction (usually)

Answer 93

There is no consistency in ordering bytes

Answer 94

Most significant byte in the lowest numerical address Memory dumps are left to right Stores strings and integers in the same order But has to perform an extra operation to convert 32 bit to 16 bit addresses

Answer 95

Least significant byte in the lowest address

Answer 96

Accumulator Stack Register-memory Register-Register

Answer 97

The accumulator is the input and output store A can be loaded B can then be added The result can be stored

Answer 98

Operands are pushed onto a stack and then an instruction POPs them off, performs the operations and pushes them back.This requires an extra pointer, and memory transfer requires extra operations.

Answer 99

Operand LOADed from memory to registers Instructions use operands stored in registers Memory transfer requires extra operations.

Answer 100

Accumulator Short instructions High memory traffic Single temporary storage location Stack Simple Short instructions Stack cannot be randomly accessed Bottleneck in stack Register Easy code Compiler optimisations Fast access to temporary values Operand must be names Longer instructions

Answer 101

Shows information about an instruction, if problems occured, the result of previous instructions, etc.

Answer 102

The return address of a branch is placed in the link register so that a subroutine can return back

Answer 103

This allows the occurance of multiple subroutines.

Answer 104

Time required for memory to recover before next access

Answer 105

Register L1 Cache L2 Cache Main Memory Disk cache Disk (SSD) HDD Optical Tape

Answer 106

Dynamic RAM Requires refreshing or charges will leak Simple and small, less expensive 25GB/s DDR5 Two 32 bit channells 38GiB/s up 96GB

Answer 107

Static RAM Bits stored as on/off gates using 4-6 transistors No charge leak so no refreshing More expensive and more complex

Answer 108

Read-only memory BIOS anbd basic system programs

Answer 109

DRAM looses data, 25K failures per Mbit per billion hours

Answer 110

Hard Permanent defect Soft Random, non-destructive No permanent damage Detected and maybe fixed by Error Correcting Code

Answer 111

Small block of fast SRAM on the CPU where memory requests are sent (NOT DRAM)

Answer 112

As DRAM takes at least 5 clock cycles to provide data

Answer 113

CPU reads a memory location Address goes to cache If present cache will provide (hit) Otherwise, block read required from RAM to cache (miss) Then this is delivered

Answer 114

As they have different access patterns

Answer 115

As bigger caches have longer latency so they need to be split

Answer 116

N words of equal length with unique address

Answer 117

Reads instructions and data Sends controls signals to othe units Defines a chip plug

Answer 118

Control/Address/Data PCIe Serial ATA Universal Serial Bus

Answer 119

A common communication pathway Signals might be separate, multiplexed, serialised This convers the conventual parallel bus.

Answer 120

Maximum memory capacity

Answer 121

Control and timing information Controls access to other buses

Answer 122

1. Obtain the use of the bus 2. Transfer data 3. Synchronise and acknowledge

Answer 123

Peripheral Component Interconnection Express Serial bus with multi- GiBytes/s lanes v5 - 4 GiByte/s Uses 16 lanes for GPU cards

Answer 124

Packets are sent over serial links ACK/NACKS protocol used for data safety Using multiple lanes to gain bandwidth It is similar to a network with layers and addressing

Answer 125

SATA - 500MB/S SAS - 1.2GB/S Fibre channel - 100 MB/S - 25 GB/S (long distance) SCSI 10 - 640 MB/S iSCSI SAS

Answer 126

Universal Serial Bus For low-high speed I/O devices Allows for 127 devices USB4 - 10-40 Gbit/s

Answer 127

Assumes a root hub connected to the main bus Cables have 4 wire 2 data lines 0 is a transition in voltage and 1 is the absence of a transition Every 1 msec, the hub broadcasts a frame

Answer 128

Control Isochronous Bulk Interrupt

Answer 129

Basic input/output system Firmware on the motherboard to start and test hardware and boot OS. Stored in flash and looks for a bot-loader

Answer 130

A replacement for BIOS to boot services and runtime services and can have graphics

Answer 131

The paths/buses between componants

Answer 132

DDR5: 10GB/S PCIe V3: 1GB/S PCIe V4: 2GB/S V5: 4GB/S Nvme SSD: 5GB/S Sata SSD: 500MB/S 2.5Gbe: 200MB/S Wifi 6: 100MB/s Ethernet: 100MB/S DMI V4: 16GB/S HDD: 100MB/s CPU: ~50 Gflops

Answer 133

A metal disk coated in magnetic material They store data on multiple platers with one head per side, all aligned. Data is striped per cylinder to reduce head movement. Data is organised into concentric rings with gaps between rings.

Answer 134

By the seek time. Access time = seek time + latency

Answer 135

Disk throughput will often be slower than the connection speed of the wire, on disk-cache can be used to store whole tracks.

Answer 136

Mean Time Between Failures HDD: 114 yrs

Answer 137

Solid State Drive non-volatile NAND logic with fast access times. Over millions of writes the flash blocks can fail. File systems are used to deal with SSD problems and erase unneeded blocks, but only by block.

Answer 138

To perform interface addressing, error detection and correction. To change some bytes a block has to be read and modified.

Answer 139

70MiB/s read 15-30GiB average 128 GiB maximum

Answer 140

A collection of disks or tapes (blu-ray) which can be interchanged to read/write. Usually used for backup.

Answer 141

Internet Small Computer System Interface TCP/IP over normal Ethernet that backs up data Storage Area Network Block accessed with 16GBi/s fibre channel

Answer 142

A self-contained computer on a single chip featuring a slower clock.

Answer 143

They can't run a real operating system

Answer 144

Bare-metal programming Embedded OS Real-time

Answer 145

Write C code, compile it and then flash it to the micro-controller.

Answer 146

Used when the timing is crucial and must be guaranteed.

Answer 147

Integrating most of a computer onto a single chip. Includes radio, co-processor, interface drivers and more.

Answer 148

Software placed between the OS and hardware, that tells the OS what is and isn't hardware.

Answer 149

Using a hypervisor to emulate hardware, above the hardware.

Answer 150

A Virtual Machine Monitor is a layer of software that emulates the hardware of a complete computer system.

Answer 151

The machine code of the above OS may not be for the instruction set of the hardware.

Answer 152

Changing the guest OS so that it cooperates with the Virtual Machine.

Answer 153

Activate and deactivate the interrupts Change page tables Accessing virtualised peripherals

Answer 154

Allows the VMM to run privileged code. It is not translated.

Answer 155

New instruction that switches the CPU into non-root mode. Processor state is loaded from the guest state of the VM scheduled to run. The control transferred from VMM to the VM.

Answer 156

Saves the process state in the guest state area of the running VM. Loads the processor state from the host-state area. Transfer control to the VMM.

Answer 157

Linux device driver for hardware virtualisation.

Answer 158

Uses binary translation via Tiny code generator for efficient emulation.

Answer 159

Use the kernel of the host system to run only the code needed instead of having a full OS.

Answer 160

Freezing an older OS/demo service Trying new OS. Working on system installation scripts. Migrating between VMs Disaster recovery

Answer 161

Classification of computer architectures. Four classification based on number of instruction and data streams But vector processing is missing.

Answer 162

Do they same thing to many data objects Require special CPU hardware and supporting software.

Answer 163

Image processing Video processing Array/vector processing Text processing

Answer 164

Streaming SIMD Extensions

Answer 165

128-bit registers that can be packed with various data types.

Answer 166

Symmetric Multiprocessors A MIMD system where multiple CPUs share main memory and I/O Hardware manages contention.

Answer 167

Each processor has its own L1 and L2 cache Connected by a system bus Main Memory, I/O, etc are also connected to the bus.

Answer 168

Combining big performance cores with little efficiency cores.

Answer 169

Hardware multi-threading on superscalar CPUs. Execute multiple instructions at the same time using redundant execution units in the processor.

Answer 170

Split the code up, onto separate CPUs

Answer 171

Very easy to split tasks into parallel subtasks

Answer 172

Split the data to make independent parallel tasks

Answer 173

Video compression Video transcoding Image compression Modelling AI Number-crunching

Answer 174

A core containing a floating point unit and maybe an integer unit Could have a Special Function Unit for trigonometric operations

Answer 175

As the bandwidth needs to be huge.

Answer 176

A core for AI acceleration For fused multiply-add operations

Answer 177

Cards designed for number-crunching, especially in data centres don't have outputs

Answer 178

The number of transistors within a system doubles every two years

Answer 179

The speed of a system is limited by it's bottlenecks, there is a fancy equation for this.

Answer 180

Simultaneous multithreading

Answer 181

SISD: Single Instruction Single Data SIMD: Single Instruction Multiple Data MISD: Multiple Instruction Single Data MIMD: Multiple Instruction Multiple Data

Answer 182

A collection of Vertices, Edges and Faces

Answer 183

2 The router and broadcast

Answer 184

Where 0s are, but, have not been included

Answer 185

Single (32 bits) and double (64 bits)

Answer 186

15-17 digits

Answer 187

Convert your number to binary Normalise the number Determine the sign bit Calculate the exponent Calculate the mantissa Combine all the parts together

Answer 188

An on/off signal used by motors and LEDs.

Comp 1313 Systems 1 Flashcards

(222 cards)