CSCI 223 Midterm 2 Flashcards
3 mechanisms in procedures
- passing control (call, return)
- passing data (using either registers or memory)
- memory management (stack - allocate/deallocate)
x86-64 stack grows toward
lower memory addresses (down)
register ? contains address of top of stack
%rsp
stack operation: pushq Src
fetch (read) operand at Src, decrement %rsp by 8 (or 4 on a 32-bit machine), write operand at address given by %rsp
register ? contains address of the frame pointer
%rbp
stack operation: popq Dest
read value at address given by %rsp, increment %rsp by 8 (or 4 on a 32-bit machine), store value at Dest (must be a register)
procedure control flow
use stack
procedure call: push return address on stack, jump to label
procedure ret: pop address from stack, jump to address
return address
address of the next instruction right after call
return value
%rax (64-bit) or %eax (32-bit)
stack allocated in ?
frames (state for single procedure instantiation)
contents of stack frames
return information, local storage (if needed), temporary storage (if needed)
management of stack frames
space allocated when enter procedure (“set-up” code; includes push by call instruction) and deallocated when return (“finish” code; includes pop by ret instruction)
we create a new stack frame when
we enter a new function
calling conventions
caller saved and callee saved
caller saved
caller saves temporary values in its frame before the call
callee saved
callee saves temporary values in its frame before using, then restores them before returning to caller
%rax is the ?, ?-saved, and can be modified by ?
return value, caller-saved, procedure
%rdi…%r9 are the ?, ?-saved, and can be modified by ?
arguments, caller-saved, procedure
%r10, %r11 are ?-saved, and can be modified by ?
caller-saved, procedure
%rbx, %r12, %r13, %r14 are ?-saved, and can be modified by ?
callee-saved, callee must save and restore
%rbp is the ?, ?-saved, and can be modified by ?
frame pointer, callee-saved, callee must save and restore
%rsp is the ?, ?-saved, and can be modified by ?
stack pointer, callee-saved (special form), restored to original value upon exit from procedure
recursion is handled by ? calling conventions
normal (in the use of the stack)
char/unsigned char
1 byte
2^8 (256) numbers
short/unsigned short
2 bytes
2^16 numbers
int/unsigned int
4 bytes
2^32 numbers
long/unsigned long
8 bytes
2^64 numbers
integer encoding standards: unsigned integers
like converting binary to decimal: the sum of the number (0 or 1) times 2^i
integer encoding standards: two’s complement for signed integers
the most significant bit is a sign bit (0 = non-negative, 1 = negative) that also carries weight -2^(w-1); the remaining bits are summed like an unsigned binary number
to find the negative version of an integer
- find bit pattern for positive version
- flip all bits
- add 1
for a 16-bit integer, UMax (unsigned)
0xFFFF = 2^16-1 (need to know context to know difference between -1 and 2^16-1)
for a 16-bit integer, TMax (signed)
0x7FFF = 2^15-1
for a 16-bit integer, TMin (signed)
0x8000 = -2^15
for a 16-bit integer, -1
0xFFFF = -1 (need to know context to know difference between -1 and 2^16-1)
for a 16-bit integer, 0
0x0000 = 0 (always all zeroes for signed and unsigned)
the magnitude of TMin is equal to
TMax + 1
the value of UMax is equal to
2*TMax + 1
equivalence of unsigned/signed numeric values
same encodings for nonnegative values
uniqueness of unsigned/signed numeric values
every bit pattern represents a unique integer value; each representable integer has a unique bit encoding
constants by default are considered to be ? integers unless
signed; unless marked with a 'U' suffix
sign extension
given a w-bit signed integer x, convert it to a (w+k)-bit integer with the same value
rule for sign extension
make k copies of sign bit
floating point numbers
encode rational numbers of the form v = x*2^y
standard for representing floating point numbers (but not all)
IEEE 754
single-precision for IEEE 754
32-bit
double precision for IEEE 754
64-bit
IEEE 754 sign bit is
MSB
IEEE 754 next set of bits
exp
IEEE 754 last set of bits
frac
IEEE 754 formula
(-1)^s * 1.frac * 2^(exp - bias)
divide by two by shifting
right
multiply by two by shifting
left
IEEE 754 single precision bit allocation
sign bit = 1 bit
exp = 8 bits
frac = 23 bits
IEEE 754 double precision bit allocation
sign bit = 1 bit
exp = 11 bits
frac = 52 bits
normalized values for IEEE 754
when exp =/= 000…0 and exp =/= 111…1
bias for exp
2^(k-1) - 1
bias for exp for 32 bit, 64 bit
127, 1023
denormalized values for IEEE 754
exp = 000…0
cases:
frac = 000…0 represents zero value (+0, -0 depending on sign bit)
frac =/= 000…0 represents numbers very close to 0.0
special values for IEEE 754
exp = 111…1
cases:
frac = 000…0 represents infinity (both positive and negative depending on sign bit)
frac =/= 000…0 represents Not-a-Number (NaN; when no numeric value can be determined)
IEEE 754 zero
exp = 00...00 frac = 00...00
IEEE 754 smallest positive denormalized
exp = 00...00 frac = 00...01
IEEE 754 largest denormalized
exp = 00...00 frac = 11...11
IEEE 754 smallest positive normalized
exp = 00...01 frac = 00...00
IEEE 754 one
exp = 01...11 frac = 00...00
IEEE 754 largest normalized
exp = 11...10 frac = 11...11
floating point zero is (the same/different from) the integer zero
the same
rounding modes
towards zero, round down (towards -inf), round up (towards +inf), nearest even (default)
int to double conversion
exact conversion, since double's 53-bit significand can hold any int of 53 or fewer bits exactly
int to float conversion
will round according to rounding mode
double/float to int conversion
truncates the fractional part, like rounding toward zero; undefined when out of range or NaN
key features of RAM
traditionally packaged as a chip, basic storage unit is normally a cell (one bit per cell)
RAM stands for
Random-Access Memory
static RAM (SRAM)
used for cache; each cell stores a bit with a four or six-transistor circuit; retains value indefinitely, as long as it is kept powered; relatively insensitive to electrical noise (EMI), radiation, etc.; faster and more expensive than DRAM
dynamic RAM (DRAM)
used for main memory; each cell stores bit with capacitor; one transistor per bit; value must be refreshed every 10-100ms; more sensitive to disturbances than SRAM; slower and cheaper than SRAM
DRAM and SRAM are ? memories
volatile (lose info if powered off)
nonvolatile memories + examples
retain value even if powered off (ROM, PROM, EPROM, EEPROM, flash memory)
ROM
read-only memory; programmed during production
PROM
programmable ROM; can be programmed once after manufacturing
EPROM
erasable PROM; can be bulk erased (UV, XRay)
EEPROM
electronically erasable PROM; on our system; electronic erase capability
flash memory
EEPROMs with partial (sector) erase capability; wears out after about 100,000 erase cycles
uses for nonvolatile memories
firmware programs stored in ROM, solid state disks, disk caches
bus
a collection of parallel wires that carry address, data, and control signals; typically shared by multiple devices
memory read transaction
movl A, %eax
CPU places address A on the memory bus
main memory reads A from the memory bus, retrieves word x, and places it on the bus
CPU reads word x from the bus and copies it into register %eax
memory write transaction (more complicated)
movl %eax, A
CPU places address A on bus; memory reads it and waits for the corresponding word to arrive
CPU places data word y on the bus
main memory reads data word y from the bus and stores it at address A
explain the power wall
CPU clock rates increased until about 2003, then leveled off and even dropped: power dissipation limits forced designers to add more cores instead of raising clock speed
the CPU-memory performance gap
widens between DRAM, disk, and CPU speeds; the key to bridging this gap is locality
locality
programs tend to use data and instructions with addresses near or equal to those they have used recently
temporal locality
recently referenced items are likely to be referenced again in the near future
spatial locality
items with nearby addresses tend to be referenced close together in time
referencing array elements in succession is an example of ? locality
spatial
referencing the variable “sum” in each iteration is an example of ? locality
temporal
referencing instructions in sequence is an example of ? locality
spatial
cycling through a loop repeatedly is an example of ? locality
temporal
stride
the distance between two consecutively accessed addresses (e.g. stride-N, stride-1)
CPU registers hold ? retrieved from ?
words; L1 cache
L1/L2 cache holds ? retrieved from ?
cache lines; main memory
main memory holds ? retrieved from ?
disk blocks (“pages”); local disks
local disks hold ? retrieved from ?
files; disks on remote network servers
memory hierarchy
registers, L1 cache, L2 cache, main memory, local secondary storage, remote secondary storage
cache
a smaller, faster storage device that acts as a staging area for a subset of the data in a larger, slower device
fundamental idea of memory hierarchy
for each k, the faster, smaller device at level k serves as a cache for the larger, slower device at level k+1
why do memory hierarchies work?
because of locality, programs tend to access the data at level k more often than they access the data at level k+1. thus, the storage at level k+1 can be slower, and thus larger and cheaper per bit
big idea of memory hierarchy
creates an illusion of a large pool of storage that costs as much as the cheap storage near the bottom, but that serves data to programs at the rate of the fast storage near the top
cache hit
data block b is needed and found in the cache
cache miss
data block b is needed, not found in the cache, fetched from memory, then stored in cache
types of cache misses
cold (compulsory), conflict, capacity
cold/compulsory cache miss
occurs because the cache is empty
conflict cache miss
occurs when the level k cache is large enough, but multiple data objects all map to the same level k block
capacity cache miss
occurs when the set of active cache blocks (working set) is larger than the cache
cache memories
small, fast SRAM-based memories managed automatically in hardware that hold frequently accessed blocks of main memory
cache configurations
direct-mapped, E-way associative, fully associative
general cache organization
S, E, B
S = 2^s sets
E = 2^e lines per set
B = 2^b bytes per cache block
cache size C = SxExB data bytes
direct-mapped cache
1 line per set
number of lines = number of sets
fastest
E-way associative cache
E = # lines per set
fully associative cache
one set with all cache lines
cache read
locate the set; check whether any line in the set has a matching tag; if yes and the line is valid, it is a hit; locate the data starting at the block offset
cache block mapping: direct-mapped
block will always go to the same location (set index = block address % number of sets)
cache block mapping: fully associative
block can go anywhere
cache block mapping: E-way associative
block has E options in one set
what to do on a write hit
write-through (write immediately to memory) or write-back (defer write to memory until replacement of line; need a dirty bit to determine if the line is different from memory or not)
what to do on a write miss
write-allocate (load into cache, update line in cache) or no-write-allocate (writes immediately to memory)
typical selection for write hit/write miss
write-back + write-allocate
intel core i7 hierarchy has ? cores, each with ?
4; Registers, L1 d-cache, L1 i-cache, and L2 unified cache
the intel core i7 processor package holds
the 4 cores and the L3 unified cache (shared by all cores)
intel core i7 L1 i-cache and d-cache
32 KB, 8-way, access: 4 cycles
intel core i7 L2 unified cache
256 KB, 8-way, access: 11 cycles
intel core i7 L3 unified cache
8MB, 16-way, access: 30-40 cycles
intel core i7 block size
64 bytes for all caches
cache miss rate
fraction of memory references not found in cache; misses/accesses = 1 - hit rate
hit time
time to deliver a line in the cache to the processor (includes time to determine whether the line is in the cache)
miss penalty
additional time required because of a miss
average memory access time
AMAT = hit time + (miss rate * miss penalty)
99% hits is ? times as good as 97% hits
two
three primary strategies for cache block replacement policy
- random (or pseudo-random)
- least recently used (LRU)
- first in, first out (FIFO - oldest block is replaced)
cache optimization
- reduce miss rate
- reduce miss penalty
- reduce hit time
six basic cache optimizations
- larger block size to reduce miss rate
- larger cache to reduce miss rate
- higher associativity to reduce miss rate
- multilevel caches to reduce miss penalty
- give priority to read misses over writes to reduce miss penalty
- avoid address translation to reduce hit time
2:1 cache rule of thumb
a direct-mapped cache of size N has about the same miss rate as a 2-way set-associative cache of size N/2. This held in the three C’s figures for cache sizes less than 128 KB