Chapter 3 - Machine-Level Representation of Programs Flashcards

Question

Name the four instructions in the MOV class.

Answer 1

movb, movw, movl, and movq.

Answer 2

Two; one to copy the data into a register and another to copy it to the destination.

Answer 3

To move 64-bit immediate values, and the destination must be a register. N.B. movq can only handle 32-bit immediate values which are sign-extended to 64 bits.

Answer 4

The movz and movs class.

Answer 5

movzbw, movzbl, movzwl, movzbq, and movzwq.

Answer 6

A 4-byte source is automatically zero-extended when it is copied to an 8-byte destination, so the movl instruction already does what a movzlq instruction would do.
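
The contrast between zero extension (what movl does to the upper half of a 64-bit register) and sign extension (movslq, cltq) can be mirrored with C casts. This is a minimal sketch; the function names are made up for illustration.

```c
#include <stdint.h>

/* Mirrors movl into a 64-bit register: the upper 32 bits become 0. */
uint64_t zero_extend32(uint32_t x) {
    return (uint64_t)x;
}

/* Mirrors movslq / cltq: the upper 32 bits are copies of bit 31. */
int64_t sign_extend32(int32_t x) {
    return (int64_t)x;
}
```

For the same 32-bit pattern with the top bit set, the two functions give different 64-bit results, which is why the MOVS class needs an explicit movslq while the MOVZ class needs no movzlq.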

Answer 7

movsbw, movsbl, movswl, movsbq, movswq, and movslq.

Answer 8

The cltq instruction, which operates on the %rax register.

Answer 9

movl %eax, (%rsp) movw (%rax), %dx movb $0xFF, %bl

Answer 10

movb (%rsp,%rdx,4), %dl movq (%rdx), %rax movw %dx, (%rax)

Answer 11

movb $0xF, (%ebx) // Memory references require 8-byte registers. movl %rax, (%rsp) // Mismatch between instruction suffix and register ID movw (%rax),4(%rsp) // Only one operand can be a memory location.

Answer 12

movb %al,%sl // There is no register named sl. movq %rax,$0x123 // The destination cannot be an immediate value.

Answer 13

movl %eax,%rdx // The destination of the movl instruction cannot be an 8-byte register. movb %si, 8(%rbp) // Mismatch between instruction suffix and register ID.

Answer 14

S = char, D = int: movsbl (%rdi), %eax movl %eax, (%rsi) S = char, D = unsigned: movsbl (%rdi), %eax movl %eax, (%rsi)

Answer 15

S = unsigned char, D = long: movzbl (%rdi), %eax movq %rax, (%rsi) // Note the special trick with the first step; this relies on the fact that the high-order 4 bytes of the register will be cleared with this instruction. S = int, D = char: movl (%rdi), %eax movb %al, (%rsi)

Answer 16

S = unsigned, D = unsigned char: movl (%rdi), %eax movb %al, (%rsi) S = char, D = short: movsbw (%rdi), %ax movw %ax, (%rsi)

Answer 17

A "last-in, first-out" (LIFO) discipline.

Answer 18

The stack pointer is decremented by 8 (to allocate 8 bytes) and the quad word provided as an operand is written to the new top-of-stack address.

Answer 19

The quad word is read from the top-of-stack location and stored at the operand destination, then the stack pointer is incremented by 8 (to deallocate 8 bytes).

Answer 20

It sign-extends a 4-byte source to an 8-byte destination and takes no operands: it sign-extends the data in %eax and stores the result in %rax.

Answer 21

The leaq instruction.

Answer 22

``` x++ = INC x-- = DEC -x = NEG ~x = NOT ```

Answer 23

The instructions in these classes apply unary operations. They take a single operand which corresponds to both the source and the destination.

Answer 24

They take two operands: a source, and a destination (in that order).

Answer 25

SAL, SAR, SHL, SHR. The SAL and SHL are not unique; they both have the same effect.

Answer 26

long t = x + 4*y + 12*z;
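
One way to see how this expression breaks into leaq-sized steps is to write each step as a C statement of the form base + index*scale. The split below is a sketch of one plausible decomposition, not necessarily the exact sequence a compiler emits.

```c
/* Each statement has the shape base + index * {1,2,4,8},
   which is what a single leaq can compute. */
long scale(long x, long y, long z) {
    long t = x + 4 * y;   /* x + 4*y            */
    long u = z + 2 * z;   /* 3*z                */
    return t + 4 * u;     /* x + 4*y + 12*z     */
}
```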

Answer 27

long t = 10*y + z + x*y;

Answer 28

``` leaq 9(%rdx), %rax = 9+q leaq (%rdx,%rbx), %rax = p+q leaq (%rdx,%rbx,3), %rax = q+3*p ```

Answer 29

``` leaq 2(%rbx,%rbx,7), %rax = 2+8*p leaq 0xE(,%rdx,3), %rax = 14+3*q leaq 6(%rbx,%rdx,7), %rax = 6+p+7*q ```

Answer 30

An immediate value or the single-byte register %cl.

Answer 31

A 32-bit value can be shifted by at most 31 positions, so only the low-order five bits (2^5 = 32) of 0xF3 = [11110011] are used, i.e. [10011]. Thus, the shift amount is 19.
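
The masking can be checked with a few lines of C. This sketch covers only 32-bit and 64-bit operands; the helper name is hypothetical.

```c
#include <assert.h>
#include <stdint.h>

/* Shift count actually used when %cl holds `cl` and the operand
   is `width` bits wide (32 or 64 only in this sketch). */
unsigned effective_shift(uint8_t cl, unsigned width) {
    assert(width == 32 || width == 64);
    return cl & (width - 1);   /* keep the low 5 or 6 bits */
}
```

With cl = 0xF3 a 32-bit shift uses 0xF3 & 0x1F, and a 64-bit shift uses 0xF3 & 0x3F.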

Answer 32

Convert a quad word to an oct word.

Answer 33

``` imulq = signed multiplication mulq = unsigned multiplication ```

Answer 34

They are special instructions that provide full 128-bit multiplication and division. ``` imulq = signed multiplication mulq = unsigned multiplication ```

Answer 35

The carry flag, zero flag, sign flag, and overflow flag.

Answer 36

It indicates that the most recent operation generated a carry out of the most significant bit. It's used to detect overflow of unsigned calculations.

Answer 37

To indicate that the most recent operation yielded zero.

Answer 38

To indicate that the most recent operation yielded a negative value.

Answer 39

To indicate that the most recent operation caused a two's complement overflow -- either positive or negative.

Answer 40

The CMP and TEST classes of instructions.

Answer 41

Syntax: cmp{b,w,l,q} S_1, S_2. Effect: Set condition flags based on S_2 - S_1.

Answer 42

Syntax: test{b,w,l,q} S_1, S_2. Effect: Set condition flags based on S_2 & S_1.

Answer 43

``` (SF ^ OF) | ZF // Note: the SF ^ OF term gives the correct signed "less than" result even when the most recent operation overflowed. ```

Answer 44

``` ~(SF ^ OF) & ~ZF // Note: the SF ^ OF term gives the correct signed comparison even when the most recent operation overflowed. ```
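
To see why SF ^ OF is needed, the flags of a 64-bit cmp can be modelled in C. This is a sketch under the assumption that the flags describe t = S_2 - S_1; the type and function names are invented for illustration.

```c
#include <stdbool.h>
#include <stdint.h>

typedef struct { bool zf, sf, of; } flags_t;

/* Flags that "cmpq s1, s2" would set: they describe t = s2 - s1. */
flags_t cmp_flags(int64_t s1, int64_t s2) {
    uint64_t u1 = (uint64_t)s1, u2 = (uint64_t)s2;
    uint64_t t = u2 - u1;                  /* wraps modulo 2^64 */
    flags_t f;
    f.zf = (t == 0);
    f.sf = (t >> 63) != 0;
    /* Signed overflow: operands differ in sign and the
       result's sign differs from s2's sign. */
    f.of = (((u2 ^ u1) & (u2 ^ t)) >> 63) != 0;
    return f;
}

/* s2 <= s1 (signed) */
bool setle(flags_t f) { return (f.sf ^ f.of) | f.zf; }

/* s2 > s1 (signed) */
bool setg(flags_t f) { return !(f.sf ^ f.of) && !f.zf; }
```

Comparing INT64_MIN against 1 overflows, so SF alone would report the wrong answer; XOR-ing with OF corrects it.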

Answer 45

A. cmpl %esi, %edi setl %al // Data types: int // Operation: a<b

Answer 46

A. cmpb %sil, %dil setbe %al // Data types: unsigned char // Operation: a<=b B. cmpq %rsi, %rdi setne %al // Data types: long, unsigned long or a pointer // Operation: a!=b

Answer 47

(1) We can set a single byte to 0 or 1 depending on some combination of the condition codes; (2) We can conditionally jump to some other part of the program, or; (3) We can conditionally transfer data.

Answer 48

The condition, NOT the data size.

Answer 49

A. testq %rdi, %rdi setge %al // Data types: long // Operation: a>=0 B. testw %di, %di sete %al // Data types: short // Operation: a==0

Answer 50

A. testb %dil, %dil seta %al // Data types: unsigned char // Operation: a>0 B. testl %edi, %edi setle %al // Data types: int // Operation: a<=0

Answer 51

The jmp instruction.

Answer 52

Jump target.

Answer 53

A direct and indirect jump.

Answer 54

jmp *operand. The operand is either a register or memory location.

Answer 55

PC relative and with an "absolute" address.

Answer 56

4003fa: 74 02 je XXXXXX 4003fc: ff d0 callq *%rax // 0x2 is the offset from the byte after the jump instruction, 0x4003fc. Thus, the jump target is 0x4003fe.

Answer 57

40042f: 74 f4 je XXXXXX 400431: 5d pop %rbp // The jump is 0xf4=-12. Adding this to 0x400431 gives the jump target 0x400425.

Answer 58

``` XXXXXX: 77 02 ja 400547 XXXXXX: 5d pop %rbp // The jump target, 0x400547, is the address of the byte after the jump instruction plus 0x2. Subtracting 0x2 from 0x400547 gives 0x400545 so the filled in assembly code is: 400543: 77 02 ja 400547 400545: 5d pop %rbp ```

Answer 59

4005e8: e9 73 ff ff ff jmpq XXXXXXX 4005ed: 90 nop // The target of the jump is encoded as 0xffffff73 (the bytes appear reversed because this is a little endian machine). Complementing the bits and adding one gives 0x8D=141, so the jump is -0x8D=-141. Adding this to 0x4005ed, the address of the next instruction, gives the jump target 0x400560.
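
The PC-relative arithmetic used in these jump problems can be sketched in C: the target is the address of the following instruction plus the sign-extended displacement. The function names are hypothetical.

```c
#include <stdint.h>

/* Interpret a 1-byte displacement as a signed value. */
static int64_t sign_extend8(uint8_t d) {
    return d < 0x80 ? (int64_t)d : (int64_t)d - 0x100;
}

/* Interpret a 4-byte displacement as a signed value. */
static int64_t sign_extend32(uint32_t d) {
    return d < 0x80000000u ? (int64_t)d : (int64_t)d - 0x100000000LL;
}

/* next_insn is the address of the instruction after the jump. */
uint64_t jump_target_rel8(uint64_t next_insn, uint8_t disp) {
    return next_insn + (uint64_t)sign_extend8(disp);
}

uint64_t jump_target_rel32(uint64_t next_insn, uint32_t disp) {
    return next_insn + (uint64_t)sign_extend32(disp);
}
```

The displacement passed in is the value after little-endian decoding, so the byte sequence 73 ff ff ff is passed as 0xffffff73.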

Answer 60

Option 1: ``` t = test-expr; if (!t) goto false; then-statement goto done; false: else-statement done: ``` Option 2: ``` t = test-expr; if (t) goto true; else-statement goto done; true: then-statement done: ```

Answer 61

Option 1 is better because it is easier to adapt when there is no "else" statement. It also has a structure similar to the normal form of an if statement.

Answer 62

The first conditional branch is part of the implementation of the && expression. If the test for a being non-null fails, the code will skip the test of a >= *p.

Answer 63

It allows object code to be relocatable, i.e. it can be shifted to different portions of memory without alteration.

Answer 64

Conditional transfer of control and conditional transfer of data.

Answer 65

When the amount of data that needs to be computed is small.

Answer 66

This approach computes both outcomes of a conditional operation and then selects one based on whether or not the condition holds.

Answer 67

A conditional transfer of data involves computing both outcomes of a conditional operation and then selecting one based on whether or not the condition holds. This strategy makes sense only in restricted cases, but it can then be implemented by a simple conditional move instruction.

Answer 68

The CMOV instruction class.

Answer 69

v = then-expr; ve = else-expr; t = test-expr; if (!t) v = ve;
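
As a concrete instance of this template, here is a sketch using an absolute-difference function (an illustrative choice, not taken from the card). Both versions return the same value; the second has the shape a compiler can turn into a conditional move.

```c
/* Conditional control transfer: only one expression is evaluated. */
long absdiff(long x, long y) {
    if (x < y)
        return y - x;
    else
        return x - y;
}

/* Conditional data transfer: both expressions are evaluated,
   then one result is selected. */
long cmovdiff(long x, long y) {
    long v  = y - x;    /* then-expr */
    long ve = x - y;    /* else-expr */
    int  t  = x < y;    /* test-expr */
    if (!t) v = ve;     /* candidate for a single cmov */
    return v;
}
```

This only works because neither expression has side effects or can fault.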

Answer 70

cmove and cmovne.

Answer 71

cmovs and cmovns.

Answer 72

cmov[X] S,D, which moves the data from the source S to the destination D when the condition specified by [X] holds.

Answer 73

2, 4 or 8 byte registers.

Answer 74

For example, when the result of one statement has a side effect or can generate an error condition.

Answer 75

OP is the divide operator, implemented with the right shift ">>4"
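
A sketch of how division by 16 can be done with a right shift by 4. It assumes that >> on a negative long is an arithmetic shift, which is implementation-defined in C but is what gcc and clang do on x86-64. The function name is hypothetical.

```c
/* x / 16, rounding toward zero, using a shift. */
long div16(long x) {
    /* Bias negative values by 2^4 - 1 so the shift rounds toward zero. */
    long biased = x < 0 ? x + 15 : x;
    return biased >> 4;
}
```

Without the bias, -17 >> 4 would give -2 rather than the -1 that C division requires.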

Answer 76

Branch prediction logic.

Answer 77

``` loop: body-statement t = test-expr; if (t) goto loop; ```

Answer 78

As "jump in the middle" and "guarded do" while loops.

Answer 79

``` goto test; loop: body-statement test: t=test-expr; if (t) goto loop; ```

Answer 80

``` t=test-expr; if (!t) goto done; loop: body-statement t=test-expr; if (t) goto loop; done: ```
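
A worked instance of the guarded-do template, using a factorial loop as an illustrative example (the function names are made up):

```c
/* Ordinary while loop. */
long fact_while(long n) {
    long result = 1;
    while (n > 1) {
        result *= n;
        n--;
    }
    return result;
}

/* Same loop in guarded-do form with explicit gotos. */
long fact_guarded_do(long n) {
    long result = 1;
    if (!(n > 1))        /* initial guard: skip the loop entirely */
        goto done;
loop:
    result *= n;         /* body-statement */
    n--;
    if (n > 1)           /* test at the bottom, as in do-while */
        goto loop;
done:
    return result;
}
```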

Answer 81

A "jump in the middle" while loop.

Answer 82

``` init-expr; t=test-expr; if (!t) goto done; loop: body-statement update-expr; t=test-expr; if (t) goto loop; done: ```

Answer 83

When the body statement contains a continue statement. (The update-expr expression would be skipped whenever the continue statement executes, which can result in an infinite loop.)

Answer 84

A switch statement provides a multiway branching capability based on the value of an integer index. Switch statements are particularly useful when dealing with tests where there can be a large number of possible outcomes. Not only do they make the C code more readable, but they also allow an efficient implementation using a data structure called a jump table.

Answer 85

A jump table is an array where entry i is the address of a code segment implementing the action the program should take when the switch index equals i.
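
The idea can be imitated in portable C with an array of function pointers. This is only an analogy: a compiler's jump table holds addresses of code labels inside one function, not separate functions, and all names and case bodies below are invented for illustration.

```c
#include <stddef.h>

typedef long (*handler_t)(long);

static long case0(long x) { return x * 13; }
static long case2(long x) { return x + 10; }
static long case3(long x) { return x + 11; }
static long dflt(long x)  { (void)x; return 0; }

/* Entry i is the code to run when the index equals i.
   Index 1 has no case of its own, so it points at the default. */
static const handler_t table[4] = { case0, dflt, case2, case3 };

long dispatch(long x, unsigned long n) {
    if (n >= sizeof table / sizeof table[0])
        return dflt(x);          /* out of range: default case */
    return table[n](x);          /* one indexed, indirect jump */
}
```

The cost of dispatch is one bounds check and one indexed load, independent of how many cases there are.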

Answer 86

When the test cases span a small range of values and there are a number of cases.

Answer 87

A goto location determined using a jump table (seen in switch statements).

Answer 88

Optional: complete Practice Problem 3.31 on page 274.

Answer 89

When an x86-64 procedure requires storage beyond what it can hold in registers, it allocates space on the stack. This region is referred to as the procedure’s stack frame.

Answer 90

The return address.

Answer 91

P's stack frame.

Answer 92

If there are six or fewer arguments then they may all be passed through registers. If there are more than six arguments, the remaining arguments are stored on the stack.

Answer 93

The call instruction.

Answer 94

%rsp, the stack pointer register, and %rip, the program counter register.

Answer 95

%rdi, %rsi, %rdx, %rcx, %r8, and %r9.

Answer 96

1. When there are not enough registers to hold all of the local data. 2. When the memory address of a local variable is required (the address operator & is applied to it). 3. When some of the local variables are arrays or structures, which must be accessed by array or structure references.

Answer 97

A register whose value must be preserved by the callee.

Answer 98

A register whose value must be preserved by the caller.

Answer 99

%rbx, %rbp, and %r12-%r15.

Answer 100

All registers except for %rbx, %rbp, %r12-%r15 and %rsp.

Answer 101

Either by not changing it at all, or by pushing its original value onto the stack and popping it back before returning.

Answer 102

No. Register %rbx is a callee-saved register so its value will be preserved by the function Q, but register %rdi is a caller-saved register so its value should be preserved by function P before invoking function Q.

Answer 103

x_A + L*i.

Answer 104

movl (%rdx,%rcx,4), %eax.

Answer 105

*(E+i-3) corresponds to the memory reference M[x_E+4*i-12], which can be evaluated using the instruction: movl -12(%rdx,%rcx,4), %eax

Answer 106

Expression: P[1] Type: short Value: M[x_P + 2] Assembly code: movw 2(%rdx), %ax

Answer 107

Expression: P[2] Type: short Value: M[x_P + 4] Assembly code: movw 4(%rdx), %ax

Answer 108

Expression: P + 3 + i Type: short* Value: x_P + 2*i + 6 Assembly code: leaq 6(%rdx,%rcx,2), %rax

Answer 109

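Expression: P[i * 6 - 5] Type: short Value: M[x_P + 12*i - 10] Assembly code: movw -10(%rdx,%rcx,12),%ax // Note: x86-64 only permits scale factors of 1, 2, 4, or 8, so this addressing form is shorthand; real code would first compute 6*i in a register and then use a scale of 2.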

Answer 110

Expression: &P[i + 2] Type: short* Value: x_P + 2*i + 4 Assembly code: leaq 4(%rdx,%rcx,2), %rax

Answer 111

&D[i][j]=x_D + L(C*i+j).

Answer 112

We can see that the reference to matrix P is at byte offset 8*(7i + j), while the reference to matrix Q is at byte offset 8*(5j + i). From this, we can determine that P has 7 columns, while Q has 5, giving M=5 and N=7.
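
The row-major offsets can be confirmed by letting the compiler lay out arrays with these dimensions. This sketch assumes the declarations P[M][N] and Q[N][M] with long elements, which is how the offsets above are interpreted; the helper names are hypothetical.

```c
#include <stddef.h>

enum { M = 5, N = 7 };

static long P[M][N];   /* 7 columns per row */
static long Q[N][M];   /* 5 columns per row */

/* Byte offset of P[i][j] from the start of P. */
size_t p_offset(size_t i, size_t j) {
    return (size_t)((char *)&P[i][j] - (char *)P);
}

/* Byte offset of Q[j][i] from the start of Q. */
size_t q_offset(size_t i, size_t j) {
    return (size_t)((char *)&Q[j][i] - (char *)Q);
}
```

The multiplier applied to the row index is always the number of columns, which is what lets M and N be read off the assembly code.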

Answer 113

Structures and unions.

Answer 114

The sum of the sizes of each contained field, plus the padding needed to satisfy the alignment requirement of each field.

Answer 115

Command: r->p = &r->a[r->i + r->j]; Register: r in %rdi. Instructions: movl 4(%rdi), %eax // Get r->j addl (%rdi), %eax // Add r->i cltq // Extend to 8 bytes leaq 8(%rdi,%rax,4), %rax // Compute &r->a[r->i + r->j] movq %rax, 16(%rdi) // Store in r->p

Answer 116

p: 0 s.x: 8 s.y: 10 next: 12

Answer 117

``` void st_init(struct test *st) { st->s.y = st->s.x; st->p = st->s.y; st->next = st; } ```

Answer 118

``` A. short test(struct ACE *ptr) { short result=1; while (ptr!=0) {result*=ptr->v; ptr=ptr->p;} return result; } ``` B. The data structure is a singly-linked list and the function computes the product of all the values in the list.

Answer 119

The size of the largest field, plus whatever padding is needed to satisfy its alignment requirement.

Answer 120

It can be used to access the bit pattern of a data type.

Answer 121

On a little endian machine, the least significant bytes (corresponding to word0) will be stored first and the most significant bytes (corresponding to word1) will be stored last. On such a machine, the code is given by: ``` double uu2double(unsigned word0, unsigned word1) { union { double d; unsigned u[2]; } temp; temp.u[0] = word0; temp.u[1] = word1; return temp.d; } ``` On a big endian machine, word0 and word1 are the other way around.

Answer 122

The restriction (placed by many computer systems) on the allowable address for the primitive data types, requiring that the address for some objects must be a multiple of some value K (typically 2, 4, or 8).

Answer 123

Its address must be a multiple of K.

Answer 124

12 bytes. This includes the 3 bytes needed to pad the character c so that j meets its 4-byte alignment restriction.

Answer 125

12 bytes. This includes the 3 bytes needed to pad the end of that structure so that it meets a 4-byte alignment restriction necessary to allocate an array of structs of type S2.

Answer 126

``` A. struct P1 { short i; int c; int *j; short *d; }; // Offset (in bytes): i=0, c=4, j=8, d=16. // Total size: 24 bytes. // Alignment requirement: 8. B. struct P2 { int i[2]; char c[8]; short s[4]; long *j; }; // Offset (in bytes): i=0, c=8, s=16, j=24. // Total size: 32 bytes. // Alignment requirement: 8. ```

Answer 127

``` A. struct P3 { long w[2]; int *c[2]; }; // Offset (in bytes): w=0, c=16. // Total size: 32 bytes. // Alignment requirement: 8. B. struct P4 { char w[16]; char *c[2]; }; // Offset (in bytes): w=0, c=16. // Total size: 32 bytes. // Alignment requirement: 8. ```

Answer 128

``` A. struct P5 { struct P4 a[2]; struct P1 t; }; // Offset (in bytes): a=0, t=64. // Total size: 88 bytes. // Alignment requirement: 8. ```
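
Layouts like these can be checked mechanically with offsetof, sizeof, and _Alignof. The sketch below assumes an x86-64 System V (LP64) target, where pointers and long are 8 bytes; print_layout is a hypothetical helper.

```c
#include <stddef.h>
#include <stdio.h>

struct P1 { short i; int c; int *j; short *d; };
struct P2 { int i[2]; char c[8]; short s[4]; long *j; };
struct P4 { char w[16]; char *c[2]; };
struct P5 { struct P4 a[2]; struct P1 t; };

/* Print field offsets, total size, and alignment for P1 and P5. */
void print_layout(void) {
    printf("P1: i=%zu c=%zu j=%zu d=%zu size=%zu align=%zu\n",
           offsetof(struct P1, i), offsetof(struct P1, c),
           offsetof(struct P1, j), offsetof(struct P1, d),
           sizeof(struct P1), (size_t)_Alignof(struct P1));
    printf("P5: a=%zu t=%zu size=%zu align=%zu\n",
           offsetof(struct P5, a), offsetof(struct P5, t),
           sizeof(struct P5), (size_t)_Alignof(struct P5));
}
```

A structure's alignment is the largest alignment of any of its fields, so every structure here that contains a pointer has an alignment requirement of 8 on this target.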

Answer 129

A common source of state corruption resulting from an out-of-bounds access.

Answer 130

Optional: complete practice problem 3.46.

Answer 131

Exploit code.

Answer 132

Stack randomisation (subset of ASLR; address-space layout randomisation), stack protection, and limiting the regions of memory that can hold executable code.

Answer 133

The randomisation of the position of the stack used to help thwart buffer overflow attacks.

Answer 134

Stack protection refers to the usage of a "canary" value on the stack to detect whether a buffer has overflowed.

Answer 135

Adding a "nop sled" (consisting of a long sequence of "nop"s) before the actual exploit code. This increases the chances of the overwritten return address jumping onto the exploit code.

Answer 136

Between any local buffer and the rest of the stack state.

Answer 137

It is placed between any local buffer and the rest of the stack state once storage on the stack is allocated. Before restoring the register state and returning from the function, the program checks if the guard value has been altered. If so, the program aborts with an error.

Answer 138

Using the frame (or base) pointer, %rbp.

Answer 139

Register %rbp serves as a frame pointer, and is used to manage variable size stack frames.

Answer 140

The "leave" instruction. It is equivalent to the instructions: movq %rbp, %rsp // Set the stack pointer to the beginning of the frame popq %rbp // Restore the saved value of %rbp (a callee-saved register) and set the stack pointer to the end of caller's frame.

Answer 141

When the stack frame may be of variable size.

Answer 142

Optional: complete practice problem 3.49.

Answer 143

Single-instruction, multiple-data (SIMD) mode.

Answer 144

SSE (streaming SIMD extensions) and AVX (advanced vector extensions).

Answer 145

128 bits/16 bytes.

Answer 146

256 bits/32 bytes.

Answer 147

Registers %ymm0-%ymm7.

Answer 148

Registers %ymm8-%ymm15.

Answer 149

vmovss (single-precision) and vmovsd (double-precision)

Answer 150

vmovaps and vmovapd.

Answer 151

vcvttss2si, vcvttsd2si, vcvttss2siq, and vcvttsd2siq. The source can either be a memory location or XMM register and the destination must be an integer register.

Answer 152

vcvtsi2ss, vcvtsi2sd, vcvtsi2ssq, and vcvtsi2sdq. The first source can be an integer register or a memory location but the second source must be an XMM register. The destination must be an XMM register.

Answer 153

vunpcklps %xmm0, %xmm0, %xmm0 // Interleave values in sources and store in destination. vcvtps2pd %xmm0, %xmm0 // Convert two vector elements to double precision.

Answer 154

vmovddup %xmm0, %xmm0 // Replicate first vector element. vcvtpd2psx %xmm0, %xmm0 // Convert two vector elements to single.

Answer 155

[s1, d1, s0, d0].

Answer 156

``` val1 = d. val2 = i. val3 = l. val4 = f. ```

Answer 157

src_t: double dest_t: int Instruction(s): vcvttsd2si %xmm0, %eax ``` src_t: double dest_t: float Instruction(s): vmovddup %xmm0, %xmm0 vcvtpd2psx %xmm0, %xmm0 ```

Answer 158

src_t: long dest_t: float Instruction(s): vcvtsi2ssq %rdi, %xmm0, %xmm0 src_t: float dest_t: long Instruction(s): vcvttss2siq %xmm0, %rax

Answer 159

``` Add: vaddss or vaddsd Subtract: vsubss or vsubsd Multiply: vmulss or vmulsd Divide: vdivss or vdivsd Max: vmaxss or vmaxsd Min: vminss or vminsd Square root: sqrtss or sqrtsd ```

Answer 160

``` double funct1a(int p, float q, long r, double s); or double funct1b(int p, long q, float r, double s); ```

Answer 161

``` double funct2(double w, int x, float y, long z) { return x*y - w/z; } ```

Answer 162

vxorps, vxorpd, vandps and vandpd.

Answer 163

vcomiss and vcomisd.

Answer 164

When the least significant byte of the result of the most recent operation contains an even number of 1 bits.
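
The integer definition of the parity flag can be written out in C. A minimal sketch; the function name is hypothetical.

```c
#include <stdbool.h>
#include <stdint.h>

/* PF for an integer result: true when the low-order byte
   contains an even number of 1 bits. Higher bytes are ignored. */
bool parity_flag(uint64_t result) {
    uint8_t low = (uint8_t)result;
    unsigned ones = 0;
    for (; low != 0; low >>= 1)
        ones += low & 1u;
    return (ones % 2) == 0;
}
```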

Answer 165

When either operand of the most recent floating-point comparison is NaN.

Answer 166

The carry flag, zero flag and parity flag are all set to true.
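
At the C level, the unordered outcome is the case where neither <, ==, nor > holds. The sketch below uses the standard isunordered macro from <math.h> to separate that case first; the enum and function names are made up for illustration.

```c
#include <math.h>
#include <stdbool.h>

typedef enum { CMP_LESS, CMP_EQUAL, CMP_GREATER, CMP_UNORDERED } cmp_result;

/* Classify x relative to y, treating NaN operands as their own case. */
cmp_result classify(double x, double y) {
    if (isunordered(x, y))      /* either operand is NaN */
        return CMP_UNORDERED;
    if (x < y)
        return CMP_LESS;
    if (x == y)
        return CMP_EQUAL;
    return CMP_GREATER;
}
```

In machine code, the unordered check corresponds to testing the parity flag after the comparison instruction.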

Answer 167

"Jump on parity". It is used to jump when the result of the most recent floating-point operation yielded NaN.


(196 cards)