8: Smashing the Stack For Fun and Profit Flashcards

Question

How are procedure prolog and epilog handled in Intel and Motorola CPUs?

Answer 1

The Intel ENTER and LEAVE instructions and the Motorola LINK and UNLINK instructions have been provided to do most of the procedure prolog and epilog work efficiently.

Answer 2

A buffer overflow is the result of stuffing more data into a buffer than it can handle.

Answer 3

* A segmentation violation occurs because strcpy() copies the contents of \*str (large\_string[]) into buffer[] until a null character is found in the string.
* buffer[] is much smaller than \*str.
* Since buffer[] is 16 bytes and large\_string[] is 256 bytes, all 250 [240] bytes after buffer on the stack are overwritten.
* This includes SFP, RET, and even \*str.
* large\_string was filled with the character 'A', which has the hex value 0x41.
* The return address is now 0x41414141.
* This is outside the process address space.
* Thus, when the function returns and tries to read the next instruction from that address, a segmentation violation occurs.

Answer 4

What we have done is add 12 to buffer1[]'s address. This new address is where the return address is stored. We want to skip past the assignment to the printf call. How did we know to add 8 [should be 10] to the return address? We used a test value first (for example 1), compiled the program, and then started gdb:

Answer 5

In most cases we'll simply want the program to spawn a shell. From the shell we can then issue other commands as we wish.

Answer 6

* The answer is to place the code (shellcode) you are trying to execute in the buffer we are overflowing, and overwrite the return address so it points back into the buffer.

Answer 7

Now execve(). Keep in mind we are using an Intel-based Linux system. The syscall details will change from OS to OS, and from CPU to CPU. Some will pass the arguments on the stack, others in registers. Some use a software interrupt to jump to kernel mode, others use a far call. Linux passes its arguments to the system call in registers, and uses a software interrupt to jump into kernel mode.

Answer 8

The procedure prelude

Answer 9

As we can see there is not much to the execve() system call. All we need to do is:

Answer 10

The program will continue fetching instructions from the stack, which may contain random data! The program will most likely core dump. We want the program to exit cleanly if the execve syscall fails. To accomplish this we must then add an exit syscall after the execve syscall. What does the exit syscall look like?

Answer 11

The problem is that we don't know where in the memory space of the program we are trying to exploit the code (and the string that follows it) will be placed. One way around it is to use a JMP and a CALL instruction. The JMP and CALL instructions can use IP-relative addressing, which means we can jump to an offset from the current IP without needing to know the exact address of where in memory we want to jump to. If we place a CALL instruction right before the "/bin/sh" string, and a JMP instruction to it, the string's address will be pushed onto the stack as the return address when CALL is executed. All we need then is to copy the return address into a register.

Answer 12

To get around this restriction we must place the code we wish to execute in the stack or data segment, and transfer control to it. To do so we will place our code in a global array in the data segment. We first need a hex representation of the binary code. Let's compile it first, and then use gdb to obtain it.

Answer 13

What we have done above is fill the array large\_string[] with the address of buffer[], which is where our code will be. Then we copy our shellcode into the beginning of the large\_string string. strcpy() will then copy large\_string onto buffer without doing any bounds checking, and will overflow the return address, overwriting it with the address where our code is now located. Once we reach the end of main and it tries to return, it jumps to our code and execs a shell.

The problem we face when trying to overflow the buffer of another program is figuring out at what address the buffer (and thus our code) will be. The answer is that for every program the stack will start at the same address. Most programs do not push more than a few hundred or a few thousand bytes onto the stack at any one time. Therefore, by knowing where the stack starts we can try to guess where the buffer we are trying to overflow will be.

Answer 14

Here is a little program that will print its stack pointer:

Answer 15

We'd have to guess what the buffer and offset should be. Trying to guess the offset even while knowing where the beginning of the stack lives is nearly impossible. We would need at best a hundred tries, and at worst a couple of thousand. The problem is we need to guess \*exactly\* where the address of our code will start. If we are off by one byte more or less we will just get a segmentation violation or an invalid instruction.

Answer 16

One way to increase our chances is to pad the front of our overflow buffer with NOP instructions. Almost all processors have a NOP instruction that performs a null operation. It is usually used to delay execution for purposes of timing. We will take advantage of it and fill half of our overflow buffer with them. We will place our shellcode at the center, and then follow it with the return addresses. If we are lucky and the return address points anywhere in the string of NOPs, they will just get executed until they reach our code. In the Intel architecture the NOP instruction is one byte long and it translates to 0x90 in machine code. Assuming the stack starts at address 0xFF, that S stands for shell code, and that N stands for a NOP instruction, the new stack would look like this:

Answer 17

A good selection for our buffer size is about 100 bytes more than the size of the buffer we are trying to overflow. This will place our code at the end of the buffer we are trying to overflow, giving a lot of space for the NOPs, but still overwriting the return address with the address we guessed. The buffer we are trying to overflow is 512 bytes long, so we'll use 612.

Answer 18

What we will do is place our shellcode in an environment variable, and then overflow the buffer with the address of this variable in memory. This method also increases your chances of the exploit working, as you can make the environment variable holding the shell code as large as you want. The environment variables are stored at the top of the stack when the program is started; any modifications by setenv() are then allocated elsewhere. The stack at the beginning then looks like this: `<strings><argv pointers>NULL<envp pointers>NULL<argc><argv><envp>`

Answer 19

Our new program will take an extra variable, the size of the variable containing the shellcode and NOPs. Our new exploit now looks like this:

Test it like this:

    [aleph1]$ ./exploit4 768
    Using address: 0xbffffdb0
    [aleph1]$ ./vulnerable $RET
    $

How does it work in xterm?

    [aleph1]$ export DISPLAY=:0.0
    [aleph1]$ ./exploit4 2148
    Using address: 0xbffffdb0
    [aleph1]$ /usr/X11R6/bin/xterm -fg $RET
    Warning: Color name ...°¤ÿ¿°¤ÿ¿°¤ ...
    Warning: some arguments in previous message were lost
    $

* Experiment both with positive and negative offsets.

Answer 20

* The standard C library provides a number of functions for copying or appending strings that perform no boundary checking.
* strcat(), strcpy(), sprintf(), and vsprintf()
* These functions operate on null-terminated strings, and do not check for overflow of the receiving string.
* gets() is a function that reads a line from stdin into a buffer until either a terminating newline or EOF.
* It performs no checks for buffer overflows.
* scanf() family of functions
* These can also be a problem if you are matching a sequence of non-whitespace characters (%s), or matching a non-empty sequence of characters from a specified set (%[]), and the array pointed to by the char pointer is not large enough to accept the whole sequence of characters, and you have not defined the optional maximum field width.
* If the target of any of these functions is a buffer of static size, and its other argument was somehow derived from user input, there is a good possibility that you might be able to exploit a buffer overflow.
* A while loop that reads one character at a time into a buffer from stdin or some file until the end of line, end of file, or some other delimiter is reached
* getc(), fgetc(), or getchar()
* If there are no explicit checks for overflows in the while loop, such programs are easily exploited.

Answer 21

The sources for free operating systems and their utilities are readily available. This fact becomes quite interesting once you realize that many commercial operating system utilities were derived from the same sources as the free ones. Use the source d00d.

(68 cards)