11-16 Flashcards

(153 cards)

1
Q

Thread: A new abstraction for __.

A

a single running process

2
Q

Multi-threaded program:

(3)…

A

• A multi-threaded program has more than one point of execution
- Multiple program counters, one for each thread
• They share the same address space
• Each thread has its own stack
• Each thread has its own private set of registers

3
Q

Context switch between threads:
Each thread has its own __.
- …

A

program counter and set of registers;

One or more thread control blocks (TCBs) are needed to store the state of each thread

4
Q

Context switch between threads:
When switching from running one thread (T1) to running the another (T2):
(3)…

A
  • The register state of T1 must be saved
  • The register state of T2 must be restored
  • The address space remains the same
5
Q

Why Use Threads?:

(2)…

A
  1. Parallelism
    • Divide a task among several threads
    • On a system with multiple processors threads can work in parallel with each other
  2. Overlap of I/O with other activities within a single program
    • When one thread requests I/O from the system, switch to another thread ready to work
    • Similar to multiprogramming across different processes
6
Q

Parallelism:

(2)…

A
  • Divide a task among several threads

* On a system with multiple processors threads can work in parallel with each other

7
Q

Overlap of I/O with other activities within a single program:
(2)…

A
  • When one thread requests I/O from the system, switch to another thread ready to work
  • Similar to multiprogramming across different processes
8
Q

The stack of the relevant thread:
There will be one stack per thread:
A Single-Threaded Address Space:
(3)…

A
  1. The code segment:
    where instructions live
  2. The heap segment:
    contains malloc'd data and dynamic data structures (it grows downward)
  3. The stack segment:
    contains local variables, arguments to routines, return values, etc. (it grows upward)
9
Q

Example: Creating a Thread

A
#include <pthread.h>
void *mythread(void *arg) {
printf("%s\n", (char *) arg);
return NULL;
}
int main(int argc, char *argv[]) {
if (argc != 1) {
fprintf(stderr, "usage: main\n");
exit(1);
}
pthread_t p1, p2;
printf("main: begin\n");
Pthread_create(&p1, NULL, mythread, "A");
Pthread_create(&p2, NULL, mythread, "B");
// join waits for the threads to finish
Pthread_join(p1, NULL);
Pthread_join(p2, NULL);
printf("main: end\n");
return 0;
}
10
Q
How to create and control threads?: 
#include <pthread.h>
int
pthread_create( pthread_t* thread,
const pthread_attr_t* attr,
void* (*start_routine)(void*),
void* arg);
• thread: __
• attr: __
- __
• start_routine: __
• arg: __
- a void pointer allows us to __
• return value: __
A

• thread: Used to interact with this thread
• attr: Used to specify any attributes this thread might have
- Stack size, Scheduling priority, …
• start_routine: the function this thread starts running in
• arg: the argument to be passed to the function (start routine)
- a void pointer allows us to pass in any type of argument
• return value: on success, returns 0; on error, it returns an error number, and the contents of *thread are undefined

11
Q
Wait for a thread to complete: 
int pthread_join(pthread_t thread, void **value_ptr);

• thread: __
• value_ptr: __
- Because the pthread_join() routine changes the value, __

A

• thread: Specify which thread to wait for
• value_ptr: A pointer to the return value
- Because the pthread_join() routine changes the value, you need to pass in a pointer to that value

12
Q

Example: Dangerous code
• Be careful with __ from a thread
(2)…

A

how values are returned

  1. Variable r is allocated on the stack
  2. When the thread returns, r is automatically de-allocated

void *mythread(void *arg) {
    myarg_t *m = (myarg_t *) arg;
    printf("%d %d\n", m->a, m->b);
    myret_t r; // Danger!!!!! Why is this bad?
    r.x = 1;
    r.y = 2;
    return (void *) &r;
}
13
Q

Example: Simpler Argument Passing to a Thread

A
Just passing in a single value:
void *mythread(void *arg) {
long m = (long) arg;
printf("%ld\n", m);
return (void *) (m + 1);
}
int main(int argc, char *argv[]) {
pthread_t p;
long m;
pthread_create(&p, NULL, mythread, (void *) 100);
pthread_join(p, (void **) &m);
printf("returned %ld\n", m);
return 0;
}
14
Q

Threading: Shared Variables

A
static volatile int counter = 0; // shared global variable
// mythread()
// Simply adds 1 to counter repeatedly, in a loop. No, this is not how you would add 10,000,000 to a counter,
// but it shows the problem nicely.
void *mythread(void *arg) {
printf("%s: begin\n", (char *) arg);
int i;
for (i = 0; i < 1e7; i++) {
counter = counter + 1;
}
printf("%s: done\n", (char *) arg);
return NULL;
}
// main()
// Just launches two threads (pthread_create)
// and then waits for them (pthread_join)
int main(int argc, char *argv[]) {
pthread_t p1, p2;
printf("main: begin (counter = %d)\n", counter);
Pthread_create(&p1, NULL, mythread, "A");
Pthread_create(&p2, NULL, mythread, "B");
Pthread_join(p1, NULL); // join waits for the threads to finish
Pthread_join(p2, NULL);
printf("main: done with both (counter = %d)\n", counter);
return 0;
}
15
Q

Race condition:
Example with two threads and counter = 50
• counter = __
• We expect the result to be __

A

counter + 1 (runs twice, once for each thread);

52

16
Q

Critical section:

(3)…

A
  • A piece of code that accesses a shared variable and must not be concurrently executed by more than one thread
  • Multiple threads executing critical section can result in a race condition
  • Need to support atomicity for critical sections (mutual exclusion)
17
Q

Locks ensure that any such critical section executes as if…

A

• An example: the canonical update of a shared variable:
balance = balance + 1;

• Add some code around the critical section:
lock_t mutex; // some globally-allocated lock ‘mutex’

lock(&mutex);
balance = balance + 1;
unlock(&mutex);

18
Q

Locks provide __ to a critical section.

A

  1. Interface
    int pthread_mutex_lock(pthread_mutex_t *mutex);
    int pthread_mutex_unlock(pthread_mutex_t *mutex);
  2. Usage (w/o lock initialization and error check)
    pthread_mutex_t lock;
    pthread_mutex_lock(&lock);
    x = x + 1; // or whatever your critical section is
    pthread_mutex_unlock(&lock);

  • No other thread holds the lock → the thread will acquire the lock and enter the critical section
  • If another thread holds the lock → the thread will not return from the call until it has acquired the lock
19
Q
  • No other thread holds the lock → __

  • If another thread holds the lock → __

A

the thread will acquire the lock and enter the critical section;
the thread will not return from the call until it has acquired the lock

20
Q

Lock Initialization:
All locks must be __.

2 ways:

A

  1. One way: using PTHREAD_MUTEX_INITIALIZER
    pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;
  2. The dynamic way: using pthread_mutex_init()
    pthread_mutex_t lock;
    int rc = pthread_mutex_init(&lock, NULL);
    assert(rc == 0); // always check success!
21
Q

Lock Error Checking:

• Always check the __.

A
return value for errors when calling lock and unlock
---
• An example wrapper
// Use this to keep your code clean but check for failures
// Only use if exiting program is OK upon failure
void Pthread_mutex_lock(pthread_mutex_t *mutex) {
int rc = pthread_mutex_lock(mutex);
assert(rc == 0);
}
22
Q

Condition variables are useful when __.

A

some kind of signaling must take place between threads

int pthread_cond_wait(pthread_cond_t *cond, pthread_mutex_t *mutex);
int pthread_cond_signal(pthread_cond_t *cond);

23
Q
int pthread_cond_wait(pthread_cond_t *cond, pthread_mutex_t *mutex);
int pthread_cond_signal(pthread_cond_t *cond);
  1. pthread_cond_wait
    (2) …
  2. pthread_cond_signal
    (1) …
A
  • Put the calling thread to sleep
  • Wait for some other thread to signal it;

• Unblock at least one of the threads that are blocked on the condition variable

24
Q
Condition Variable Example: 
A thread calling wait routine: 
pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;
pthread_cond_t init = PTHREAD_COND_INITIALIZER;
pthread_mutex_lock(&lock);
while (initialized == 0)
pthread_cond_wait(&init, &lock);
pthread_mutex_unlock(&lock);

(2)…

A

• The wait call releases the lock when putting the caller to sleep
• Before returning after being woken, the wait call re-acquires the lock

• A thread calling signal routine

pthread_mutex_lock(&lock);
initialized = 1;
pthread_cond_signal(&init);
pthread_mutex_unlock(&lock);

25
Threads: Compiling and Running: To compile, you must __.
include the header pthread.h • Explicitly link with the pthreads library, by adding the -pthread flag: prompt> gcc -o main main.c -Wall -pthread • For more information: man -k pthread
26
Thread API Guidelines: | (8)...
1. Keep it simple • Tricky thread interactions lead to bugs 2. Minimize thread interactions • Each interaction should be carefully thought out and constructed with known design patterns (which we will learn about) 3. Initialize locks and condition variables • Otherwise code can sometimes fail in strange and unpredictable ways 4. Check your return codes • Failure to do so will lead to bizarre and hard to understand behavior 5. Be careful how you pass arguments and return values • If you return a reference to a variable on the stack, you are going to have a bad time 6. Each thread has its own stack • To share data between threads, the values must be in the heap or a global variable 7. Always use condition variables to signal between threads • It's tempting to use a simple flag, but don't do it 8. Use the man pages • Highly informative with good examples
27
Locks: The Basic Idea Lock variable holds the state of the lock: (2)...
1. available (or unlocked or free) • No thread holds the lock 2. acquired (or locked or held) • Exactly one thread holds the lock and presumably is in a critical section
28
lock(): | (3)...
• Try to acquire the lock • If no other thread holds the lock, the thread will acquire the lock • Enter the critical section - This thread is said to be the owner of the lock
29
Other threads are prevented from entering the critical section while __. (2)...
the first thread holds the lock; 1. Other threads will block on the call to lock, until the lock is released 2. If several threads are waiting on the lock, only one will get it when it is released
30
Pthread Locks - mutex: | (3)...
• The name that the POSIX library uses for a lock • Used to provide mutual exclusion between threads pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER; Pthread_mutex_lock(&lock); // wrapper for pthread_mutex_lock() balance = balance + 1; Pthread_mutex_unlock(&lock); • We may be using different locks to protect different variables → Increase concurrency (a more fine-grained approach)
31
Building A Lock: | (2)...
* Efficient locks provide mutual exclusion at low cost | * Building a lock needs some help from the hardware and the OS
32
Evaluating locks – Basic criteria: | (3)...
1. Mutual exclusion • Does the lock work, preventing multiple threads from entering a critical section? 2. Fairness • Does each thread contending for the lock get a fair shot at acquiring it once it is free? (Starvation) 3. Performance • The time overheads added by using the lock
33
Controlling Interrupts: | Disable Interrupts for __.
critical sections
34
Controlling Interrupts: Disable Interrupts for critical sections: (2)...
• One of the earliest solutions used to provide mutual exclusion • Invented for single-processor systems

```
void lock()   { DisableInterrupts(); }
void unlock() { EnableInterrupts(); }
```
35
Disable Interrupts for critical sections: • One of the earliest solutions used to provide mutual exclusion • Invented for single-processor systems Problems: (3)...
• Require too much trust in applications - Greedy (or malicious) program could monopolize the processor • Do not work on multiprocessors • Code that masks or unmasks interrupts is executed slowly by modern CPUs
36
Locks: | Why is hardware support needed?
First attempt: Using a flag denoting whether the lock is held or not • The code below has problems

```
typedef struct __lock_t { int flag; } lock_t;

void init(lock_t *mutex) {
    // 0 → lock is available, 1 → held
    mutex->flag = 0;
}

void lock(lock_t *mutex) {
    while (mutex->flag == 1) // TEST the flag
        ;                    // spin-wait (do nothing)
    mutex->flag = 1;         // now SET it!
}

void unlock(lock_t *mutex) {
    mutex->flag = 0;
}
```

• Problem 1: No mutual exclusion (assume flag=0 to begin) • Problem 2: Spin-waiting wastes time waiting for another thread • So, we need an atomic instruction supported by hardware - test-and-set instruction, also known as atomic exchange
37
Test And Set (Atomic Exchange): An instruction to support the creation of simple locks

```
int TestAndSet(int *ptr, int new) {
    int old = *ptr; // fetch old value at ptr
    *ptr = new;     // store 'new' into ptr
    return old;     // return the old value
}
```

(4)...
• return (test) old value pointed to by the ptr • Simultaneously update (set) said value to new • This sequence of operations is performed atomically • x86_64: - xchg rax, (mem)
38
Evaluating Spin Locks: 1. Correctness: ... 2. Fairness: ... 3. Performance: ...
1. Correctness: yes • The spin lock only allows a single thread to enter the critical section 2. Fairness: no • Spin locks don't provide any fairness guarantees • Indeed, a thread spinning may spin forever 3. Performance: • For a single CPU, performance overheads can be quite painful • If the number of threads roughly equals the number of CPUs, spin locks work reasonably well
39
Compare-And-Swap: Test whether the value at the address(ptr) is equal to expected (3)...
• If so, update the memory location pointed to by ptr with the new value • In either case, return the actual value at that memory location • x86_64 - cmpxchg
40
Fetch-And-Add
Atomically increment a value while returning the old value at a particular address
41
__ can be built with fetch-and-add | • __ → fairness
Ticket lock; | Ensure progress for all threads
42
Hardware-based spin locks are __ and they __.
simple; | work
43
Spinning: • Hardware-based spin locks are simple and they work • In some cases, these solutions can be quite inefficient - Any time a thread gets caught spinning, it __
wastes an entire time slice doing nothing but checking a value
44
A Simple Approach: Just Yield: | When you are going to spin, __:
give up the CPU to another thread; 1. OS system call moves the caller from the running state to the ready state 2. The cost of a context switch can be substantial and the starvation problem still exists
45
Using Queues: Sleeping Instead of Spinning: Queue to __. • park() - __ • unpark(threadID) - __
keep track of which threads are waiting to enter the lock; • park() - Put a calling thread to sleep • unpark(threadID) - Wake a particular thread as designated by threadID
46
Wakeup/waiting race: Thread A releases the lock just right before Thread B calls park() • Thread B could potentially __. • Solaris solves this problem by adding a third system call: (2)...
sleep forever; setpark() • By calling this routine, a thread can indicate it is about to park • If the thread happens to be interrupted and the lock is freed before park is actually called, the subsequent park returns immediately instead of sleeping
47
Two-Phase Locks: | A two-phase lock realizes that spinning can be useful if __.
the lock is about to be released
48
Two-Phase Locks: A two-phase lock realizes that spinning can be useful if the lock is about to be released. 1. First phase: (2)... 2. Second phase: (2)...
1. First phase • The lock spins for a while, hoping that it can acquire the lock • If the lock is not acquired during the first spin phase, a second phase is entered 2. Second phase • The caller is put to sleep • The caller is only woken up when the lock becomes free later
49
Condition Variables: | (2)...
• There are many cases where we wish to have coordination between threads • A thread wishes to check whether a condition is true before continuing its execution --- • Example: • A parent thread might wish to check whether a child thread has completed • This is often called a join()
50
How to wait for a condition: | __.
Condition variable – an object used to wait for some condition to be true
51
Condition variable – an object used to wait for some condition to be true: 1. Waiting on the condition variable (2) ... 2. Signaling on the condition variable (1) ...
1. Waiting on the condition variable: • An explicit queue that threads can put themselves on when some state of execution is not as desired • The thread is no longer running, freeing up the CPU to run another thread 2. Signaling on the condition variable: • Some other thread, when it changes said state, can wake one of those waiting threads and allow them to continue
52
The wait() call takes a __ as a parameter
mutex
53
The wait() call takes a mutex as a parameter (3)...
* The wait() call releases the lock and puts the calling thread to sleep * When the thread wakes up, it must re-acquire the lock * It is assumed the thread is holding the lock when signal() is called
54
Parent waiting for Child: Use a condition variable: Parent: (2)...
1. Create the child thread and continues running itself 2. Call into thr_join() to wait for the child thread to complete • Acquire the lock • Check if the child is done • Put itself to sleep by calling wait() • Release the lock
55
Parent waiting for Child: Use a condition variable: Child: (2)...
1. Print the message “child” 2. Call thr_exit() to wake the parent thread • Grab the lock • Set the state variable done • Signal the parent thus waking it
56
The Importance of the State Variable done: Can you think of a scenario where we could run into problems? (3)...
Imagine the case where the child runs immediately • The child will signal, but there is no thread sleeping on the condition • When the parent runs, it will call wait and be stuck • No thread will ever wake it, sad!
57
```
volatile int done = 0;

void thr_exit() {
    done = 1;
    Pthread_cond_signal(&c);
}

void thr_join() {
    if (done == 0)
        Pthread_cond_wait(&c);
}
```

```
int main(int argc, char *argv[]) {
    printf("parent: begin\n");
    pthread_t p;
    Pthread_create(&p, NULL, child, NULL);
    thr_join();
    printf("parent: end\n");
    return 0;
}
```

Can you find the bug? ...
The issue here is a race condition 1. The parent calls thr_join() • The parent checks the value of done • It will see that it is 0 and try to go to sleep • Just before it calls pthread_cond_wait to go to sleep, the parent is interrupted and the child runs 2. The child changes the state variable done to 1 and signals • But no thread is waiting and thus no thread is woken • When the parent runs again, it sleeps forever, sad!
58
The Producer / Consumer (Bounded Buffer) Problem: 1. Producer (2) ... 2. Consumer (1) ...
1. Producer • Produces data items • Wishes to place data items in a buffer 2. Consumer • Grabs data items out of the buffer to consume them in some way --- Example: Multi-threaded web server • A producer puts HTTP requests into a work queue • Consumer threads take requests out of this queue and process them
59
``` Producer/Consumer (non-working): • Put: ... • Get: ... • Producer: ... • Consumer: ... • Need... ```
* Put -- Only put data into the buffer when count is zero (i.e., when the buffer is empty) * Get -- Only get data from the buffer when count is one (i.e., when the buffer is full) * Producer -- puts an integer into the shared buffer loops number of times * Consumer -- gets the data out of that shared buffer * Need synchronization between the producer and consumer
60
Producer/Consumer: Single CV and If Statement: 1. A single condition variable cond and associated lock mutex • Works if __. 2. What happens if that is not the case (e.g., 2 consumers, 1 producer)? (3)... 3. Recheck state (in a while loop) upon returning from wait!
there is one producer and one consumer; * C1 runs and waits, P1 puts an item in and signals C1 * Before C1 gets to run, C2 sneaks in and consumes the item, setting count to 0 * When C1 runs, no more items left, sad!
61
Thread Trace: Broken Solution The problem arises for a simple reason: ...
After the producer woke Tc1, but before Tc1 ever ran, the state of the bounded buffer was changed by Tc2.
62
There is no guarantee that when the woken thread runs, the state will still be as desired -> __.
Mesa semantics • Virtually every system ever built employs Mesa semantics
63
Hoare semantics provides a stronger guarantee that...
the woken thread will run immediately upon being woken
64
Producer/Consumer: Single CV and While

```
cond_t cond;
mutex_t mutex;

void *producer(void *arg) {
    int i;
    for (i = 0; i < loops; i++) {
        Pthread_mutex_lock(&mutex);
        while (count == 1)
            Pthread_cond_wait(&cond, &mutex);
        put(i);
        Pthread_cond_signal(&cond);
        Pthread_mutex_unlock(&mutex);
    }
}
```

```
void *consumer(void *arg) {
    int i;
    for (i = 0; i < loops; i++) {
        Pthread_mutex_lock(&mutex);
        while (count == 0)
            Pthread_cond_wait(&cond, &mutex);
        int tmp = get();
        Pthread_cond_signal(&cond);
        Pthread_mutex_unlock(&mutex);
        printf("%d\n", tmp);
    }
}
```

This fixes our previous problem, however, this code still has a bug (5)...
• Assume two consumers and one producer • C1 runs, finds the buffer empty and waits, C2 runs, finds the buffer empty and waits • P1 runs, produces an item, signals, and waits because buffer is full • C1 wakes (from P1 signal) and consumes the buffer, signals, and then waits - Who gets the signal, P1 or C2? • C2 wakes, finds the buffer empty and waits – everyone is sleeping, sad!
65
The single Buffer Producer/Consumer Solution
Use two condition variables and while loops • Producer threads wait on the condition empty, and signal fill • Consumer threads wait on fill and signal empty
66
The Final Producer/Consumer Solution | ...
More concurrency and efficiency • Add more buffer slots • Allow concurrent production or consumption to take place • Reduce context switches
67
Covering Conditions: • Assume we have implemented a multi-threaded memory allocator • Also, assume there are zero bytes currently free: (4)...
• Thread Ta calls allocate(100) • Thread Tb calls allocate(10) • Both Ta and Tb wait on the condition and go to sleep • Thread Tc calls free(50) - Which waiting thread should be woken up?
68
Covering Conditions Solution: | (2)...
1. Replace pthread_cond_signal() with pthread_cond_broadcast() 2. pthread_cond_broadcast() • Wake up all waiting threads • Cost: too many threads might be woken up • Threads that shouldn’t be awake will simply wake up, re-check the condition, and then go back to sleep
69
We have a new synchronization primitive beyond locks: | ...
Condition variables
70
Allow for a thread to sleep when some program state is not as desired - Once sleeping, another thread must __.
wake up the thread by signal/broadcast
71
Condition variables are used in conjunction with a lock | (2)...
* When waiting on the CV, the lock is (temporarily) given up | * While returning from the wait, the thread re-acquires the lock
72
When a thread is signaled, it may not wake up right away | (2)...
* The state of the world may have changed | * Recheck your state (in a while loop) upon returning from wait if there is any chance the state may have changed
73
Adding threads is a good way to __.
parallelize your program
74
Adding threads is a good way to parallelize your program. Must be done correctly however: (3)...
* Adding locks to a data structure makes the structure thread safe * How locks are added determine both the correctness and performance of the data structure * Adding threads may actually slow down your code
75
Add a single lock: | The lock is acquired when __.
calling a routine that manipulates the data structure
76
Perfect Scaling: Ideally threads complete just as quickly on multiple processors as a single thread does on one (2)...
* Even though more work is done, it is done in parallel * The time taken to complete the task on each core is not increased For our example: • Single thread on one core: about 0.03 seconds • Two threads running concurrently: about 5 seconds
77
The performance costs of the simple approach: Each thread updates a single shared counter: (2)...
* Each thread updates the counter one million times | * iMac with four Intel 2.7GHz i5 CPUs
78
Synchronized counter scales __
poorly
79
The approximate counter works by representing: | (3)...
• A single logical counter, via numerous local physical counters, one per CPU core • A single global counter • There are multiple locks - One for each local counter and one for the global counter Example: on a machine with four CPUs • Four local counters • One global counter
80
The basic idea of approximate counting: When a thread running on a core wishes to increment the counter: (3)...
• It increments its local counter • Each CPU has its own local counter - Threads across CPUs can update local counters without contention - Therefore counter updates are scalable • The local values are periodically transferred to the global counter - Acquire the global lock - Increment it by the local counter’s value - The local counter is then reset to zero
81
Approximation Threshold: How often the local-to-global transfer occurs is determined by threshold S: 1. The smaller S: (1)... 2. The bigger S: (2)...
1. The smaller S: • The more the counter behaves like the non-scalable counter 2. The bigger S: • The more scalable the counter • The further off the global value might be from the actual count - Worst case: S * NUMCPUS
82
Approximate counter example: Tracing the Approximate Counters (3)...
* The threshold S is set to 5 * There is a thread on each of four CPUs * Each thread updates its local counter L1 … L4
83
Importance of the threshold value S: Each four threads increments a counter 1 million times on four CPUs: • Low S → __ • High S → __
* Low S → Performance is poor; the global count is always quite accurate * High S → Performance is excellent; the global count lags
84
Concurrent Linked Lists: 1. The code __ a lock in the insert routine upon entry 2. The code __ the lock upon exit (3) ...
acquires; releases --- • If malloc() happens to fail, the code must also release the lock before failing the insert • This kind of exceptional control flow has been shown to be quite error prone • Solution: The lock acquire and release only surround the actual critical section in the insert code
85
Hand-over-hand locking (lock coupling) | (3)...
• Add a lock per node of the list instead of having a single lock for the entire list • When traversing the list: - First grabs the next node's lock - And then releases the current node's lock • Enables a high degree of concurrency in list operations - However, in practice, the overhead of acquiring and releasing locks for each node of a list traversal is prohibitive
86
Michael and Scott Concurrent Queues: There are two locks: (3)...
* One for the head of the queue * One for the tail * The goal of these two locks is to enable concurrency of enqueue and dequeue operations
87
Michael and Scott Concurrent Queues: Add a dummy node: (2)...
* Allocated in the queue initialization code | * Enable the separation of head and tail operations
88
Concurrent Hash Table: Simple hash table: (3)...
* Does not resize * Built using the concurrent lists * It uses a lock per hash bucket each of which is represented by a list
89
We looked at a few of the concurrent data structures out there: (4)... Tips: (3)...
• Counters • Lists • Queues • Hash Tables --- • Be careful with acquiring and releasing locks around control flow changes • Enabling more concurrency does not necessarily increase performance • Premature optimization is the root of all evil!
90
Semaphore: • Created by Dijkstra to be a single primitive for synchronization - Can use as __. • An object with __. • We can manipulate with two routines: (2)...
both locks and condition variables; an integer value associated with it; • sem_wait() • sem_post()
91
#include <semaphore.h> sem_t s; sem_init(&s, 0, 1); // initialize s to the value 1 Must initialize before use: (2)...
* Declare a semaphore s and initialize it to the value 1 | * The second argument, 0, indicates that the semaphore is shared between threads in the same process
92
sem_wait(sem_t *s) | (3)...
• Decrements the integer value of the semaphore by 1 • If the value is negative, the caller will wait - It will cause the caller to suspend execution waiting for a subsequent post - Similar to a cond_wait() • If the value of the semaphore (after the decrement) is positive or zero, return right away
93
sem_post(sem_t *s) | (2)...
* Increments the value of the semaphore by 1 | * If there is any threads waiting on the semaphore, wake one of them up
94
When negative, the value of the semaphore is __.
the number of threads waiting on the semaphore
95
Semaphores can be used to provide mutual exclusion:
sem_t m;
sem_init(&m, 0, X); // initialize semaphore to X; what should X be?

sem_wait(&m);
// critical section here
sem_post(&m);

What should the semaphore above be initialized to? This is known as a __.
• The semaphore should be initialized to 1; binary semaphore • Works the same as a lock
96
The Producer/Consumer (Bounded-Buffer) Problem: | (2)...
• This works for a single producer and consumer • What if we have multiple producers or consumers? - We have a race condition - Need to add mutual exclusion for the calls to put() and get()
97
Reader-Writer Locks: Imagine a number of concurrent list operations, including inserts and simple lookups: 1. insert: (2)... 2. lookup: (2) ...
1. insert • Change the state of the list • A traditional critical section makes sense 2. lookup • Simply read the data structure • As long as we can guarantee that no insert is on-going, we can allow many lookups to proceed concurrently This special type of lock is known as a reader-writer lock
98
A Reader-Writer Locks: | Only a __ can acquire the lock
single writer
99
Once a reader has acquired a read lock: | (2)...
* More readers will be allowed to acquire the read lock too | * A writer will have to wait until all readers are finished
100
A Reader-Writer Locks: What about fairness?: (2)...
* It would be relatively easy for readers to starve writers | * A more sophisticated scheme could prevent this
101
The Dining Philosophers: Assume there are five “philosophers” sitting around a table (3)... Key challenges: (3)...
1. • Between each pair of philosophers is a single fork (five total) • The philosophers each have times where they think, and don’t need any forks, and times where they eat • In order to eat, a philosopher needs two forks, both the one on their left and the one on their right 2. Key challenges: • There is no deadlock • No philosopher starves and never gets to eat • Concurrency is high
102
* Philosopher p wishes to refer to the fork on their left → call __ * Philosopher p wishes to refer to the fork on their right → call __
left(p); right(p)
103
We need some semaphores, one for each fork: sem_t forks[5];
```
void getforks() {
    sem_wait(&forks[left(p)]);
    sem_wait(&forks[right(p)]);
}

void putforks() {
    sem_post(&forks[left(p)]);
    sem_post(&forks[right(p)]);
}
```
Deadlock occurs: (2)...
1. If each philosopher happens to grab the fork on their left before any philosopher can grab the fork on their right 2. Each will be stuck holding one fork and waiting for another, forever
104
A Solution: Breaking The Dependency: Change how forks are acquired: • Let’s assume that philosopher 4 acquires the forks in a different order
```
void getforks() {
    if (p == 4) {
        sem_wait(&forks[right(p)]);
        sem_wait(&forks[left(p)]);
    } else {
        sem_wait(&forks[left(p)]);
        sem_wait(&forks[right(p)]);
    }
}
```
(2)...
* There is no situation where each philosopher grabs one fork and is stuck waiting for another * The cycle of waiting is broken
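A runnable sketch of the full solution (POSIX semaphores assumed; the meal count and the name `run_philosophers` are illustrative). It completes only because philosopher 4 breaks the cycle of waiting:

```c
#include <pthread.h>
#include <semaphore.h>

#define N 5
#define MEALS 100

static sem_t forks[N];
static int eaten[N];

static int left(int p)  { return p; }
static int right(int p) { return (p + 1) % N; }

static void getforks(int p) {
    if (p == N - 1) {                 // philosopher 4: right fork first
        sem_wait(&forks[right(p)]);
        sem_wait(&forks[left(p)]);
    } else {                          // everyone else: left fork first
        sem_wait(&forks[left(p)]);
        sem_wait(&forks[right(p)]);
    }
}

static void putforks(int p) {
    sem_post(&forks[left(p)]);
    sem_post(&forks[right(p)]);
}

static void *philosopher(void *arg) {
    int p = (int)(long)arg;
    for (int i = 0; i < MEALS; i++) {
        getforks(p);
        eaten[p]++;                   // "eat" while holding both forks
        putforks(p);
    }
    return NULL;
}

// Returns total meals eaten; only returns if no deadlock occurs.
int run_philosophers(void) {
    pthread_t t[N];
    for (int i = 0; i < N; i++) sem_init(&forks[i], 0, 1);
    for (int i = 0; i < N; i++)
        pthread_create(&t[i], NULL, philosopher, (void *)(long)i);
    for (int i = 0; i < N; i++) pthread_join(t[i], NULL);
    int total = 0;
    for (int i = 0; i < N; i++) total += eaten[i];
    return total;
}
```

With the naive all-left-first acquisition this program could hang forever; with the asymmetric order it always finishes.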
105
Thread throttling: | Used to prevent __.
“too many” threads from doing something all at once
106
Thread throttling: | Limit __.
the number of concurrent threads with a threshold semaphore • Throttling, a form of admission control --- Example: • Hundreds of threads solving a parallel problem • One area of the code is memory-intensive • If all threads are allowed into this area, machine will start swapping and thrashing Solution: • Add a semaphore initialized to the maximum number of threads allowed in the memory-intensive area • Put a sem_wait() and sem_post() around the memory-intensive area
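The throttling pattern can be sketched as follows (POSIX semaphores assumed; the threshold, thread count, and the instrumentation that records peak concurrency are illustrative additions, not part of the card):

```c
#include <pthread.h>
#include <semaphore.h>

#define NTHREADS 10
#define THRESHOLD 3   // at most 3 threads in the memory-intensive region

static sem_t throttle;
static pthread_mutex_t statlock = PTHREAD_MUTEX_INITIALIZER;
static int inside = 0, max_inside = 0;

static void *throttled_worker(void *arg) {
    (void)arg;
    sem_wait(&throttle);               // admission control: wait for a slot
    pthread_mutex_lock(&statlock);
    if (++inside > max_inside) max_inside = inside;   // record peak
    pthread_mutex_unlock(&statlock);

    /* ... memory-intensive work would go here ... */

    pthread_mutex_lock(&statlock);
    inside--;
    pthread_mutex_unlock(&statlock);
    sem_post(&throttle);               // free the slot for another thread
    return NULL;
}

// Returns the observed maximum concurrency inside the throttled region.
int run_throttle_demo(void) {
    pthread_t t[NTHREADS];
    sem_init(&throttle, 0, THRESHOLD); // init to max threads allowed inside
    for (int i = 0; i < NTHREADS; i++)
        pthread_create(&t[i], NULL, throttled_worker, NULL);
    for (int i = 0; i < NTHREADS; i++)
        pthread_join(t[i], NULL);
    return max_inside;
}
```

However the threads interleave, the semaphore guarantees that no more than THRESHOLD of them are ever in the region at once.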
107
Build our own version of semaphores called Zemaphores: | ...
Doesn't maintain the invariant that a negative value is a count of threads waiting on the semaphore • The value is never lower than zero • This behavior is easier to implement and matches the current Linux implementation
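One way to build the Zemaphore from a lock and a condition variable (a sketch in the usual pthreads style; note the `while` loop and the fact that `value` is only decremented once it is positive, so it never goes below zero):

```c
#include <pthread.h>

typedef struct {
    int value;
    pthread_cond_t cond;
    pthread_mutex_t lock;
} Zem_t;

void Zem_init(Zem_t *s, int value) {
    s->value = value;
    pthread_cond_init(&s->cond, NULL);
    pthread_mutex_init(&s->lock, NULL);
}

void Zem_wait(Zem_t *s) {
    pthread_mutex_lock(&s->lock);
    while (s->value <= 0)                  // sleep until value is positive;
        pthread_cond_wait(&s->cond, &s->lock);  // value never goes negative
    s->value--;
    pthread_mutex_unlock(&s->lock);
}

void Zem_post(Zem_t *s) {
    pthread_mutex_lock(&s->lock);
    s->value++;
    pthread_cond_signal(&s->cond);         // wake one waiter, if any
    pthread_mutex_unlock(&s->lock);
}
```

Because waiters block in the condition variable rather than driving `value` negative, the value no longer counts waiting threads, matching the card's description.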
108
We need to synchronize for correctness: | (2)...
* Unsynchronized code can cause incorrect behavior | * But too much synchronization means threads spend a lot of time waiting, not performing useful work
109
Getting synchronization right is hard: | (2)...
* Testing isn’t enough | * Need to assume worst case: all interleavings are possible
110
How to choose between locks, semaphores and condition variables?:
1. Locks are very simple and suitable for many cases • Issues: Maybe not the most efficient solution • E.g., can’t allow multiple readers but one writer inside a standard lock 2. Condition variables allow threads to sleep until an event occurs • Just remember the state of the world might have changed since the signal was called 3. Semaphores provide pretty general functionality • But can be tricky to get correct
111
Common Concurrency Problems: 1. We’ve briefly talked about deadlock (2) ...
* Lots of early research focused on this | * We’ll dive in a bit more deeply today
112
What Types Of Bugs Exist?: Focus on four major open-source applications (4)...
MySQL, Apache, Mozilla, OpenOffice
113
Non-deadlock bugs make up __.
a majority of concurrency bugs
114
Two major types of non-deadlock bugs: | (2)...
* Atomicity violation | * Order violation
115
Atomicity-Violation Bugs: The desired serializability among multiple memory accesses is __. Solution: __.
violated; Simply add locks around the shared-variable references --- • Simple Example found in MySQL: • Two different threads access the field proc_info in the struct thd
116
Order-Violation Bugs: The desired order between two memory accesses is __. Solution: __.
flipped; enforce ordering using condition variables --- • I.e., A should always be executed before B, but the order is not enforced during execution • Example: - The code in Thread2 seems to assume that the variable mThread has already been initialized (and is not NULL)
```
Thread1::
void init() {
    mThread = PR_CreateThread(mMain, …);
}

Thread2::
void mMain(…) {
    mState = mThread->State;
}
```
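A sketch of the condition-variable fix in plain pthreads (the original example uses NSPR's PR_CreateThread; the pthreads adaptation, the `mtInit` flag, and the name `run_order_demo` are assumptions for illustration). Thread 2 waits until Thread 1 has performed the initialization:

```c
#include <pthread.h>
#include <stddef.h>

// Shared state mirroring the card's example: the worker must not
// read mThread before init() has set it.
static pthread_t mThread;
static int mtInit = 0;   // has mThread been initialized?
static pthread_mutex_t mtLock = PTHREAD_MUTEX_INITIALIZER;
static pthread_cond_t  mtCond = PTHREAD_COND_INITIALIZER;
static int mState = 0;

static void *mMain(void *arg) {
    (void)arg;
    pthread_mutex_lock(&mtLock);
    while (mtInit == 0)                // enforce the order: wait for init()
        pthread_cond_wait(&mtCond, &mtLock);
    pthread_mutex_unlock(&mtLock);
    mState = 1;                        // safe: mThread is initialized now
    return NULL;
}

// Starts the worker, then performs the initialization it depends on.
// Returns the state the worker observed.
int run_order_demo(void) {
    pthread_t t;
    pthread_create(&t, NULL, mMain, NULL);
    pthread_mutex_lock(&mtLock);
    mThread = pthread_self();          // the write that must happen first
    mtInit = 1;
    pthread_cond_signal(&mtCond);
    pthread_mutex_unlock(&mtLock);
    pthread_join(t, NULL);
    return mState;
}
```

Even if the worker thread runs first, it sleeps on the condition variable until the initialization has happened, so the A-before-B order always holds.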
117
Deadlock Bugs: The presence of a cycle in a resource-allocation graph: (2)...
* Thread1 is holding a lock L1 and waiting for another one, L2. * Thread2 that holds lock L2 is waiting for L1 to be released.
118
Why Do Deadlocks Occur? | (2)...
1. Reason 1: • In large code bases, complex dependencies arise between components 2. Reason 2: • Due to the nature of encapsulation - Hide details of implementations and make software easier to build in a modular way - Such modularity does not mesh well with locking
119
Why Do Deadlocks Occur? Example: Java Vector class and the method AddAll():
```
Vector v1, v2;
v1.AddAll(v2);
```
Locks for both the vector being added to (v1) and the parameter (v2) need to be acquired: (2)...
* The routine acquires said locks in some arbitrary order (v1 then v2) * If some other thread calls v2.AddAll(v1) at nearly the same time → We have the potential for deadlock
120
Conditional for Deadlock: Four conditions need to hold for a deadlock to occur: (4)...
1. Mutual Exclusion: Threads claim exclusive control of resources that they require. 2. Hold-and-wait: Threads hold resources allocated to them while waiting for additional resources. 3. No preemption: Resources cannot be forcibly removed from threads that are holding them. 4. Circular wait: There exists a circular chain of threads such that each thread holds one or more resources that are being requested by the next thread in the chain. If any of these four conditions are not met, deadlock cannot occur.
121
Deadlock vs. Starvation: | (3)...
• Deadlock: A circular waiting for resources • Starvation: A thread never makes progress because other threads are using resources it needs • Starvation != Deadlock - Deadlock can be seen as a special case of starvation
122
Methods for Handling Deadlocks: Ensure that the system will never enter a deadlock state: (2)...
* Deadlock prevention – deadlock is not possible in the system * Deadlock avoidance – prevent a particular instance of deadlock from happening
123
Allow the system to enter a deadlock state and then __.
recover
124
Methods for Handling Deadlocks: • Ensure that the system will never enter a deadlock state: - Deadlock prevention – deadlock is not possible in the system - Deadlock avoidance – prevent a particular instance of deadlock from happening • Allow the system to enter a deadlock state and then recover • Ignore the problem and __.
pretend that deadlocks never occur in the system | - Used by most operating systems, including UNIX
125
Deadlock Prevention: | Restrain the ways requests can be made to make __.
at least one of the four deadlock conditions does not hold
126
Mutual Exclusion – not required for __.
sharable resources (e.g., read-only files); must hold for non-sharable resources
127
Hold and Wait – must guarantee that __.
whenever a process requests a resource, it does not hold any other resources
128
Hold and Wait – must guarantee that whenever a process requests a resource, it does not hold any other resources (2)...
* Require process to request and be allocated all its resources before it begins execution, or allow process to request resources only when the process has none allocated to it. * Low resource utilization; starvation possible
129
No Preemption: | (3)...
* If a process that is holding some resources requests another resource that cannot be immediately allocated to it, then all resources currently being held are released * Preempted resources are added to the list of resources for which the process is waiting * Process will be restarted only when it can regain its old resources, as well as the new ones that it is requesting
130
Circular Wait
impose a total ordering of all resource types, and require that each process requests resources in an increasing order of enumeration
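A sketch of the total-ordering idea for locks (pthreads assumed; ordering by address is a common convention, and the helper names `lock_pair`/`unlock_pair`/`same_first_lock` are illustrative). Every thread acquires any pair of locks in the same global order, so no cycle of waiting can form:

```c
#include <pthread.h>
#include <stdint.h>

static pthread_mutex_t A = PTHREAD_MUTEX_INITIALIZER;
static pthread_mutex_t B = PTHREAD_MUTEX_INITIALIZER;

// Acquire two mutexes in a fixed total order (here: by address).
// Returns the mutex that was locked first, for illustration.
pthread_mutex_t *lock_pair(pthread_mutex_t *a, pthread_mutex_t *b) {
    pthread_mutex_t *first  = ((uintptr_t)a < (uintptr_t)b) ? a : b;
    pthread_mutex_t *second = ((uintptr_t)a < (uintptr_t)b) ? b : a;
    pthread_mutex_lock(first);
    pthread_mutex_lock(second);
    return first;
}

void unlock_pair(pthread_mutex_t *a, pthread_mutex_t *b) {
    pthread_mutex_unlock(a);
    pthread_mutex_unlock(b);
}

// Returns 1 if both argument orders acquire the same mutex first,
// i.e., the ordering is independent of how callers pass the locks.
int same_first_lock(void) {
    pthread_mutex_t *f1 = lock_pair(&A, &B);
    unlock_pair(&A, &B);
    pthread_mutex_t *f2 = lock_pair(&B, &A);
    unlock_pair(&A, &B);
    return f1 == f2;
}
```

This is exactly what the Vector AddAll() example above lacks: `v1.AddAll(v2)` and `v2.AddAll(v1)` acquire the same two locks in opposite orders.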
131
Deadlock Avoidance: | Requires that the system has some additional __.
a priori information available
132
Deadlock Avoidance: | Simplest and most useful model requires that each process declares __.
the maximum number of resources of each type that it may need
133
The deadlock-avoidance algorithm dynamically examines the __.
resource-allocation state to ensure that there can never be a circular-wait condition
134
Resource-allocation state is defined by __.
the number of available and allocated resources, and the maximum demands of the processes
135
Deadlock Avoidance: | (4)...
1. Requires that the system has some additional a priori information available 2. Simplest and most useful model requires that each process declare the maximum number of resources of each type that it may need 3. The deadlock-avoidance algorithm dynamically examines the resource-allocation state to ensure that there can never be a circular-wait condition 4. Resource-allocation state is defined by the number of available and allocated resources, and the maximum demands of the processes
136
When a process requests an available resource, system must decide if __.
immediate allocation leaves the system in a safe state
137
System is in safe state if there exists a __.
safe sequence <P1, P2, …, Pn> of ALL the processes in the system such that for each Pi, the resources that Pi can still request can be satisfied by currently available resources + resources held by all the Pj, with j < i --- That is: • If Pi's resource needs are not immediately available, then Pi can wait until all Pj have finished • When Pj is finished, Pi can obtain needed resources, execute, return allocated resources, and terminate • When Pi terminates, Pi+1 can obtain its needed resources, and so on
138
Basic Facts: • If a system is in safe state ⇒ __ • If a system is in unsafe state ⇒ __ • Avoidance ⇒ __
no deadlocks; possibility of deadlock; ensure that a system will never enter an unsafe state.
139
Avoidance Algorithms: Single instance of a resource type (2)...
* Use a resource-allocation graph | * Check for cycles
140
Avoidance Algorithms: Multiple instances of a resource type: Use the __.
banker's algorithm
141
Resource-Allocation Graph: • We have a set of vertices V and a set of edges E • V is partitioned into two types: (2)... • request edge – __ • assignment edge – __
- P = {P1 , P2 , …, Pn }, the set consisting of all the processes in the system - R = {R1 , R2 , …, Rm}, the set consisting of all resource types in the system; directed edge Pi → Rj; directed edge Rj → Pi
142
Basic Facts: • If graph contains no cycles ⇒ __ • If graph contains a cycle ⇒ (2)...
no deadlock; - if only one instance per resource type, then deadlock - if several instances per resource type, possibility of deadlock
143
Avoidance Algorithms: • Single instance of a resource type: (2)... • Multiple instances of a resource type: (1)...
- Use a resource-allocation graph - Check for cycles; - Use the banker's algorithm
144
Banker's Algorithm: | (4)...
* Have multiple instances of resources * Each process must a priori claim maximum resource use (not to exceed total resources in the system) * When a process requests a resource it may have to wait * When a process gets all its resources it must return them in a finite amount of time
145
Data Structures for the Banker’s Algorithm: Let n = number of processes, and m = number of resources 1. Available 2. Max 3. Allocation 4. Need
n = number of processes, and m = number of resources • Available – Resources currently available in the system - Vector of length m - If Available [j] = k, there are k instances of resource type Rj available • Max – Maximum resources processes may request in the system - n x m matrix - If Max [i,j] = k, then process Pi may request at most k instances of resource type Rj • Allocation – Resources allocated to processes - n x m matrix - If Allocation[i,j] = k then Pi is currently allocated k instances of Rj • Need – Resources currently needed by processes - n x m matrix - If Need[i,j] = k, then Pi may need k more instances of Rj to complete its task - Need [i,j] = Max[i,j] – Allocation [i,j]
146
Safety Algorithm: | (4)...
```
1. Let Work and Finish be vectors of length m and n, respectively.
   Initialize:
   Work = Available
   Finish[i] = false for i = 0, 1, …, n-1
2. Find an i such that both:
   (a) Finish[i] = false
   (b) Needi ≤ Work
   If no such i exists, go to step 4
3. Work = Work + Allocationi
   Finish[i] = true
   Go to step 2
4. If Finish[i] == true for all i, then the system is in a safe state
```
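The safety algorithm translates directly into C. This sketch fixes the process and resource counts as compile-time constants and bakes in the classic 5-process/3-resource example state for the demo wrappers (the dataset and function names are illustrative, not from the card):

```c
#define NPROC 5
#define NRES  3

// Safety algorithm: returns 1 if the state is safe, 0 otherwise.
int is_safe(int available[NRES],
            int allocation[NPROC][NRES],
            int need[NPROC][NRES]) {
    int work[NRES], finish[NPROC] = {0};
    for (int j = 0; j < NRES; j++)
        work[j] = available[j];                  // Work = Available
    for (;;) {
        int progressed = 0;
        for (int i = 0; i < NPROC; i++) {        // find i: !Finish[i], Need_i <= Work
            if (finish[i]) continue;
            int ok = 1;
            for (int j = 0; j < NRES; j++)
                if (need[i][j] > work[j]) { ok = 0; break; }
            if (ok) {
                for (int j = 0; j < NRES; j++)
                    work[j] += allocation[i][j]; // P_i finishes, releases resources
                finish[i] = 1;
                progressed = 1;
            }
        }
        if (!progressed) break;                  // no such i exists
    }
    for (int i = 0; i < NPROC; i++)
        if (!finish[i]) return 0;                // some process can never finish
    return 1;                                    // all finished: safe state
}

// Classic textbook state: safe (a safe sequence such as P1,P3,P0,P2,P4 exists).
int demo_safe_state(void) {
    int avail[NRES] = {3, 3, 2};
    int alloc[NPROC][NRES] = {{0,1,0},{2,0,0},{3,0,2},{2,1,1},{0,0,2}};
    int need[NPROC][NRES]  = {{7,4,3},{1,2,2},{6,0,0},{0,1,1},{4,3,1}};
    return is_safe(avail, alloc, need);
}

// Same allocations but nothing available: no process can proceed.
int demo_unsafe_state(void) {
    int avail[NRES] = {0, 0, 0};
    int alloc[NPROC][NRES] = {{0,1,0},{2,0,0},{3,0,2},{2,1,1},{0,0,2}};
    int need[NPROC][NRES]  = {{7,4,3},{1,2,2},{6,0,0},{0,1,1},{4,3,1}};
    return is_safe(avail, alloc, need);
}
```

The outer loop runs at most n times and each pass scans n processes over m resources, giving the usual O(m × n²) bound for this algorithm.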
147
Resource-Request Algorithm for Process Pi: Requesti = request vector for process Pi If Requesti[j] = k then process Pi wants k instances of resource type Rj: (3)...
1. If Requesti ≤ Needi go to step 2. Otherwise, raise error condition, since process has exceeded its maximum claim 2. If Requesti ≤ Available, go to step 3. Otherwise, Pi must wait since resources are not available 3. Pretend to allocate requested resources to Pi by modifying the state as follows: Available = Available – Requesti; Allocationi = Allocationi + Requesti; Needi = Needi – Requesti; • If safe ⇒ the resources are allocated to Pi • If unsafe ⇒ Pi must wait, and the old resource-allocation state is restored
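The three steps can be sketched for a single process's row of the state (a simplification: the safety check is passed in as a callback, standing in for the full safety algorithm over the tentative state; passing NULL treats the tentative state as safe. Names and the NRES constant are illustrative):

```c
#include <stddef.h>

#define NRES 3

// Resource-request algorithm for one process i.
// Returns -1 on error (request exceeds maximum claim),
//          0 if the process must wait,
//          1 if the request is granted.
int request_resources(int request[NRES], int need_i[NRES],
                      int available[NRES], int alloc_i[NRES],
                      int (*safe)(void)) {
    for (int j = 0; j < NRES; j++)                // step 1: Request_i <= Need_i?
        if (request[j] > need_i[j]) return -1;    // exceeded maximum claim
    for (int j = 0; j < NRES; j++)                // step 2: Request_i <= Available?
        if (request[j] > available[j]) return 0;  // must wait
    for (int j = 0; j < NRES; j++) {              // step 3: pretend to allocate
        available[j] -= request[j];
        alloc_i[j]   += request[j];
        need_i[j]    -= request[j];
    }
    if (safe == NULL || safe()) return 1;         // safe: grant the request
    for (int j = 0; j < NRES; j++) {              // unsafe: restore old state
        available[j] += request[j];
        alloc_i[j]   -= request[j];
        need_i[j]    += request[j];
    }
    return 0;                                     // must wait
}
```

The tentative-allocate-then-check-then-maybe-rollback shape is the essence of avoidance: the request is granted only if the resulting state would still be safe.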
148
Deadlock Detection: | (3)...
• Allow system to enter deadlock state • Detect that deadlock has occurred - Detection algorithm • Recover from deadlock - Recovery scheme
149
Single Instance of Each Resource Type: Maintain wait-for graph: (2)...
* Nodes are processes | * Pi → Pj if Pi is waiting for Pj
150
Single Instance of Each Resource Type: | Periodically invoke an algorithm that searches for...
a cycle in the graph | • If there is a cycle, deadlock exists
151
Single Instance of Each Resource Type: | • An algorithm to detect a cycle in a graph requires an order of __ operations, where n is __.
n^2; | the number of vertices in the graph
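Cycle detection in a wait-for graph can be sketched with a depth-first search over an adjacency matrix (the node count and function names are illustrative; DFS is O(V + E), which is within the O(n²) bound stated above since E ≤ n²):

```c
#define NP 5   // number of processes (graph nodes)

// waitfor[i][j] == 1 means process i is waiting for process j.
// color: 0 = unvisited, 1 = on the current DFS path, 2 = fully explored.
static int dfs(int waitfor[NP][NP], int u, int color[NP]) {
    color[u] = 1;
    for (int v = 0; v < NP; v++) {
        if (!waitfor[u][v]) continue;
        if (color[v] == 1) return 1;   // back edge: cycle found -> deadlock
        if (color[v] == 0 && dfs(waitfor, v, color)) return 1;
    }
    color[u] = 2;
    return 0;
}

// Returns 1 if the wait-for graph contains a cycle (deadlock), else 0.
int has_deadlock(int waitfor[NP][NP]) {
    int color[NP] = {0};
    for (int i = 0; i < NP; i++)
        if (color[i] == 0 && dfs(waitfor, i, color))
            return 1;
    return 0;
}
```

An edge back to a node still on the current search path is exactly a cycle of processes each waiting on the next.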
152
Recovery from Deadlock: Process Termination: | (2)...
* Abort all deadlocked processes | * Abort one process at a time until the deadlock cycle is eliminated
153
Recovery from Deadlock: Resource Preemption: | (3)...
* Selecting a victim – minimize cost * Rollback – return to some safe state, restart process for that state * Starvation – same process may always be picked as victim, include number of rollbacks in cost factor