P2L5 Thread Performance Considerations Flashcards
Which threading model is better? Boss/Worker or Pipeline?
It depends …
It depends on which metric is most important.
Total time or average time per order …
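The tradeoff can be made concrete with a worked example. All the numbers below are assumptions for illustration: 6 threads and 11 orders, where boss/worker uses 1 boss plus 5 workers and each order takes 120 ms end to end, while the pipeline splits the same 120 ms of work into 6 stages of 20 ms each.

```python
import math

# Assumed numbers for illustration: 6 threads, 11 orders.
# Boss/worker: 1 boss + 5 workers, each order takes 120 ms on one worker.
# Pipeline: 6 stages of 20 ms each (also 120 ms of work per order).
ORDERS, WORKERS, ORDER_MS, STAGES, STAGE_MS = 11, 5, 120, 6, 20

# Boss/worker: orders complete in batches of WORKERS.
bw_finish = [ORDER_MS * math.ceil(i / WORKERS) for i in range(1, ORDERS + 1)]

# Pipeline: the first order takes the full pipeline; each later order
# finishes one stage time after the previous one.
pl_finish = [STAGES * STAGE_MS + (i - 1) * STAGE_MS for i in range(1, ORDERS + 1)]

print("total   boss/worker:", max(bw_finish), "ms  pipeline:", max(pl_finish), "ms")
print("average boss/worker: %.1f ms  pipeline: %.1f ms"
      % (sum(bw_finish) / ORDERS, sum(pl_finish) / ORDERS))
```

With these numbers, the pipeline wins on total time (320 ms vs. 360 ms) while boss/worker wins on average time per order (about 196 ms vs. 220 ms), which is exactly why "which is better?" depends on the metric.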
Review: Give 4 ways threads are useful
- parallelization (speed up)
- specialization (hot cache)
- efficiency (lower memory usage, cheaper synchronization - read/write shared variables vs. IPC)
- on a single CPU, threads hide the latency of I/O
What is useful for:
- Matrix multiply application?
- Web service application?
- Hardware?
- execution time
- requests/sec and response time
- CPU utilization (time CPU is working)
Give 3 metrics important for both the toy shop and an OS. Give an example of each metric for the toy shop vs. the OS
- Throughput
- toys/hour
- process completion rate
- Response time
- avg. time to respond to order
- avg. time to respond to input (e.g. mouseclick)
- Utilization
- busy workbenches
- % CPU
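The three metrics above can be computed from a request log. The log below is hypothetical (the timestamps and busy time are made up for illustration), but the formulas are the standard ones: completions per unit time, mean of (completion - arrival), and busy time over window length.

```python
# Hypothetical request log: (arrival_s, completion_s) pairs over a
# 10-second observation window, during which the CPU was busy 6 seconds.
requests = [(0.0, 0.5), (1.0, 1.2), (2.0, 3.0), (4.0, 4.1), (7.0, 9.0)]
window_s, busy_s = 10.0, 6.0

throughput = len(requests) / window_s                              # completions per second
avg_response = sum(done - arrive for arrive, done in requests) / len(requests)
utilization = busy_s / window_s                                    # fraction of time the CPU is working

print(f"throughput={throughput} req/s, "
      f"avg response={avg_response:.2f} s, utilization={utilization:.0%}")
```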
Which of the following are performance metrics?
- performance/$
- performance/W (per watt)
- percentage of SLA violations
- client perceived performance
- aggregate performance
- platform efficiency
- throughput
- wait time
all of the above!
Define a ‘test bed’
Ideally, we will obtain metrics running real software on real machines with real workloads.
Often this is not feasible, for many different reasons. In those cases, we may have to settle for “toy” experiments that are representative of realistic situations we may encounter.
We refer to these experimental settings as a testbed. The testbed tells us where/how the experiments were carried out and which relevant metrics were gathered.
What is a metric?
a metric is some measurable quantity we can use to reason about the behavior of a system.
- Is ‘it depends’ a correct answer to “Are threads useful?”
- Is ‘it depends’ an accepted answer to “Are threads useful?”
- Yes
- No
The answer is: It depends! While this answer is almost always correct, it is rarely accepted. What is more important is to modify the question, extending it to include the context you wish to examine and the metrics you wish to obtain.
For example, some graph traversal algorithms work best on sparse graphs, while others work best on dense graphs. Some filesystems are optimized for read access, while others might be optimized for write-heavy workloads.
Refer to the diagram
- Which steps are computationally expensive?
- Which steps involve interaction with the network?
- Which steps involve interaction with the disk?
- parser step, header creation
- accepting a connection, sending data
- read/write file
State 1 advantage and 4 disadvantages of improving web server performance by making it multi-process
Advantage: simple programming - we already have a working, debugged process, so we just spawn multiple copies of it. What could be better?
Disadvantages:
- higher memory footprint, which can hurt performance.
- high cost of a context switch whenever we want to run a different process.
- hard/costly to maintain shared state across processes due to IPC constraints.
- difficult to have multiple processes listening on the same port.
State 2 advantages and 2 disadvantages of improving web server performance by multi-threading
Advantages:
- Cheaper context switch because shared address space.
- Lighter memory requirements because of shared information across all threads in the process.
Disadvantages:
- Software complexity: a multithreaded program requires explicit application-level synchronization code.
- Depends on underlying operating-system-level support for threads, although this is less of an issue now than it was in the past. (true at the time of the Solaris paper)
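Both sides of this tradeoff show up in a toy multithreaded counter (an illustrative sketch, not a web server): shared state is just an ordinary variable because the address space is shared, but the application must supply its own synchronization around it.

```python
import threading

# Threads share the process address space, so shared state is a plain
# variable -- but the application must add explicit synchronization
# (the "software complexity" cost). Counts are illustrative.
count = 0
lock = threading.Lock()

def worker(n):
    global count
    for _ in range(n):
        with lock:          # without this, increments can be lost
            count += 1

threads = [threading.Thread(target=worker, args=(1000,)) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
print(count)                # 4000
```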
Describe the event-driven model for an application
An event-driven application is implemented in a single address space with a single thread of control. The main part of the process is an event dispatcher that, in a loop, looks for incoming events and, based on those events, invokes one or more of the registered handlers.
How does the event-driven model support concurrency?
In the event-driven model, the processing of multiple requests is interleaved within a single execution context.
Why does the event-driven model with one thread work for concurrency?
Because there is no idle time: the single thread always has useful work to do, so context switching would just waste cycles that could have been used for request processing.
In the event driven model, a request will be processed exactly until a wait is necessary, at which point the execution context will switch to servicing another request.
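The dispatcher loop described above can be sketched with Python's `selectors` module. This is a minimal illustration, not a real server: pipes stand in for network connections, and the "requests" are made-up strings. One thread waits for whichever event is ready and invokes the handler registered for it.

```python
import os
import selectors

# Minimal event dispatcher sketch: one thread, one loop; each ready
# event invokes its registered handler. Pipes stand in for connections.
sel = selectors.DefaultSelector()
results = []

def handle_read(fd):
    data = os.read(fd, 1024)
    if data:
        results.append(data.decode())
    else:
        sel.unregister(fd)          # "client" done: stop watching this fd
        os.close(fd)

r1, w1 = os.pipe()
r2, w2 = os.pipe()
for r in (r1, r2):
    sel.register(r, selectors.EVENT_READ, handle_read)

os.write(w1, b"request-A"); os.close(w1)
os.write(w2, b"request-B"); os.close(w2)

while sel.get_map():                # dispatch loop: wait, then invoke handlers
    for key, _ in sel.select():
        key.data(key.fileobj)       # key.data is the registered handler

print(sorted(results))              # ['request-A', 'request-B']
```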
What about if we have multiple CPUs?
Note the gotcha
If we have multiple CPUs, the event-driven model still makes sense, especially when we have to service more concurrent requests than we have CPUs. Each CPU could host a single event-driven process, allowing multiple requests to be processed concurrently within that process.
This can be done with less overhead than if each CPU had to context switch among multiple processes or multiple threads.
Gotcha: It is important to have mechanisms that will steer the right set of events to the right CPU.