Unit 7 Flashcards

1
Q

A computer system with at least two processors. This computer is in contrast to a uniprocessor, which has one, and is increasingly hard to find today.

A

Multiprocessor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Utilizing multiple processors by running independent programs simultaneously.

A

Task-level parallelism or process-level parallelism

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

A single program that runs on multiple processors simultaneously.

A

Parallel processing program

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

A set of computers connected over a local area network that function as a single large multiprocessor.

A

Cluster

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

A microprocessor containing multiple processors (“cores”) in a single integrated circuit. Virtually all microprocessors today in desktops and servers are multicore.

A

Multicore microprocessor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

A parallel processor with a single physical address space.

A

Shared memory multiprocessor (SMP)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Speed-up achieved on a multiprocessor without increasing the size of the problem.

A

Strong scaling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Speed-up achieved on a multiprocessor while increasing the size of the problem proportionally to the increase in the number of processors.

A

Weak scaling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

A uniprocessor.

A

SISD or single instruction stream, single data stream

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

A multiprocessor.

A

MIMD or multiple instruction streams, multiple data streams

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

The conventional MIMD programming model, where a single program runs across all processors.

A

SPMD or single program, multiple data streams

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

The same instruction is applied to many data streams, as in a vector processor.

A

SIMD or single instruction stream, multiple data streams

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Parallelism achieved by performing the same operation on independent data.

A

Data-level parallelism

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

The basic philosophy of blank is to collect data elements from memory, put them in order into a large set of registers, operate on them sequentially in registers using pipelined execution units, and then write the results back to memory.

A

vector architecture

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

One or more vector functional units and a portion of the vector register file. Inspired by lanes on highways that increase traffic speed, multiple lanes execute vector operations simultaneously.

A

Vector lane

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Increasing utilization of a processor by switching to another thread when one thread is stalled.

A

Hardware multithreading

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

A thread includes the program counter, the register state, and the stack. It is a lightweight process; whereas threads commonly share a single address space, processes don’t.

A

Thread

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

A process includes one or more threads, the address space, and the operating system state. Hence, a process switch usually invokes the operating system, but not a thread switch.

A

Process

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

A version of hardware multithreading that implies switching between threads after every instruction.

A

Fine-grained multithreading

20
Q

A version of hardware multithreading that implies switching between threads only after significant events, such as a last-level cache miss.

A

Coarse-grained multithreading

21
Q

A version of multithreading that lowers the cost of multithreading by utilizing the resources needed for multiple issue, dynamically scheduled microarchitecture.

A

Simultaneous multithreading (SMT)

22
Q

A multiprocessor in which latency to any word in main memory is about the same no matter which processor requests the access.

A

Uniform memory access (UMA)

23
Q

A type of single address space multiprocessor in which some memory accesses are much faster than others depending on which processor asks for which word.

A

Nonuniform memory access (NUMA)

24
Q

The process of coordinating the behavior of two or more processes, which may be running on different processors.

A

Synchronization

25
Q

A synchronization device that allows access to data to only one processor at a time.

A

Lock

26
Q

A function that processes a data structure and returns a single value.

A

Reduction

27
Q

An API for shared memory multiprocessing in C, C++, or Fortran that runs on UNIX and Microsoft platforms. It includes compiler directives, a library, and runtime directives.

A

OpenMP

28
Q

Communicating between multiple processors by explicitly sending and receiving information.

A

Message passing

29
Q

A routine used by a processor in machines with private memories to pass a message to another processor.

A

Send message routine

30
Q

A routine used by a processor in machines with private memories to accept a message from another processor.

A

Receive message routine

31
Q

Collections of computers connected via I/O over standard network switches to form a message-passing multiprocessor.

A

Clusters

32
Q

Rather than selling software that is installed and run on customers’ own computers, software is run at a remote site and made available over the Internet typically via a Web interface to customers. SaaS customers are charged based on use versus on ownership.

A

Software as a service (SaaS)

33
Q

Informally, the peak transfer rate of a network; can refer to the speed of a single link or the collective transfer rate of all links in the network.

A

Network bandwidth

34
Q

The bandwidth between two equal parts of a multiprocessor. This measure is for a worst case split of the multiprocessor.

A

Bisection bandwidth

35
Q

A network that connects processor-memory nodes by supplying a dedicated communication link between every node.

A

Fully connected network

36
Q

A network that supplies a small switch at each node.

A

Multistage network

37
Q

A network that allows any node to communicate with any other node in one pass through the network.

A

Crossbar network

38
Q

A popular high-speed link today is which stands for Peripheral Component Interconnect Express. It is called a link in that the basic building block, called a serial lane, consists of only four wires: two for receiving data and two for transmitting data. T

A

PCIe,

39
Q

An I/O scheme in which portions of the address space are assigned to I/O devices, and reads and writes to those addresses are interpreted as commands to the I/O device.

A

Memory-mapped I/O

40
Q

A mechanism that provides a device controller with the ability to transfer data directly to or from the memory without involving the processor.

A

Direct memory access (DMA)

41
Q

An I/O scheme that employs interrupts to indicate to the processor that an I/O device needs attention.

A

Interrupt-driven I/O

42
Q

A program that controls an I/O device that is attached to the computer.

A

Device driver

43
Q

The process of periodically checking the status of an I/O device to determine the need to service the device.

A

Polling

44
Q

A UNIX API for creating and manipulating threads. It is structured as a library.

A

Pthreads

45
Q

The ratio of floating-point operations in a program to the number of data bytes accessed by a program from main memory.

A

Arithmetic intensity

46
Q

It was perhaps the most infamous of supercomputers. The project started in 1965 and ran its first real application in 1976. The 64 processors used a 13-MHz clock, and their combined main memory size was 1 MB: 64 × 16 KB. The blank was the first machine to teach us that software for parallel machines dominates hardware issues.

A

Illiac IV