Accelerators and GPUs Flashcards

2
Q

Why do CPUs need to balance multiple factors in their design?

A

CPUs must deliver acceptable performance for a wide range of applications while balancing functionality, performance, energy efficiency, and cost.

3
Q

What is the purpose of an accelerator in computing?

A

An accelerator works alongside a CPU to provide increased performance for specific workloads, with different design tradeoffs.

4
Q

Give an example of an early accelerator used alongside CPUs.

A

x87 floating point co-processors were used to accelerate floating point operations before floating point units were integrated into the main processor.

5
Q

What were GPUs originally designed for?

A

GPUs were specialised for image generation, requiring many matrix-vector operations.

6
Q

Why are GPUs highly parallel?

A

They contain a large number of floating point units and support a large number of processing threads.

7
Q

How does GPU memory differ from CPU memory?

A

GPUs use specialised graphics memory (such as GDDR or HBM) that provides much higher bandwidth than typical CPU DRAM.

8
Q

What was a key hardware evolution in GPUs?

A

GPUs evolved from fixed-function rendering pipelines to programmable unified shaders and double-precision arithmetic.

9
Q

Why are GPUs useful as accelerators in HPC?

A

They offer good floating point performance and high memory bandwidth, making them suitable for computationally expensive tasks.

10
Q

Why must some operations still be handled by a CPU in a GPU-accelerated system?

A

The CPU is responsible for tasks such as running the operating system and handling input/output operations.

11
Q

How are GPUs typically connected to CPUs?

A

Via the PCI Express (PCI-e) interface.

12
Q

What is a drawback of PCI-e connectivity for GPUs?

A

It has relatively high latency, which can impact performance.

13
Q

How can the performance impact of PCI-e latency be mitigated?

A

By minimising the transfer of data between the host CPU and the GPU.

14
Q

What technology provides better GPU connectivity than PCI-e? What makes it better?

A

NVIDIA’s NVLink technology, which offers a better data rate.

15
Q

Why is energy efficiency important in HPC system design?

A

Power consumption and cooling requirements significantly impact overall system performance and cost.

16
Q

What is the Green500 list?

A

A companion to the Top500 list that ranks HPC systems by energy efficiency (GFlops/Watt).

17
Q

What is the exascale era in computing?

A

The era of building exascale supercomputers, which perform at least one exaflop (10^18 floating point operations per second).

18
Q

Name the first exascale supercomputer.

A

Frontier (AMD EPYC CPUs and AMD Instinct GPUs).

19
Q

Why is hardware diversity increasing in HPC?

A

Companies beyond Intel and NVIDIA, such as AMD and ARM, are entering the market with competitive CPUs and GPUs.

20
Q

Why is portability an important consideration in modern HPC?

A

The increasing variety of CPU and GPU architectures means software must be able to run across different hardware platforms.

21
Q

What is the NVIDIA Pascal architecture?

A

A GPU architecture designed for HPC applications.

22
Q

How many Streaming Multiprocessors (SMs) can a Pascal GPU have?

A

Up to 60 SMs, though some may be disabled due to manufacturing defects.

23
Q

What is a Graphics Processing Cluster (GPC)?

A

A block of 10 Streaming Multiprocessors, functioning like an independent GPU within a Pascal GPU.

24
Q

How much high-bandwidth memory does a Pascal GPU have?

A

16GB of HBM2.
25
Q

What is the memory bandwidth of a Pascal GPU?

A

720GB/s.

26
Q

What is the function of the "GigaThreadEngine" in Pascal GPUs?

A

It schedules threads and handles context switching.

27
Q

How does a Pascal GPU organise workloads?

A

Workloads are divided into thread blocks (up to 1024 threads), which are further subdivided into warps of 32 threads.
28
Q

Why must all threads in a warp execute the same instruction?

A

Warps follow a Single Instruction Multiple Data (SIMD) model.

29
Q

Why must data movement be considered in GPU programming?

A

The CPU and GPU have separate memory, and transferring data between them can be slow.

30
Q

Why should small tasks remain on the CPU instead of being offloaded to the GPU?

A

The overhead of transferring data to the GPU can outweigh the benefits of parallel execution for small tasks.

31
Q

How do GPUs hide memory latency?

A

By oversubscribing the cores with many more threads than can run at once, so other warps can execute while some wait on memory.
32
Q

Why is branching inefficient in GPU programming?

A

GPUs are optimised for data-parallel workloads, and branch divergence within a warp can cause performance degradation.

33
Q

What is Single Instruction Multiple Data (SIMD)?

A

A parallel processing model where each thread performs the same operation on different data items.
34
Q

Why do GPUs not have the same branch prediction as CPUs?

A

Their architecture prioritises parallel execution over complex control flow handling.

35
Q

Name some programming models used for GPU programming.

A

1. OpenMP
2. OpenCL
3. OpenACC
4. CUDA
5. HIP

36
Q

Which GPU programming model is specific to NVIDIA?

A

CUDA
37
Q

What is the `target` construct in OpenMP 4.0?

A

A directive that allows offloading computations to accelerators like GPUs.
38
Q

What does the `teams distribute` directive in OpenMP do?

A

It creates a league of teams on the accelerator and distributes loop iterations across them; combined with `parallel for`, the iterations are further shared among the threads within each team.

39
Q

What does the `map` clause do in OpenMP target regions?

A

It specifies which data should be transferred between CPU and GPU memory, and in which direction (`to`, `from`, or `tofrom`).
40
Q

Why is OpenMP useful for GPU programming?

A

It allows existing C, C++, and Fortran code to be parallelised with minimal changes.