Floating Point Arithmetic Flashcards
How are integers represented in computing?
Integers can be represented exactly using a fixed number of bits.
What is the largest unsigned integer that can be represented with n bits?
2^n - 1
How are negative integers represented in computing?
Using two’s complement notation.
Why do we need floating point representation?
Many scientific and engineering calculations involve non-integer values that require fractional representation.
Give an example of a scientific value that requires floating point representation.
Mass of an electron: 9.109 × 10^−31 kg
What key feature makes floating point representation different from integer representation?
The radix point can "float", so a fixed number of significant digits can represent both very large and very small magnitudes.
What components make up a floating point number?
- Significand/Mantissa
- Exponent
- Base
What does the exponent do in floating point numbers?
It determines how much the significand is scaled by the base.
How would 9.109 × 10^−31 be represented in floating point?
- Significand: 9.109
- Base: 10
- Exponent: -31
What is IEEE 754?
The most widely adopted standard for floating point arithmetic.
What does IEEE 754 specify?
- Number representations (e.g., single precision, double precision)
- How operations like addition, subtraction, multiplication, and division behave
What are the two most commonly used floating point precisions in C?
- Single precision: 32-bit, float
- Double precision: 64-bit, double
Why is double precision preferred for scientific computing?
It provides greater accuracy and a wider range than single precision.
What happens if a number exceeds single precision limits?
It overflows and is represented as infinity (inf).
In what applications might single precision be sufficient?
Machine learning and graphics.
How many bits does double precision floating point use?
64 bits (8 bytes)
How are the 64 bits divided in double precision floating point?
- 52 bits for the mantissa
- 11 bits for the exponent
- 1 bit for the sign
How is a normalised floating point number represented in IEEE 754?
x = ±(1.b{1}b{2}…b{52})_2 × 2^(E − 1023)
where:
- b{1}, b{2}, …, b{52} are the mantissa bits
- E is the unsigned integer formed by the exponent bits a{1}, a{2}, …, a{11}
What is the smallest normalised double precision number?
2^(−1022) ≈ 10^(−308)
What is the largest normalised double precision number?
(2−2^(−52)) × 2^(1023) ≈ 10^(308)
Why are floating point numbers not always exact?
Because they have finite precision, leading to rounding errors.
What is machine epsilon?
The smallest difference between 1 and the next representable number, approximately: 2^(−52) ≈ 10^(−16)
Why is machine epsilon important?
It determines the smallest detectable rounding error in double precision calculations.
What are some cases where floating point arithmetic fails?
- 1.0/0.0 → Infinity (inf)
- 0.0/0.0 → Not a Number (NaN)
- sqrt(-1.0) → Not a Number (NaN)
- Operations that exceed the floating point range → overflow or underflow