floating point form Flashcards

Question 1

Q

what is fixed point binary

Answer

A

when the numbers have a predetermined number of bits before and after the point
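
A minimal sketch of the definition above, assuming an unsigned value and a hypothetical helper name `fixed_point_to_decimal`. The position of the point is fixed by the `fractional_bits` argument, not stored in the number itself.

```python
def fixed_point_to_decimal(bits: str, fractional_bits: int) -> float:
    """Read an unsigned bit string as fixed point binary.

    `fractional_bits` is the predetermined number of bits after the point
    (an assumption of this sketch: the value is unsigned).
    """
    # Treat the bits as a whole number, then shift the point left.
    return int(bits, 2) / (2 ** fractional_bits)


# 4 bits before the point and 4 after: 0110.1100
print(fixed_point_to_decimal("01101100", 4))  # 6.75
```

The smallest step between two values here is always 2^-4 = 0.0625, whatever the size of the number.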

Question 2

Q

advantages of fixed point

Answer

A

- a fixed point system can represent some numbers more precisely than a floating point system
- calculations can be performed more quickly
- the gap between adjacent representable values is constant, so every number is stored to the same absolute precision

Question 3

Q

disadvantages of fixed point

Answer

A

for a given number of bits, a fixed point system may not cover the range of numbers required, or represent them to the accuracy required

Question 4

Q

what are floating point numbers

Answer

A

when the number is expressed in the form m × 2^n, where m is the mantissa and n is the exponent (in binary the base is 2, not 10)

Question 5

Q

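if the number is positive, what should the mantissa start with in floating point form

Answer

A

0, because the mantissa is stored in two's complement (01 if the number is normalised)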

Question 6

Q

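if the number is negative, what should the mantissa start with in floating point form

Answer

A

1, because the mantissa is stored in two's complement (10 if the number is normalised)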

Question 7

Q

what does it mean for a number to be normalised

Answer

A

the mantissa starts with 01 (positive number) or 10 (negative number), i.e. its first two bits are different
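
A rough sketch of the rule above, assuming a two's complement mantissa held as a bit string and an integer exponent. The names `is_normalised` and `normalise` are made up for illustration.

```python
def is_normalised(mantissa: str) -> bool:
    """A normalised mantissa starts with 01 (positive) or 10 (negative)."""
    return mantissa[:2] in ("01", "10")


def normalise(mantissa: str, exponent: int) -> tuple[str, int]:
    """Shift the mantissa left until its first two bits differ.

    Each left shift doubles the mantissa, so the exponent is reduced
    by 1 to keep the overall value the same.
    """
    if "1" not in mantissa:
        return mantissa, exponent  # zero has no normalised form
    while mantissa[0] == mantissa[1]:
        mantissa = mantissa[1:] + "0"
        exponent -= 1
    return mantissa, exponent


# 0.0011 x 2^4 becomes 0.11 x 2^2 (both equal 3)
print(normalise("00011000", 4))  # ('01100000', 2)
```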

Question 8

Q

what does the exponent do

Answer

A

scales the mantissa by a power of two

Question 9

Q

floating point formula

Answer

A

Floating point value = Mantissa × 2^Exponent
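
One way to apply the formula, as a sketch only. It assumes the mantissa and exponent are both two's complement bit strings and that the binary point sits straight after the first mantissa bit. The helper names are hypothetical.

```python
def twos_complement(bits: str) -> int:
    """Convert a two's complement bit string to a signed integer."""
    value = int(bits, 2)
    if bits[0] == "1":  # leading 1 means the value is negative
        value -= 1 << len(bits)
    return value


def decode_float(mantissa: str, exponent: str) -> float:
    """Return mantissa x 2^exponent.

    Assumption: the binary point comes after the first mantissa bit,
    so the mantissa is a fraction in the range -1 <= m < 1.
    """
    m = twos_complement(mantissa) / (1 << (len(mantissa) - 1))
    e = twos_complement(exponent)
    return m * 2.0 ** e


print(decode_float("01101000", "0011"))  # 0.8125 x 2^3  = 6.5
print(decode_float("10100000", "1111"))  # -0.75  x 2^-1 = -0.375
```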

Question 10

Q

why is normalisation important

Answer

A

- Maximises precision/accuracy for a given number of bits
- Unique representation of each number, so it is simpler to test numbers for equality

Question 11

Q

uses of fixed vs floating point

Answer

A

Fixed point: Good for applications where speed is important and the range of values is small (e.g., financial systems).

Floating point: Better for representing a wide range of values but more complex in terms of calculations (e.g., scientific calculations, graphics processing).

Question 12

Q

what is an absolute error

Answer

A

The difference between the exact value and the approximate value stored in the system.

Question 13

Q

what is a relative error

Answer

A

The absolute error divided by the exact value.

Question 14

Q

compare absolute and relative errors

Answer

A

For Large Magnitude Numbers:

Absolute errors tend to be larger, but relative errors may be small since the error is small relative to the large value.
Example: A small absolute error on a large value like 10^6 may result in a very small relative error.

For Small Magnitude Numbers:

Even a small absolute error can cause a large relative error. This can be problematic when dealing with numbers close to zero.
Example: If the exact value is 0.001 and the stored value is 0.0009, the absolute error is only 0.0001 but the relative error is 0.0001 / 0.001 = 10%, far larger than the same absolute error would give on a large number.
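
The comparison can be checked with a short sketch. The function names are illustrative, and the stored value 999,999.9999 is an assumed example chosen to give the same absolute error (about 0.0001) on a large number.

```python
def absolute_error(exact: float, stored: float) -> float:
    """Size of the difference between the exact and stored values."""
    return abs(exact - stored)


def relative_error(exact: float, stored: float) -> float:
    """Absolute error as a fraction of the exact value (exact must not be 0)."""
    return absolute_error(exact, stored) / abs(exact)


# Same absolute error of roughly 0.0001 in both cases
print(f"{relative_error(0.001, 0.0009):.0%}")            # 10%
print(f"{relative_error(1_000_000, 999_999.9999):.0e}")  # 1e-10
```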

Question 15

Q

what is a rounding error
how do you fix it

Answer

A

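occurs when there are not enough bits in the mantissa to represent the number exactly
the number is stored as the nearest representable value (rounding) or with the extra bits dropped (truncation)
the error can be reduced by using more bits in the mantissa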
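
A small sketch of the two options, assuming a non-negative number and a limited number of bits after the binary point. `quantise` is a made-up helper, not a library function.

```python
import math


def quantise(x: float, fractional_bits: int, mode: str = "round") -> float:
    """Store x using only `fractional_bits` bits after the binary point.

    Assumption: x is non-negative, so truncating is the same as rounding down.
    """
    scale = 2 ** fractional_bits
    if mode == "truncate":
        return math.floor(x * scale) / scale    # drop the extra bits
    return math.floor(x * scale + 0.5) / scale  # nearest representable value


# 0.1 cannot be stored exactly in 4 fractional bits
print(quantise(0.1, 4, "truncate"))  # 0.0625
print(quantise(0.1, 4, "round"))     # 0.125
```

Here rounding leaves an absolute error of 0.025, while truncating leaves 0.0375.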

Question 16

Q

how to represent numbers more precisely using the same number of bits in floating point

Answer

A

Move one or more bits from the exponent to the mantissa (this increases precision but reduces the range)
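
The trade-off can be sketched for a 12-bit format, assuming a two's complement mantissa and exponent. The function `describe` and the 8/4 and 10/2 splits are illustrative choices only.

```python
def describe(mantissa_bits: int, exponent_bits: int) -> dict:
    """Largest positive value and mantissa step size for a given bit split.

    Assumption: both fields are two's complement and the binary point
    follows the first mantissa bit.
    """
    max_exponent = 2 ** (exponent_bits - 1) - 1
    mantissa_step = 2 ** -(mantissa_bits - 1)  # gap between mantissa values
    max_mantissa = 1 - mantissa_step
    return {
        "largest_value": max_mantissa * 2.0 ** max_exponent,
        "mantissa_step": mantissa_step,
    }


print(describe(8, 4))   # largest_value 127.0, mantissa_step 0.0078125
print(describe(10, 2))  # largest_value 1.99609375, mantissa_step 0.001953125
```

Moving two bits into the mantissa makes the step four times finer, but the largest value drops from 127 to just under 2.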

Question 17

Q

Explain why the relative error is usually considered to be a more important measure of
error than the absolute error.

Answer

A

The impact of an error depends on its size relative to the number that is represented

Question 18

Q

advantages of floating vs fixed point

Answer

A

floating point:
- a floating point system can represent numbers with a greater range than a fixed point system
- can represent numbers much closer to zero (much smaller in magnitude)
- can represent much larger numbers

fixed point:
- a fixed point system can represent (some) numbers more precisely than a floating point system
- calculations can be performed more quickly

Question 19

Q

What is the difference between absolute error and relative error?

Answer

A

Absolute Error: The difference between the exact value and the computed value.
Relative Error: The absolute error divided by the exact value, often expressed as a percentage.