3 Fundamentals Of Data Representation Flashcards

Question

What is included in a character set?

Answer 1

Alphanumeric characters e.g. letters, numbers, and symbols | Special characters e.g. new line

Answer 2

American Standard Code for Information Interchange | Unicode

Answer 3

American Standard Code for Information Interchange

Answer 4

American Standard Code for Information Interchange

Answer 5

Each character in ASCII is represented by a seven-bit binary code

Answer 6

There is a maximum amount of 128 characters that can be included in ASCII

Answer 7

ASCII includes all the commonly used letters and symbols in the English language

Answer 8

It is useful that ASCII is represented in 7-bits as the extra bit remaining can be used as a check digit in an 8-bit system

Answer 9

128 characters is perfectly fine for the English language but it does not leave space for characters from other languages An extended ASCII set was released which used all eight bits, but it was still not enough This led to the release of Unicode

Answer 10

The aim of Unicode is to represent every possible character in the world

Answer 11

The most common form of Unicode is UTF-8 which uses between 8 and 48 bit binary codes to represent each character

Answer 12

The first 128 digits of Unicode are identical to extended ASCII. This makes is backwards compatible with documents encoded using older character sets

Answer 13

Unicode represents all characters from all major alphabets of the world Unicode is also used to represent emojis

Answer 14

Letters Numbers Symbols

Answer 15

Each pixel of a bitmap image has a colour which is stored as a binary number

Answer 16

Colour-depth is the amount of bits used to store the colour of each pixel

Answer 17

The greater the number of bits used to represent each pixel, the more unique colours can be stored.

Answer 18

Common colour depths are 1-bit, 8-bit, 16-bit, and 24-bit.

Answer 19

Resolution represents the number of pixels in an image

Answer 20

Height of image × Width of an image = Resolution of image

Answer 21

1080p which is 1920×1080

Answer 22

``` Metadata is extra information that is added to an image file such as: The resolution The colour-depth The encoding format The time and date of taking the photo ```

Answer 23

All images are stored as binary

Answer 24

Start at the top left of the image, and work across the first row: Write a 0 if the pixel is black. Write a 1 if the pixel is white. Continue this process until the end of the image.

Answer 25

Each pixel is represented as one bit | 0 represents a black area and 1 represents a white area

Answer 26

Sound needs to be converted from analogue waves to a digital format

Answer 27

When sound is recorded by a computer its amplitude is recorded at regular intervals. The value of the amplitude at each sample is stored as a binary value. The number of bits used to store each sample is known as the sample size. The number of samples taken per second is known as the sampling rate.

Answer 28

Increasing the sampling rate will increase the quality of the audio. Increasing the sample size will also increase the quality of the audio. Unfortunately, increasing these make the file size larger.

Answer 29

The bit rate is the amount of data stored per second of audio.

Answer 30

bitrate = sample rate × sample size.

Answer 31

Compression helps reduce the size of files so we can store more data

Answer 32

Lossless compression is when none of the original data is lost. An algorithm can be used to perfectly restore the original file when needed. Lossless compression causes file size to reduce moderately.

Answer 33

Lossless compression especially useful for executable files, where all of the data is necessary.

Answer 34

An algorithm is applied to remove unnecessary detail from the original file. Some data is permanently lost, but enough remains so that the file is still useful and there is barely a noticeable difference. Lossy compression results in dramatic file size reduction.

Answer 35

RLE is a form of lossless compression that replaces repeating sequences of zeroes and ones with more efficient representations. Each repeating string will be replaced by a code which represents the character and the amount of times it is to be repeated.

Answer 36

Image compression is when pixels that are similar colours are grouped to create one average colour The RLE algorithm is then run on the new image The technique is lossy compression

Answer 37

Huffman coding is a lossless text compression algorithm which is most commonly used for long pieces of text data

Answer 38

Huffman coding works by assigning a fewer number of bits to the most frequently used characters

Answer 39

List the characters in ascending order of frequency, and write their frequency alongside them. Pair up the lowest frequency letters at the bottom of the tree: For each pair, join them to a higher node with a value of the combined frequency. Repeat this process for every node

Answer 40

Starting at the top, traverse the tree to a letter node: For each left branch, append a 0 to the letter’s binary string. For each right branch, append a 1 to the letter’s binary string. Use each letter’s reduced binary code to represent the original text

Answer 41

For each letter, begin at the top of the tree and use the binary string as a set of directions to reach the next letter: For each 0, go to the left. For each 1, go to the right. Once you reach a letter node, you can find the next letter by restarting this process from the top of the tree.

Answer 42

Long Data Test data Infrequently accessed data

3 Fundamentals Of Data Representation Flashcards

(68 cards)