midterm 1 - diff file types Flashcards
lect 7
How does a binary file store numbers vs a text file?
text file –> each digit is stored as an individual ASCII char –> 1 byte per digit
binary –> the whole number is stored as its binary representation
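a minimal Python sketch of the size difference (struct used here just to illustrate fixed-width binary storage):

import struct

n = 1234567890                      # a 10-digit number

as_text = str(n).encode("ascii")    # text file style: 1 ASCII byte per digit
print(len(as_text))                 # 10 bytes

as_binary = struct.pack("<i", n)    # binary style: whole number in one 32-bit int
print(len(as_binary))               # 4 bytes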
pro/con of binary number storage over txt files?
pros of binary storage: uses fewer bits per number because it doesn't spend 1 whole byte per digit
cons of binary storage: more complicated to separate individual number values (a text file can use an ASCII char such as a comma or newline to separate them)
how does a binary file know how to differentiate separate numbers
binary file fixes the # of bits used per number value –> therefore every X bits, the system knows to read the start of a new number
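a sketch of fixed-width reading (assuming a little-endian, 4-bytes-per-number layout for illustration):

import struct

raw = struct.pack("<3i", 10, -7, 42)   # 3 numbers x 4 bytes = 12 bytes on disk

# the reader knows every 4 bytes starts a new number, so no separator chars are needed
numbers = struct.unpack("<3i", raw)
print(numbers)                         # (10, -7, 42)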
binary file number limits explained
since binary files allocate X bits per value –> sets a max value that can be stored
typical limit is 32 bits per #
scientific notation (floating point) representation may be used for larger values –> but introduces limited precision
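worked example of the 32-bit limit (assuming unsigned vs two's-complement signed integers):

print(2 ** 32 - 1)    # 4294967295 -> largest unsigned 32-bit value
print(2 ** 31 - 1)    # 2147483647 -> largest signed 32-bit value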
example of binary number limits and their impact on systems
32-bit systems –> memory addresses are 32 bits –> 2^32 unique addressable bytes
2^32 bytes of RAM = 4 GB
therefore 32-bit systems can only address up to 4 GB of RAM
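the arithmetic behind the 4 GB figure:

addressable_bytes = 2 ** 32
print(addressable_bytes)                # 4294967296 bytes
print(addressable_bytes / (1024 ** 3))  # 4.0 -> 4 GiB of addressable RAM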
how is txt file encoded and interpreted
NO HEADER for file identification
every byte/code in the file represents a character
no single enforced txt encoding –> usually UTF-8 but variations exist –> leads to txt files being incorrectly read if the program assumes the wrong encoding
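a small Python sketch of a wrong encoding guess (mojibake):

text = "café"
data = text.encode("utf-8")     # "é" is stored as two bytes: 0xC3 0xA9

print(data.decode("utf-8"))     # café  -> correct guess
print(data.decode("latin-1"))   # cafÃ© -> wrong guess, file is read incorrectly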
data compression - broad definition
reducing the number of bits required to store encoded information –> reducing overall file byte size
when decompressed –> encoded data is reconstructed to remake the original file
types of data compression
lossless - the original file is reconstructed with the exact same bit sequence
lossy - reconstructed bit sequence does not exactly match the original –> some encoded data is lost. usually associated with multimedia files –> errors manifest as aliasing errors and compression artifacts
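a lossless round trip sketched with Python's zlib (any lossless compressor behaves this way):

import zlib

original = b"AAAAABBBBBCCCCC" * 100

compressed = zlib.compress(original)
restored = zlib.decompress(compressed)

print(len(original), len(compressed))   # far fewer bytes after compression
print(restored == original)             # True -> exact same bit sequence back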
list the methods of compression encoding (6)
DP - SQRT
dictionary encode
predictive encode
symbol freq
quantization + modelling human perception
run length encode
transformations
symbol frequency compression
aka variable length coding
use algorithm to determine optimal bit pattern per symbol based on how frequently each symbol appears
more frequent = fewer bits –> compression
where a symbol = characters, values, etc
eg Huffman code
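a minimal Huffman-style sketch in Python (illustrative, not the lecture's exact algorithm):

import heapq
from collections import Counter

def huffman_codes(text):
    # heap entries: (frequency, tiebreaker, node); a node is a symbol or a (left, right) pair
    heap = [(freq, i, sym) for i, (sym, freq) in enumerate(Counter(text).items())]
    heapq.heapify(heap)
    count = len(heap)
    while len(heap) > 1:
        f1, _, left = heapq.heappop(heap)     # merge the two least frequent nodes
        f2, _, right = heapq.heappop(heap)
        heapq.heappush(heap, (f1 + f2, count, (left, right)))
        count += 1
    codes = {}
    def walk(node, prefix):
        if isinstance(node, tuple):           # internal node: branch on 0 / 1
            walk(node[0], prefix + "0")
            walk(node[1], prefix + "1")
        else:                                 # leaf: assign the bit pattern to the symbol
            codes[node] = prefix or "0"
    walk(heap[0][2], "")
    return codes

print(huffman_codes("aaaaaaaabbbc"))   # 'a' (most frequent) gets the shortest bit pattern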
run length encoding compression
used for data that is repeated several times
(eg sampled data that doesn’t change often) –> temperature every minute
encodes the data point –> paired with a count of X repetitions
therefore compresses repeat sequences
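a run length encode/decode sketch:

from itertools import groupby

def rle_encode(samples):
    # store (value, repetition count) pairs instead of every repeated sample
    return [(value, len(list(run))) for value, run in groupby(samples)]

def rle_decode(pairs):
    return [value for value, count in pairs for _ in range(count)]

temps = [20, 20, 20, 20, 21, 21, 20, 20, 20]   # temperature every minute
print(rle_encode(temps))                       # [(20, 4), (21, 2), (20, 3)]
print(rle_decode(rle_encode(temps)) == temps)  # True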
dictionary encoding compression
substitute patterns with shorter symbol codes
analogous to saying “let X = (equation)”
a portion of the compressed file is required to store the meaning of these shorter symbol codes so that decompression software knows what the original code meant
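a toy dictionary-substitution sketch (the dictionary and the placeholder codes are made up for illustration):

# "let X = (pattern)" -> the dictionary must travel with the compressed data
dictionary = {"the quick brown fox": "\x01", "jumps over": "\x02"}

def dict_encode(text):
    for pattern, code in dictionary.items():
        text = text.replace(pattern, code)
    return text

def dict_decode(text):
    for pattern, code in dictionary.items():
        text = text.replace(code, pattern)
    return text

msg = "the quick brown fox jumps over the quick brown fox"
packed = dict_encode(msg)
print(len(msg), len(packed))          # far fewer characters after substitution
print(dict_decode(packed) == msg)     # True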
predictive compression
uses current data to predict the next data point –> calculate the diff btw predicted and actual data pts
compression by only encoding failed predictions –> “predict the next data pt, but know it is not X”
OR
encode the difference btw predicted vs actual (decompress by predicting the next value then adding back the difference)
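a delta-encoding sketch of the second approach (predict "next value = current value", then store only the difference):

def delta_encode(samples):
    encoded = [samples[0]]                     # first value stored as-is
    for prev, cur in zip(samples, samples[1:]):
        encoded.append(cur - prev)             # difference btw predicted and actual
    return encoded

def delta_decode(encoded):
    samples = [encoded[0]]
    for diff in encoded[1:]:
        samples.append(samples[-1] + diff)     # predict, then add the difference back
    return samples

readings = [100, 101, 101, 102, 104, 104]
print(delta_encode(readings))                  # [100, 1, 0, 1, 2, 0] -> small numbers, cheap to store
print(delta_decode(delta_encode(readings)) == readings)   # True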
transformation encoding compression
use math algorithm to transform raw data into other formats which are then more easily compressed
decompress the file –> transform BACK into original data format
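a minimal sketch using one Haar-wavelet-style average/difference step as the transform (real codecs use transforms such as the DCT):

def transform(samples):
    # pairs become (average, difference): differences are small and compress well
    averages = [(a + b) / 2 for a, b in zip(samples[0::2], samples[1::2])]
    details  = [(a - b) / 2 for a, b in zip(samples[0::2], samples[1::2])]
    return averages, details

def inverse(averages, details):
    samples = []
    for avg, det in zip(averages, details):
        samples += [avg + det, avg - det]      # transform BACK into the original data
    return samples

data = [10, 12, 11, 13, 50, 52, 51, 49]
avgs, dets = transform(data)
print(dets)                            # small values clustered near zero
print(inverse(avgs, dets) == data)     # True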
quantization
used in multimedia –> decrease bit depth / resolution / sample rate (Hz)
reduce quality to compress the file
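a quantization sketch (step size chosen arbitrarily for illustration):

def quantize(samples, step):
    # reduce bit depth: nearby values collapse onto the same level
    return [round(s / step) for s in samples]

def dequantize(levels, step):
    return [level * step for level in levels]

audio = [0.12, 0.13, 0.11, 0.57, 0.58]
levels = quantize(audio, step=0.1)
print(levels)                   # [1, 1, 1, 6, 6] -> fewer distinct values to encode
print(dequantize(levels, 0.1))  # roughly [0.1, 0.1, 0.1, 0.6, 0.6] -> lossy, originals not recovered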