Representing Text Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

What is a character set?

A

A list of characters and the codes used to

represent each one

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does ASCII stand for? How many bits are in each character and how many characters total?

A
  • ASCII: American Standard Code for Information Exchange

- Seven bits for each character, 128 unique character

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How many bits was ASCII later evolved to?

A
  • ASCII evolved so that all eight bits were used

- Represent lines, symbols, and letters with accents

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How many are control characters?

A
  • First 32 and last one are control characters or hidden characters, they control how text appears, but do not
    appear as text
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How to find the code for uppercase/lowercase letters?

A

Uppercase letters start 65
• Code for J (10th letter) is 65 + (10 – 1) = 74
Lowercase letters start at 97
• Code for j (10th letter) is 97+ (10 – 1) = 106

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the issue with the ASCII set?

A
  • Limited, a lot of symbols are missing and is not enough for international use
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is Unicode character set?

A
  • A super set of ASCII

- The first 128 characters in the Unicode character set correspond exactly to the ASCII character set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How many bits per character and characters are represented in the Unicode character set?

A
  • Uses 16+ bits per character and can represent more than 1 million characters
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is lossless compression?

A
  • A compression technique that does not lose any data in the compression process
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is data compression?

A
  • A reduction in the amount of space needed to store a piece of data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the compression ratio?

A
  • Size of the compressed data/size of the original data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the two types of data compression?

A
  • Lossless: A compression technique that does not lose any data in the compression process
  • Lossy: Some information may be lost in the process
    of compaction.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the three types of loseless techniques?

A
  1. Keyword Encoding
  2. Run-Length Encoding
  3. Huffman Encoding
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is keyword encoding?

A
  • Frequently used words are replaced with a single character
  • The characters used to encode cannot be part of the original text
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What type of technique is keyword encoding usually used with?

A

-Huffman encoding

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is Run-length encoding?

A
  • A single character may be repeated over and over again in a long sequence
  • A sequence of repeated characters
    is replaced by
    1. A flag character,
    2. Followed by the repeated character,
    3. Followed by a single digit that indicates how many times the character is repeated
  • you do not need to process 1-3 characters
17
Q

What does Huffman Encoding use to represent each

character?

A
  • Uses variable-length bit strings
18
Q

What is an advantage of Huffman Encoding

A

-saving lots of storage space by making shorter sequences (less bits) for the frequent letters and longer sequences (more bits) for the non-frequent letters

19
Q

What is the least effective compression?

A

-Keyword Encoding

20
Q

What is the most effective compression?

A

-Huffman Encoding

21
Q

What are the applications of Huffman encoding?

A
  • JPG, MP3, ZIP
22
Q

How to find compression ratio for Huffman encoding?

A

number of characters*bits/compressed bit length