Digital Information: Storing Text in Binary Flashcards
Encodings
mappings from a character to a binary number
How any symbols can the 2-bit encoding represent? and why
4 bc 00, 01, 10, 11
ASCII encoding stands for
American Standard Code for Information Interchange
ASCII encoding
the most common character encoding format for text data in computers and on the internet
Each ASCII character is encoded in binary using how many bits
7
problems of the ASCII encoding
- only English characters
- represents characters with 7 bits which wastes memory for computers that store info in bytes
Unicode
a universal character set which assigns each character a code point and a name to each character
How are the characters in the ASCII table converted to be stored into a computer
the characters are converted to decimal and then to binary
What problem did Unicode solve of ASCII and how did they do so
they fixed the fact that ASCII only had English and they did this by making characters that were used by all the languages + the individual ones with more specific codes and names
example of something in Unicode
“ą” is assigned to “U+0105” and named “Latin Small Letter A with Ogonek”
Unicode ___ (is/is not) a character set but it ____ (is/is not) an encoding
is, is not
UTF-8
an encoding that is compatible with ASCII encoding but also solves its problems because it can describe the characters in Unicode with 1-4 bytes
how do you know if a character is demonstrated by a UTF sequence
if the first number is a 0, the first two numbers are 1, start with 1110, orstart with 11110
what kind of encoding do most html files use
utf-8