text file formats Flashcards

1
Q

ASCII

A

American Standard Code for Information Interchange - each character encoded in 1 byte (7-bit encoding) - 65-90 caps, + 32 to caps to get lower case - <=31 control characters - English characters only

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

new line standards

A

UNIX / linux = LF
windows = CR + LF
mac = CR (to OS-9 then UNIX based)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

ISO-8859 encoding

A

each family of language has individual has own code page - 0-127 identical to ascii, 128-256 used for characters above English
+ 8 bit
+ simple
- one language family at a time
- only languages with < 128 non-characters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

unicode

A

does not code, just gives each character a code point (unique numerical identity) - 0-255 corresponds with iso-8859-1 (west Europe)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

encoding unicode at bite level

A

ucs-2 - uses 2 bites to represent code point
- not back compatible with ascii
- unicode > 65k code points
UTF - 8 - 1 bite if ASCII, multiple if not
+ back compatible with ASCII
+ all unicode
+ standard

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

UTF-8

A

up to 4 bites
0-7 bits - 0XXXXXXX
8-11 bits - 110XXXXX 10XXXXXX
12-16 bits - 1110XXXX 10XXXXXX 10XXXXXX
17-21 bits - 11110XXX 10XXXXXX 10XXXXXX 10XXXXXX

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

reconising the character encoding

A

guessing on statistical analysis - not recommended
Byte Order Mark - string at beginning of file indicating encoding type - not recommended
HTTP header - header item specifies encoding type - requires web server to be configured
In HTMl file - meta tag - if no control of web server

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is not encoded

A

font
font size
special formates
colour

How well did you know this?
1
Not at all
2
3
4
5
Perfectly