CS Flashcards
What is unicode?
A universal encoding of all text characters. There’s over 140k characters encoded in Unicode represented by code points like U+0041.
What are UTF encodings?
Encodings like UTF-8 and UTF-16 are encoding schemes that specify how code points like U+0041 are represented in bytes. For example, UTF-8 specified that unicode characters are represented in 1 (the range of ASCII characters) to 4 bytes.
What is BOM in the context of Unicode?
A byte order mark. It is required in UTF-16 and UTF-32 to indicate endieanness - big endian being most significant byte first and little endian being the least significant byte first.
What is parsing?
Parsing is the process of checking if textual input is syntactically correct according to some grammar.
What is recursive descent parsing?
Recursive-descent parsing is one of the simplest parsing techniques that is used in practice. Recursive-descent parsers are also called top-down parsers, since they construct the parse tree top down (rather than bottom up).
What is a grammar?
A grammar is a definition of the syntax of a language.
What is a statement?
A group of expressions or other statements that you design to carry out an action.
What is an expression?
Anything that evaluates to a value.