Basic Cleaning Techniques Flashcards
1
Q
what is string?
A
A string is a sequence of characters represented by code points.
Every string in Python is seen as a Unicode covering the numbers from 0 through 0x10FFFF(hexadecimal) = 1114111 in decimal = 100001111111111111111 in binary
Then the sequence has to be represented as a set of bytes (values from 0 to 255) in memory.
2
Q
encoding
A
the rules for translating a Unicode string into a sequence of bytes are called encoding