121 Week 11 - Hashing Flashcards

Question 1

Q

What does a hashing function do

Answer

A

Converts a key (often a string) to an integer.
It is a deterministic function meaning it always gives the same output if the input remains the same.

Question 2

Q

Hash table

Answer

A

A data structure used for storing data.
Elements are stored in buckets.
Each table element has a unique part called the key.
The key is inputted into a hash function which returns an output. This output corresponds to which bucket the element will be stored in.

Question 3

Q

Why are hash tables used.

Answer

A

Hash tables allow for very fast retrieval of data as, assuming collisions are low, retrieval has big O of O(1).

Question 4

Q

Hash example: sum first 4 letters

Answer

A

The position in the alphabet of the first 4 letters of a word is added together to get the hash value of the word.
Limitations: collisions can be common, array is limited to 104 (4 * 26) elements, indicies above 75 and below 10 are uncommon.
Could be solved by adding all letters or including length.

Question 5

Q

Better hash example: multiply first 4 letters

Answer

A

Alpabetic position of first 4 letters are L1-4.
Index = L1 + L2×26 + L3×262 + L4×263.
This fixes having a limited array length and reduces collisions.
Limitations: Need a very long array to be able to store elements in the direct index of the hash function.

Question 6

Q

How to deal with large indices.

Answer

A

If you want to use an array with a size of Size you can use: newIndex = index%Size to get a new index which will be within the boundary 0-Size. (% = int remainder division).
Use a size of around double the number of expected inputs.

Question 7

Q

Bit shifting vs multiplication

Answer

A

Bit shifting is a lot faster to do than multiplication so by multiplying by powers of 2, we can use bit shifting instead and create more efficient hash functions.

Question 8

Q

How can collisions be reduced in hash tables.

Answer

A

Collisions are reduced if there is a wide range of index values and a uniform distribution of keys as this reduces the chance of 2 keys hashing to the same value.

Question 9

Q

How can collisions be dealt with in hash tables.

Answer

A

If a collision occurs we can use chaining to have a chain of elements starting at the first element in that bucket.
When retrieving an element, if the element in the bucket is not the correct element, complete a search on the chain to find the element or determine if it is in the hash table.

Question 10

Q

How can key-value pairs be used to store more data.

Answer

A

Key value pairs allows for more data than just the key to be stored in a hash table. The key will be used to find the hash value for the buckets location but other information can be stored in the bucket as well.

121 Week 11 - Hashing Flashcards

(10 cards)