hash tables Flashcards

Question 1

Q

what is a hash table

Answer

A

data structure where items are stored in a location determined by their content - content based inde

Question 2

Q

what is every hash table fundamentally composed of?

Answer

A

array of items

hash function: that converts item to an index

Question 3

Q

what is a hash function

Answer

A

is a mapping from an item to an integer

Question 4

Q

what are the requirements for a good hash function?

Answer

A

fast compute
deterministic
spread keys evenly in hash table - no large gaps

Question 5

Q

what is a perfect has function

Answer

A

maps an every distinct key onto a distinct integer

minimal perfect hash function has no holes in the array

Question 6

Q

how are collisions addressed?

Answer

A

open addressing - linear and quadratic probing

closed addressing - chaining

Question 7

Q

explain linear probing insert

Answer

A

generate hash code = h

while A[h] contains a key A[h] - {key, value}

Question 8

Q

explain linear probing find

Answer

A

generate hash code = h
while A[h] contains a key:
-- if A[h]{key} == key  return value
-- h = (h+1)%table size
return not found

Question 9

Q

issues with linear probing

Answer

A

primary clustering - multiple adjacent items and slow performance
uneven gaps in the table
full table

Question 10

Q

what to do when the table becomes full?

Answer

A

throw an error
create new and larger table
automatically make a larger array and then rehash every thing

Question 11

Q

what is the load factor

Answer

A

the proportion of the table that is fill (λ)

Question 12

Q

probability of a cell being empty

Answer

A

1 - load factor (lambda)

1-λ

Question 13

Q

average number of cells to be examined for insert

Answer

A

T(λ) = (1+1/(1-λ)^2)/2

Question 14

Q

average number of cells to be examined for successful find

Answer

A

T(λ) = (1+1/(1-λ))/2

Question 15

Q

quadratic probing insert

Answer

A

generate hash code h = hash(key), i =1
while a[h] contains a key
h = h+i*i%table size
a[h] = key, value

Question 16

Q

quadratic probing find

Answer

Study These Flashcards

A

generate hash code h = hash(key), i =1
while a[h] contains a key
-- if A[h]{key} == key  return value
--- h = h+i*i%table size
--- i++

return not found

Question 17

Q

basic idea of collision resolution by chaining

Answer

Study These Flashcards

A

use a table of pointers/references

each new item must be added to a linked list at that position in the table

Question 18

Q

insertion algorithm for chaining

Answer

Study These Flashcards

A

generate hash code h = hash(key), p = new Node
p.data = {key, value}; p.next = A[h]
A[h] = p

Question 19

Q

find algorithm for chaining

Answer

Study These Flashcards

A

generate hash code h = hash(key), p = A[h]
while p != null and p{key} != key
— p = p.next
return p

Question 20

Q

analyze the chaining technique

Answer

Study These Flashcards

A

n items and m table size - even distribution of hash values and use of all possible hash values
on ave each linked list is of size n/m
average search time - 1/2 n/m

Question 21

Q

complexity analysis of chaining

Answer

Study These Flashcards

A

best - O(n/m)
ave - 1/2 n/m
worst - O(n)
m - tablesize
n - number of items

All dependent on choice of M relative to n

Question 22

Q

handling deletions open addressing vs chaining

Answer

Study These Flashcards

A

open addressing:

deletion not easily possible
add a flag to mark deleted items - skip deleted items during search, unmark and over write deleted items on insert

chaining:
- use linked list deletions

Question 23

Q

other variations to chaining

Answer

Study These Flashcards

A

double hashing - if there is a collision because of due to secondary clustering use a second hash function when there is a collision
h = H1 +H2 - where H2 is the rehashing function
a different sequence is checked for each key

chaining using BST or other data structures

Question 24

Q

what is secondary clustering?

Answer

Study These Flashcards

A

where a key generates the same sequence of locations to check

hash tables Flashcards

(24 cards)