DSA - Hash Tables Flashcards
What is a Hash Function?
hash(key) computes the location from a KEY
Does the data type of the key matter when we use the hash function?
NO
How Does a Hash Table Work?
Take the key, compute a number from it (without looking at any other keys or records), and use that number as an index into the array.
This gets us immediately, in 1 step, to where we want to be.
Where do you store KEY?
Store it in a Linked List / Tree / Hash Table
Describe assignment in a Hash Table?
- It is O(1) time complexity
- Very memory inefficient
==> If we need an array of size 10^7 but only use 500 slots, the rest of the array is still allocated, hence very inefficient.
Define Hash/Data Collision:
2 different keys whose hash values (e.g. the remainder mod the array size) map to the same index.
- Increasing the array length may decrease the likelihood of collisions (Does Not Fully Remove Them)
- Requires a robust & reliable strategy to resolve collisions
What Does a Hash Table Consist of:
- An array arr for storing the values
- A hash function hash(KEY) => turns a key into an index of the array
- A mechanism for dealing with collisions
What are the 2 Types of solutions to hash Collisions?
- Chaining Strategy
- Open Addressing Strategy
What does the Chaining Strategy do?
Uses another data structure at each position of the hash table.
- The chaining strategy covered here is Direct Chaining
- Uses a Linked List to store the values with the same hash (code)
What does the Open Addressing Strategy do?
GIVE THE 2 EXAMPLES?
Finds a different address in the same array for the colliding value to be stored.
(THE ADDRESS MUST BE UNUSED or OPEN)
- The Two Strategies are Linear Probing & Double Hashing
How to Insert with Direct Chaining?
- Always check if the key we are inserting is already in the LL at position hash(key)
- If it isn't, insert the key at the BEGINNING of the list
// INSERTING AT THE BEGINNING IS NOT REQUIRED, BUT IN PRACTICE IT IS IDEAL, AS A NEWLY INSERTED KEY IS MORE LIKELY TO BE ACCESSED SOON
TAKES O(n) in the worst case = may need to go through all elements already stored in the hash table
How to Delete with Direct Chaining?
To delete(key), we delete the key from the LL stored at position hash(key)
Takes O(n) in the worst case
How to do Lookup in Direct Chaining?
lookup(key) returns TRUE/FALSE, depending on whether the key is stored in the list at position hash(key)
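The three direct-chaining operations above can be sketched in Python (a minimal illustration; the class and method names are my own, and plain Python lists stand in for the linked lists):

```python
# Minimal direct-chaining hash table sketch (assumed names, not from the source).
# Each array slot holds a Python list acting as the chain.

class ChainedHashTable:
    def __init__(self, size=8):
        self.T = size
        self.arr = [[] for _ in range(self.T)]

    def _hash(self, key):
        return hash(key) % self.T       # hash(key) -> index into arr

    def insert(self, key):
        chain = self.arr[self._hash(key)]
        if key not in chain:            # first check the chain for the key
            chain.insert(0, key)        # insert at the BEGINNING of the list

    def delete(self, key):
        chain = self.arr[self._hash(key)]
        if key in chain:
            chain.remove(key)

    def lookup(self, key):
        return key in self.arr[self._hash(key)]
```

The membership checks walk the whole chain, which is where the O(chain length) cost of each operation comes from.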
What is a Bad Hash Function?
-All entries have the same hash value
- 1 LARGE Linked List (LL)
Why is a Bad Hash Function BAD?
It forces a LARGE linear search through that one long list
What makes a GOOD Hash Function?
- Uniformly distributes keys among positions
- A random key must have the same probability of being stored at every position
- The location given by hash(key) then has an expected number of entries stored there equal to n/T.
What is the Load Factor?
AVG. NO. of entries stored on a Location
n/T
n = total number of stored entries
T = size of hash table
REPRESENTS HOW FULL THE HASH TABLE IS
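As a quick illustration with made-up numbers (500 entries, table size 2000), the load factor is just n divided by T:

```python
# Illustrative numbers, not from the source: 500 entries in a table of size 2000.
n, T = 500, 2000
load_factor = n / T   # average number of entries stored per location
assert load_factor == 0.25   # the table is one quarter "full"
```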
What is an Unsuccessful lookup of a Key?
- The key is not in the table
- The location hash(key) stores n/T entries on AVG.
(We have to TRAVERSE them all: a LINEAR SEARCH over n/T entries)
What is a Successful Lookup for a key?
- The location hash(key) stores k = n/T entries on AVG
- On AVG, a linear search in a LL of k elements takes
(1/k)(1+2+…+k) = (1/k)·k(k+1)/2 = (k+1)/2 comparisons
- ASSUME a MAX load factor λ, that is, n/T <= λ
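The average-comparisons formula above can be checked numerically (a small sanity check, not part of the source):

```python
# Average comparisons in a successful linear search over a chain of k elements:
# the i-th element costs i comparisons, so the mean is (1+2+...+k)/k = (k+1)/2.
for k in range(1, 20):
    avg = sum(range(1, k + 1)) / k
    assert avg == (k + 1) / 2
```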
What is AVG case time complexity of unsuccessful lookup?
n/T <= λ comparisons => O(1)
What is AVG case time complexity of successful lookup?
(1/2)(1+n/T) <= (1/2)(1+λ) comparisons => O(1)
(λ is a constant)
Time Complexity of insert(key)?
- Same as an unsuccessful lookup: O(1)
- First check if the key is already stored in the table
- Otherwise, insert the key at the BEGINNING of the list stored at hash(key)
Time Complexity of delete(key)?
Same as Successful Lookup
O(1)
What 2 Assumptions are made for Direct Chaining?
1) We have a good hash function
- Expected length of the chains is n/T
2) We assume a maximal load factor
- n/T <= λ for a constant λ (MAKE SURE THE LOAD FACTOR IS BOUNDED BY SOME λ)
Assuming those 2 conditions, it can be shown that the operations of the Hash Table are ALL O(1).
A good hash function depends on the distribution of the data.
- THE CONSTANT TIME COMPLEXITY IS AMORTIZED ONLY
Disadvantages of the Chaining Strategy?
1) To avoid lots of hash collisions, we need lots of unused space
- Good performance requires maintaining a low load factor
{between 0.25 and 0.75}
- At 0.25 you are using an array 4x the number of entries you are going to store in the hash table
(A LOT OF UNUSED SPACE)
2) LLs require a lot of allocation (ALLOCATING MEMORY), which is slow
- Every insert needs to allocate memory
- Not only are we using space in the array, but allocating memory is an expensive operation to call.
What is the Idea of Insertion in LINEAR PROBING?
If the primary position hash(key) is occupied, search for the 1st available position to the right of it.
IF WE REACH THE END, WRAP AROUND
How to compute the Fallback Positions For LINEAR PROBING?
- Increment the value of hash(key) by 1 and take the result MOD the size of the array. Repeat until you find a FREE SPACE
- E.g => (hash(key) + 1) mod T
T = size of the array
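A sketch of the fallback-position sequence for linear probing (the function name is mine):

```python
def probe_positions(h, T):
    """Yield the primary position h, then (h+1) mod T, (h+2) mod T, ... (wraps around)."""
    for i in range(T):
        yield (h + i) % T

# With T = 7 and primary position 5, probing wraps around past the end:
assert list(probe_positions(5, 7)) == [5, 6, 0, 1, 2, 3, 4]
```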
Idea of Deleting using Linear Probing?
1- Find the key stored in the table:
- Start from the primary position hash(key) and go right until the key or an empty position is found.
2- If the key is stored in the table, replace it with a marker value (TOMBSTONE = #)
A later insert probes until it finds a # or an empty space and places the new value there
Issue With Deleting in LINEAR PROBING?
If we deleted a value involved in a collision by simply emptying its slot, a later search starting at that hash position would stop at the empty slot and report FALSE.
That is wrong: the colliding value had been moved to the next free space to the right, so without a tombstone we would fail to find it.
Linear Probing Searching?
- Start from Primary Position hash(key)
- Search for key to the right.
- Skip over # TOMBSTONES
- If we reach Empty Position, key is not in table
Linear Probing Inserting
(MORE ACCURATE)
- Search for hash(key) as in Searching
- NOTE the first Tombstone position we find
- If we find the key, signal an ERROR
- If we reach an empty position, the key is not in the table
- Insert the key into the 1st Tombstone found, if any
- Otherwise, insert into the empty position found
WE CAN'T INSERT UNTIL WE KNOW THAT THE KEY IS NOT ALREADY IN THE HASH TABLE.
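The insert procedure above, with the duplicate-key check and tombstone reuse, might look like this (a sketch; `EMPTY`, `TOMB` and `lp_insert` are my names, the array stores bare keys, and keys are assumed to be neither `None` nor `"#"`):

```python
EMPTY, TOMB = None, "#"   # assumed sentinel values for free and deleted slots

def lp_insert(arr, key, h):
    """Insert key whose primary position is h; arr slots hold a key, EMPTY, or TOMB."""
    T = len(arr)
    first_tomb = None
    for i in range(T):
        pos = (h + i) % T
        if arr[pos] == key:
            raise KeyError("key already in table")   # found the key -> signal error
        if arr[pos] == TOMB and first_tomb is None:
            first_tomb = pos                         # note the first tombstone
        if arr[pos] is EMPTY:                        # key is not in the table
            arr[first_tomb if first_tomb is not None else pos] = key
            return
    if first_tomb is not None:                       # no empty slot, but a tombstone
        arr[first_tomb] = key
    else:
        raise MemoryError("table is full")
```

Note the search must run until an empty slot (or the key) is found before writing, which is why the first tombstone is only remembered, not used immediately.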
Time Complexity of Linear Probing?
Insert, Search & Delete are ALL O(1) (amortized, assuming a good hash function and a bounded load factor)
Primary Cluster in LP ?
Clusters caused by entries with the same hash code.
Secondary Clusters in LP?
When the Collision handling strategy causes different entries to check the same sequence of locations when they collide
LP Problem with Clustering?
Clusters tend to get bigger and bigger, even with a small load factor.
Lots of clusters mean more probing, and thus longer lookups.
USE DOUBLE HASHING TO MAKE CLUSTERING LESS LIKELY TO OCCUR
What is Double Hashing?
Use a Primary & a Secondary hash function.
Primary Hash: the landing place in our array (the position in the array we go to first)
- IF THE SPACE IS EMPTY OR # WE CAN USE IT
Secondary Hash: add its result to the primary position as an offset to find a free space; repeat until we find one.
REDUCES PROBLEMS WITH SECONDARY CLUSTERS, SO FEWER COLLISIONS
For different key values, the offset will be different
Difference Between Linear Probing and Double Hashing?
- The difference is that in DOUBLE HASHING the offset is different for every key value, due to the secondary hash function
ALL OPERATIONS WORK THE SAME WAY FOR BOTH
Insertion Double Hashing?
Try the primary position hash1(key) first and, if it fails, try the fallback positions:
1) (hash1(key) + 1*hash2(key)) MOD T
2) (hash1(key) + 2*hash2(key)) MOD T
3) (hash1(key) + 3*hash2(key)) MOD T
REPEAT UNTIL WE FIND AN EMPTY SPACE
T = size of the ARRAY
Linear Probing Fallback Calculation?
(hash(key) + i) mod T
i=1,2,3…
Double Hashing Fallback Calculation?
(hash1(key) + i*hash2(key)) mod T
i = 1,2,3,…
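A sketch of the double-hashing probe sequence, using assumed example values for hash1(key), hash2(key) and T:

```python
def dh_positions(h1, h2, T):
    """Probe sequence (h1 + i*h2) mod T for i = 0, 1, 2, ..., T-1."""
    return [(h1 + i * h2) % T for i in range(T)]

# With T = 7, hash1(key) = 3 and hash2(key) = 2 (illustrative values),
# the sequence visits every position exactly once:
assert dh_positions(3, 2, 7) == [3, 5, 0, 2, 4, 6, 1]
```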
Problem with Double Hashing?
Can have Short Cycles
What are short cycles in Double Hashing?
A short cycle is a loop where, after a few probes, we get back to an already visited position.
WE CAN'T FIT ANY MORE VALUES, even though free spaces may remain (e.g. the probe sequence might only ever visit even indices, leaving the odd ones unreachable)
How to Fix Short Cycles?
Make the table size T & hash2(key) COPRIME.
2 SOLUTIONS:
1) T is PRIME
- If the array size is prime, T and hash2(key) won't share a common divisor
- But this is difficult when growing the array
2) T = 2^k and hash2(key) is ODD
- Test if hash2(key) is ODD; if EVEN, add 1
- This just sets the least significant bit so it is ODD
- Works because an odd hash2 and T = 2^k share no common divisor
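A small demonstration of a short cycle and the least-significant-bit fix (the numbers are illustrative):

```python
T = 8                                   # T = 2^3
# An even hash2 value (4) shares a divisor with T, so the probe
# sequence cycles between just two positions:
bad = {(3 + i * 4) % T for i in range(T)}
assert bad == {3, 7}                    # only 2 of the 8 positions are visited

# Fix: force hash2 to be odd by setting its least significant bit.
h2 = 4 | 1                              # 4 -> 5, which is coprime with 8
good = {(3 + i * h2) % T for i in range(T)}
assert good == set(range(T))            # every position is reachable
```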
What is the problem with a large common divisor?
The larger the common divisor of hash2(key) and T, the more positions the probe sequence skips over and never reaches.
What does Coprime mean?
a & b are coprime if no number other than 1 divides both a & b.
Prime numbers are the numbers divisible only by 1 and themselves.
For 2nd Solution for short cycles, why is the array of size 2^k?
- Doubling a size of 2^k keeps it of the form 2^k
- Less fragmentation; however, we must keep hash2(key) coprime with the array length
DOUBLING THE SIZE OF THE ARRAY WHEN IT GETS FULL ENSURES T = 2^k
What happens when table is Full?
The Hash Table counts as full when n/T > λ.
The array need not be literally full: n/T exceeding λ just means the load factor bound is violated.
What does n/T > λ mean?
If true, allocate a new table twice the size and move the values into the new table.
The Idea of Rehashing?
When the table becomes full after an insertion, allocate a new table twice the size and INSERT all elements from the old table into it.
WE CAN CACHE THE HASH FUNCTION VALUES, SO IT IS QUICKER TO PLACE ENTRIES INTO THE NEWLY ALLOCATED ARRAY
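The rehashing step might be sketched like this for a direct-chaining table (the function name and the use of Python's built-in `hash` as the hash function are my assumptions):

```python
def rehash(arr, T):
    """Allocate a table of size 2*T and re-insert every entry from arr."""
    new_T = 2 * T
    new_arr = [[] for _ in range(new_T)]
    for chain in arr:                    # move all elements from the old table
        for key in chain:
            new_arr[hash(key) % new_T].insert(0, key)
    return new_arr, new_T
```

The full pass over all n entries is what makes the occasional insert cost O(n), while doubling keeps the amortized cost O(1).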
Consequence of Insert?
The worst case time complexity is O(n), when REHASHING occurs
The AMORTIZED TIME COMPLEXITY is O(1)
What do we need to do with the hash function when doubling array size?
Need to change hash function
Usually:
- bigHash(key) mod T
(bigHash COMPUTES a big hash code)
AFTER DOUBLING THE SIZE:
bigHash(key) mod (2*T)
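For illustration, Python's built-in `hash` can stand in for bigHash (an assumption, not the source's function):

```python
def index_for(key, T):
    """Reduce a big hash code to an index in an array of size T."""
    return hash(key) % T   # hash() plays the role of bigHash here

T = 16
assert 0 <= index_for("example", T) < T
# After doubling, the same big hash code is simply reduced mod 2*T:
assert 0 <= index_for("example", 2 * T) < 2 * T
```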
What is a Hash Table?
An ADT whose implementation consists of an array arr, a primary hash function hash1(key) {and maybe hash2(key)}, and a collision-resolution mechanism
Comparing Hash Table to Trees?
AVL:
- Requires keys to be comparable; the operations are O(log n) in the BEST, AVG & WORST CASE
HASH:
- Requires a good hash function; operations have O(1) AMORTIZED time complexity FOR ALL OPERATIONS
Double Hashing Pros over the other 2 Hash Table Methods?
Better performance advantages:
- Clustering in LP is worse than in DH
- DC needs to allocate memory, which has a large constant cost
THESE PROS AFFECT THE CONSTANT FACTOR, NOT THE ASYMPTOTIC COMPLEXITY
What happens if load factor drops below min threshold?
RARELY done, as it doesn't speed up performance.
Rehash into a hash table 1/2 the size.
SAVES MEMORY
What happens if the number of Tombstone values exceeds a threshold value?
Rehash without doubling the size
- We may get a hash table 1/2 the size
- CONSEQUENTLY delete is O(1) AMORTIZED time complexity