HashTableSet Flashcards

1
Q

Map ADT

A

has functions: size(), get(k), put(k,v), remove(k), find(k)

2
Q

Hash Table (unordered_map)

A

The size of a hash table is proportional to the number of keys (not the values of the keys). The key is the input to a hash function, which generates a hash code (output) that assigns the element to an index; the element's key is stored in that slot's data field.

3
Q

Hash Function

A

Takes in a key and outputs a hash code, which generates the index value for insertion into a table/map. Must be fast.

4
Q

Hash Table Concerns

A

Hash codes are not unique: multiple keys can generate the same hash code, resulting in collisions. Handling collisions is the main focus of maintaining the optimal
runtime of the Hash Table data structure.

5
Q

What is a possible ADT using Red-Black Tree DS?

A

Red-Black Tree can support a (C++) Set ADT

6
Q

What is a possible ADT using Linked List DS?

A

Linked List can support a (C++) List ADT

7
Q

What is a possible ADT using ArrayList DS?

A

ArrayList can support a (C++) Vector ADT

8
Q

Set ADT

A

has functions: size(), insert(x) [no duplicates], remove(x), find(x)

9
Q

Lexicon vs. Numbers

A

Strings are a sequence (of characters); numbers are a single entity.

10
Q

Red-Black Tree Advantages

A

Faster insert than AVL: RBT insertion traverses the tree once instead of twice.
While RBTs are taller than AVL trees, find is still guaranteed O(log n).

11
Q

Red-Black Tree Disadvantages

A

Slower to find (than AVL): RBTs are (slightly) taller than AVL trees

12
Q

Red-Black Tree Insert Algorithm

A

While not at a leaf: 1. Move down the tree toward the destination; if a node has two red children, recolor and rotate as necessary to fix it. 2. Insert the node. 3. Do final rotations to fix the tree (no need to go back up the tree).
Runtime: O(log n)

13
Q

Can any node have 2 red children?

A

Black nodes can have 2 red children. Inserting a new (red) node below such a pair would violate the RBT rules, so recoloring (red becomes black and black becomes red) and rotations are applied on the way down; this ensures that inserting in this scenario leaves a valid tree.
A red node can never have red children.

14
Q

What DS can implement the Set ADT?

A

Array List, Linked List, Binary Search Tree, Randomized Search Tree, AVL Tree, Red-Black Tree

15
Q

What DS can implement the Map ADT?

A

Array List, Linked List, Binary Search Tree, Randomized Search Tree, AVL Tree, Red-Black Tree

16
Q

True or False: Set ADT can be modified to relatively simply implement Map ADT

A

True

17
Q

True or False: Map ADT can be modified to relatively simply implement Set ADT

A

True

18
Q

True or False: Implementing Set ADT using RBT has better worst case Big-O runtime complexity for finding and inserting elements than implementing the Set ADT using Linked List

A

True

19
Q

Worst Case Time Complexity of Insert to Hash Table

A

O(n)

20
Q

Average Case Time Complexity of Insert to Hash Table

A

O(1)

21
Q

Average Case Time Complexity of Find to Hash Table

A

O(1)

22
Q

Worst Case Time Complexity of Find to Hash Table

A

O(n)

23
Q

If all keys were even (or all odd), what might happen?

A

Some hash values will never be used, leaving empty indices.

24
Q

Hash Functions Properties

A

Input is an object x
Output is an integer representation of x
(necessary) Property of Equality: if x = y, then h(x) must equal h(y)
(not necessary) Property of Inequality: if x ≠ y, it would (be nice) if h(x) ≠ h(y)
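
A sketch of such a hash function (illustrative only, not a function defined in these cards), in C++:

  #include <cstddef>
  #include <string>

  // Simple polynomial string hash: equal strings always produce equal hash
  // codes (the necessary property); unequal strings usually produce
  // different codes (the desirable, but not guaranteed, property).
  std::size_t simpleStringHash(const std::string& s) {
      std::size_t h = 0;
      for (char c : s)
          h = h * 31 + static_cast<unsigned char>(c);  // 31: arbitrary small prime
      return h;
  }
  // The table index is then simpleStringHash(x) % m for a table of capacity m.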

25
Q

Unsorted Array List and Linked List, what is worst case time complexity of FIND?

A

O(n)

26
Q

Randomized Search Tree and well-structured Skip List, average time complexity of FIND?

A

O(log n)

27
Q

Sorted Array List and Binary Search Tree, what is worst case time complexity of FIND?

A

O(log n)

28
Q

Hashing is the: (definitions)

A

transformation of a string of characters into a usually shorter fixed-length value or key that represents the original string.

one way to enable security during the process of message transmission when the message is intended for a particular recipient only… Hashing is also a method of sorting key values in a database table in an efficient manner.

involves applying a hashing algorithm to a data item, known as the hashing key, to create a hash value.

29
Q

Bloom Filter

A

A SPACE-EFFICIENT data structure used to check whether an item is in a set or not.

The trade-off for space efficiency is that the answer is probabilistic rather than deterministic: either a TRUE NEGATIVE ("not in the set") or "might be in the set".

30
Q

Strength of Bloom Filter

A

Space-efficient: O(1) space, regardless of the number of items inserted.
Fast: insert and lookup operations are both O(1) time

31
Q

Weaknesses of Bloom Filter

A

Probabilistic: Bloom filters can only definitively identify true negatives. They cannot identify true positives; a positive only means "might be there".

Limited interface: only supports insert and lookup. Cannot iterate through the items in the set or delete them.

32
Q

Implementation of Bloom Filter

A

Bitmap with all bits defaulting to 0. Insertion sets the corresponding bit(s) to 1.
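
A minimal C++ sketch of this idea (sizes, hashing scheme, and names are illustrative assumptions, not the course's implementation):

  #include <cstddef>
  #include <functional>
  #include <string>
  #include <vector>

  // Bitmap of m bits, all defaulting to 0 (false); insertion sets k bits to 1.
  class BloomFilter {
  public:
      BloomFilter(std::size_t numBits, std::size_t numHashes)
          : bits_(numBits, false), k_(numHashes) {}

      void insert(const std::string& key) {
          for (std::size_t i = 0; i < k_; ++i)
              bits_[indexFor(key, i)] = true;              // flip bits to 1
      }

      bool mightContain(const std::string& key) const {
          for (std::size_t i = 0; i < k_; ++i)
              if (!bits_[indexFor(key, i)]) return false;  // true negative: definitely absent
          return true;                                      // "might be in the set"
      }

  private:
      std::size_t indexFor(const std::string& key, std::size_t i) const {
          // Derive k hash values by combining two base hashes (a common trick,
          // assumed here for brevity).
          std::size_t h1 = std::hash<std::string>{}(key);
          std::size_t h2 = std::hash<std::string>{}(key + "#");
          return (h1 + i * h2) % bits_.size();
      }

      std::vector<bool> bits_;
      std::size_t k_;
  };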

33
Q

General Course Thesis:
Design data structures that
trade per-operation efficiency for
overall efficiency

A

Design data structures that
trade per-operation efficiency for
overall efficiency

34
Q

Collision Resolution Strategy: Linear Probing

A

Collision Strategy: if an object key maps to an index that is already occupied, simply shift over and try the next available index.
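
A small C++ sketch of linear probing insertion (names and types are illustrative assumptions):

  #include <cstddef>
  #include <functional>
  #include <optional>
  #include <string>
  #include <vector>

  // Insert with linear probing: start at the home index and keep shifting
  // over (wrapping around) until an empty slot or the key itself is found.
  bool linearProbeInsert(std::vector<std::optional<std::string>>& table,
                         const std::string& key) {
      const std::size_t m = table.size();
      const std::size_t start = std::hash<std::string>{}(key) % m;
      for (std::size_t step = 0; step < m; ++step) {
          std::size_t idx = (start + step) % m;
          if (!table[idx]) { table[idx] = key; return true; }  // empty: insert here
          if (*table[idx] == key) return false;                // already present
      }
      return false;  // table is full
  }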

35
Q

Closed hashing collision resolution strategy (i.e., we will insert the actual key only at an address within the bounds of our Hash Table)

A

Contrast with Linear Probing, which is considered open addressing: the destination is not entirely deterministic, since if a slot is occupied we shift over some number of slots. The addresses are OPEN to moving.

36
Q

Weakness in linear probing: the "clumping" of inserts around each other as they shift over to the next index. Probabilistically, this increases more than the straightforward M/N probability would suggest.

A

Simply put, the likelihood of collisions is greater with this resolution strategy due to clumping. It is somewhat counterintuitive.

37
Q

With the initial indexing (probe, then shift on collision), all slots are equally likely to be filled, but upon collision we deterministically increase the probability of filling certain slots: the slots that expand a clump.

A

We need a "new" approach that is deterministic in searching but does not increase the probability of collision with each insertion the way linear probing does. We need to distribute the insertion probability over the entire array/table.

39
Q

Double hashing outperforms linear probing

However, while double hashing mitigates the clumping problem when inserting, we still have an issue with remove()

A

We still have to worry about reinserting elements into the Hash Table periodically to clean up the “delete” flags

In all the open-addressing methods discussed so far, the probability of future collisions increases each time an inserted key faces a collision.
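
A hedged C++ sketch of remove() under a double-hashing probe sequence using "delete" flags (tombstones); the hash functions and struct are illustrative assumptions:

  #include <cstddef>
  #include <functional>
  #include <string>
  #include <vector>

  struct Slot {
      std::string key;
      bool occupied = false;
      bool deleted  = false;   // the "delete" flag (tombstone)
  };

  // Double-hashing probe: index_i = (h1(k) + i * h2(k)) % m, with h2(k) != 0.
  std::size_t h1(const std::string& k, std::size_t m) {
      return std::hash<std::string>{}(k) % m;
  }
  std::size_t h2(const std::string& k, std::size_t m) {
      return 1 + std::hash<std::string>{}(k + "#") % (m - 1);
  }

  bool removeKey(std::vector<Slot>& table, const std::string& key) {
      const std::size_t m = table.size();
      for (std::size_t i = 0; i < m; ++i) {
          std::size_t idx = (h1(key, m) + i * h2(key, m)) % m;
          if (!table[idx].occupied && !table[idx].deleted)
              return false;                       // truly empty slot: key not present
          if (table[idx].occupied && table[idx].key == key) {
              table[idx].occupied = false;
              table[idx].deleted  = true;         // flag instead of emptying the slot
              return true;
          }
      }
      return false;
  }
  // Periodically, live keys are reinserted into a fresh table to clean up tombstones.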

40
Q

Separate Chaining: each hash table slot stores a pointer, not the data

Motivated by worst case run time increasing towards O(n)

A

Each index in our Hash Table stores a pointer to a linked list (see the sketch after this list):

  1. Look up the index from hashing H_1(k)
  2. Follow the pointer to the linked list
  3. Check if k is in the linked list.
    3a. if not, insert k into the linked list at arr[index] (this is closed addressing)
    3b. if it is, the element is already in the table.
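
A minimal C++ sketch of these steps (the container choices are illustrative assumptions):

  #include <cstddef>
  #include <functional>
  #include <list>
  #include <string>
  #include <vector>

  // Each array slot holds a linked list; a key always lives in the list at
  // its hashed index (closed addressing).
  bool chainedInsert(std::vector<std::list<std::string>>& table,
                     const std::string& key) {
      std::size_t idx = std::hash<std::string>{}(key) % table.size(); // 1. look up the index
      std::list<std::string>& chain = table[idx];                     // 2. follow pointer to the list
      for (const std::string& k : chain)                              // 3. check the list for k
          if (k == key) return false;                                 // 3b. already in the table
      chain.push_back(key);                                           // 3a. not found: insert into the chain
      return true;
  }
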
41
Q

Separate Chaining is often considered an OPEN HASHING collision resolution strategy

A

Open hashing and closed addressing are often used interchangeably

42
Q

With Separate chaining and a doubly linked list behind each index, what is the expected worst case runtime?

A

O(n)

43
Q

What is the compromise in using Separate Chaining in a Hash Table?

Memory, we are storing pointers

Memory is “disconnected” from original hashTable.

A

The distance causes incremental issues in the form of poor cache performance (the computer requires time to traverse to the data and send it back through a BUS or other system outside of the code).

44
Q

Cuckoo Hashing

Most aggressive Collision Resolution Strategy

Uses two hash functions H_1(k) and H_2(k) , where one key can generate two different locations

A

As both tables begin to fill up, the probability of collisions increases. However, unlike the other collision resolution strategies we discussed, in which a key k could end up trying every single Hash Table slot until the entire Hash Table was full (e.g., Linear Probing, Double Hashing, Random Hashing), a key k in Cuckoo Hashing only has two different locations that it can map to (index1 = H_1(k) and index2 = H_2(k)).
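
A hedged C++ sketch of a Cuckoo Hashing insert with two tables (the displacement limit and hash functions are illustrative assumptions):

  #include <cstddef>
  #include <functional>
  #include <optional>
  #include <string>
  #include <utility>
  #include <vector>

  struct CuckooTables {
      std::vector<std::optional<std::string>> t1, t2;
      std::size_t h1(const std::string& k) const { return std::hash<std::string>{}(k) % t1.size(); }
      std::size_t h2(const std::string& k) const { return std::hash<std::string>{}(k + "#") % t2.size(); }
  };

  // Returns false if the displacement limit is hit (a likely cycle);
  // the caller would then rehash everything with two new hash functions.
  bool cuckooInsert(CuckooTables& ht, std::string key, std::size_t maxKicks = 32) {
      for (std::size_t kick = 0; kick < maxKicks; ++kick) {
          std::size_t i1 = ht.h1(key);
          if (!ht.t1[i1]) { ht.t1[i1] = key; return true; }  // slot in table 1 is free
          std::swap(key, *ht.t1[i1]);                         // evict the current occupant

          std::size_t i2 = ht.h2(key);
          if (!ht.t2[i2]) { ht.t2[i2] = key; return true; }  // evicted key fits in table 2
          std::swap(key, *ht.t2[i2]);                         // otherwise evict again and loop
      }
      return false;
  }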

45
Q

Unless Cuckoo Hashing has a limit on how many times it attempts to move a key, the algorithm may never stop (it can cycle forever).

Rehashing is generally done by introducing two new hash functions and reinserting all the elements.

A

It is important to make sure that the second hash function used returns different indices for keys that originally hashed to the same index. This is because, if a key collides with another key in the first Hash Table, we want to make sure that it will not collide with the same key again in the second Hash Table.

46
Q

By making the sacrifice of only allowing each key to hash to strictly two different locations (thereby potentially causing the cycle in the first place), we end up getting a reward of a worst-case constant time complexity for two of our major operations.

A

Find(): O(1)

Delete: O(1)

47
Q

In Cuckoo Hashing, each key is not able to hash to more than 2 locations.

A

KEY aspect/strength of CUCKOO Hashing

If you ever cycle (come back to the starting index in table 1), then you stop (and rehash)

48
Q

MAP ADT allows the mapping of keys to their corresponding values

MAP ADT is considered an associative array. It gives an associative cluster of our data.

A

MAP ADT

put(key, value)
get(key)
remove(key)
size()
isEmpty()
49
Q

MAP ADT can be implemented with binary tree or with a HASHTABLE

Implementing MAP ADT with a Hash Table we call it now a HashMAP

We add two additional functions:
hash_function(key)
key_equality(key1, key2)

A

Essentially, in a hash map, keys must be hashable and have an equality test to check for uniqueness.

If a designer wants to hash a custom object, they need to overload the hash and equality member functions (see the sketch below).
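
A C++ sketch of what that looks like for a made-up custom key type (the Point type and PointHash functor are illustrative assumptions):

  #include <cstddef>
  #include <functional>
  #include <string>
  #include <unordered_map>

  struct Point {
      int x, y;
      bool operator==(const Point& other) const {        // key equality test
          return x == other.x && y == other.y;
      }
  };

  struct PointHash {                                      // hash function for the key
      std::size_t operator()(const Point& p) const {
          return std::hash<int>{}(p.x) * 31 + std::hash<int>{}(p.y);
      }
  };

  // Usage: std::unordered_map<Point, std::string, PointHash> labels;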

50
Q

Bloom Filters address achieving fast runtime in determining true negatives, with the added possibility of false positives, which isn't that "bad"

Uses an underlying probabilistic DATA STRUCTURE

A

Sacrifice precision for memory efficiency

Memory uses boolean instead of integer data, so literally 1 bit per space/index

51
Q

Benefit of a Bloom Filter: as m (the number of buckets/spaces in the table) increases, the probability of a false positive decreases.

A

In creating a Bloom filter we should do our best to guess how many elements the user may need, or how many elements they plan to insert.

We try to pick a relatively small epsilon we can tolerate. A smaller epsilon means we will implement a larger table to minimize false positives, but that means using more memory.

Determine how many hash functions to use: k = -log_2(epsilon)
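
For a quick worked example (the 1% tolerance is an illustrative choice, not from the card): tolerating a false-positive rate of epsilon = 0.01 gives k = -log_2(0.01) = log_2(100) ≈ 6.64, so about 7 hash functions.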

52
Q

Count-Min Sketch can only report the maximum possible count of X (an upper bound). It cannot report an exact value.

A

Benefit is that it provides important valuable data without adding more memory consumption.