1324 Algorithmics Flashcards

Question

What is the complexity of Union-Find?

Answer 1

Initialisation: O(N) Find is O(1) Union is O(N)

Answer 2

Components are stored as trees, making the whole thing a forest. The id of an element is it's parent, if not root.

Answer 3

Initialisation is O(N) Find is O(N) Union is O(N)

Answer 4

Keep the tree balanced. Maintain a size[] to keep track of the number of times in each tree Merge the smaller tree into the larger one (link the root of the smaller tree directly to the root of the larger one) Union method merges.

Answer 5

When we perform find operation in Quick-Union, we actually look for a path from element p to its root. Set the id of items on this path to be the root each time a root is computed, flattening the truth.

Answer 6

Initialisation is O(N) Find is O(log* N) Union is O(log* N)

Answer 7

Iterative logarithm The number of times needed to apply a logarithm before you get a number less than or equal to one.

Answer 8

Keeps a subsequence of elements on the left in the correct report. The subsequence is increased by inserting the next element into it's correct position in the sorted subsequence. With each iteration we move the current element one to the right. This is stable and in-place

Answer 9

Worst: theta(n^2) Average theta(n^2) Best theta(n)

Answer 10

Search for the smallest element, then put it in the right place, and so on for the second. This is in-place but not stable.

Answer 11

theta(n^2)

Answer 12

A binary tree where every level above the lowest is fully occupied. And the nodes on the lowest are all to the left. Each child has a value greater than or equal to it's parent.

Answer 13

Add the element to the next available space, percolate up tree to maintain ordering.

Answer 14

The minimum element is the root of the tree Remove this element Replace it with the last element in the heap Percolate this element down to the bottom of the heap choosing the minimum child.

Answer 15

Percolating one level is theta(1) For the height is theta(log n) The remaining operations are theta(1)

Answer 16

A queue where we assign a priority to each element, the element with the highest priority is the head of the queue.

Answer 17

We simply add elements to the heap and then remove them. This is not in-place and uses theta(n) additional memory The worst case complexity is O(n log n)

Answer 18

Balanced Multiway trees for fast search, finding successors and predecessors, insert, delete, maximum and minimum. They are designed to keep data close together on disk, to reduce retrieval time.

Answer 19

log_M(n) = log_2(n)/log_2(M) is the new access time Trees where each non-leaf node has M children

Answer 20

All internal nodes and leaf nodes contain data pointers along with keys Sequential access of nodes is not possible Searching is slower Insertion and deletion is slower

Answer 21

Only leaf nodes contain data pointers, internal nodes contain keys only Sequential access is possible like a linked list as leaf nodes are linked to each other Searching is faster Insertion and deletion is faster and easier.

Answer 22

The data items are stored only at the leaves The non-leaf nodes store up to M-1 keys to guide the search: key i represents the smallest key in the subtree i+1 The root is either a leaf or has between 2 and M children All non-leaf nodes except the root have between M/2 and M children All leaves except the root are at the same depth and have between L/2 and L data entries

Answer 23

This depends on the block size (amount that can be read from the disk in one go) or just L = M -1

Answer 24

A multiway tree often used for storing large sets of words. A tree with a possible branch for every letter of an alphabet, where all words end with a special $

Answer 25

They are another way of implementing sets. Providing quick insertion, deletion, and find. However, they waste a lot of memory.

Answer 26

An extreme solution to address the memory issue is to build a bit-level trie so the resulting data structure is a binary tree. It differs from a regular binary tree as the decision is dependant on the current bit. Although you loose the advantage of a multiway tree (of reducing the depth)

Answer 27

Correct, efficient and easy to implement. In that order.

Answer 28

When we can show: It holds true prior to the first iteration It holds true before an iteration of the loop and remains true after the next iteration When the loop terminates, the property is useful in showing the algorithm is correct.

Answer 29

A property of a loop that helps us understand why an algorithm is correct

Answer 30

A lower bound of f(n) is a guarantee that no one can user fewer than f(n) opeartions

Answer 31

A method for visualising the lower bound on complexity The worst case: depth of the deepest leaf Average: average depth Best: Depth of the shallowest leaf.

Answer 32

It must have a leaf for every possible way of sorting the list by considering all permutations. These correspond to different paths in the tree Each leaf gives a possible ordering n! permutations log_2(n!) will be the height of the tree

Answer 33

Compare the elements of a and b Choose the smallest and put in C and repeat.

Answer 34

It is stable but not in-place

Answer 35

if start < end THEN - mid = floor((start+end)/2) - MERGESORT(a,start,mid) - MERGESORT(a, mid+1, end) - MERGE(a,start,mid,end) else THEN - return

Answer 36

O(n log n) T(n) = 2T(n/2)+O(n)

Answer 37

If the sub array falls below a certain size we can use insertion sort instead.

Answer 38

if start < end THEN - pivot = CHOOSEPIVOT(a, start, end) - part = PARTITION(a, pivot, start, end) - QUICKSORT(a,start, part-1) - QUCIKSORT(a, part+1, end) else THEN - return

Answer 39

Where the pivot is either the smallest or largest element We need to choose an efficient pivot.

Answer 40

We can choose the median of the first, middle and last element. Choose the pivot randomly Or use an algorithm.

Answer 41

Partitioning is theta(n) The worst case is O(n^2) when the pivot is smallest or largest Best case: omega(log n) Average case: O(n log n) Worst case: O(n^2) as above In practise it is close to O(n) with 39% more comparison compared to merge, but less data is moved.

Answer 42

Insertion: theta(n^2), theta(n), theta(n^2) YES YES Selection: theta(n^2), theta(n^2), theta(n^2), YES, NO Bubble: theta(n^2), theta(n), theta(n^2), YES, YES

Answer 43

When the array gets significantly small we can use insertion sort.

Answer 44

A theta(n) algorithm for generating the next partition point. If the value of a at j < p then we move i to the right and swap the value at j and the value at i.

Answer 45

p=a_right i = left - 1 for j = left to right - 1 DO - if a_j < p THEN - - i = i + 1 - - swap(a_i,a_j) swap(a_i+1, p) return i+1

Answer 46

i = left -1 j = right + 1 while True DO - do - - j = i - 1 - while a_j > p - do - - i = i + 1 - while a_i < p - if i < j THEN - - swap(a_i,a_j) - else THEN - - return j

Answer 47

Choose the median of first, middle and last elements Randomly select pivot points Randomly shuffle the input in the beginning.

Answer 48

for i = a.last to 2 DO - j = a.random in range (1,i) - swap(a_i,a_j) This produces a random permutation of the array and is theta (n) and in-place

Answer 49

Where we choose two or more pivots to improve efficiency The number of required comparisons remains the same, so does the number of swaps, however, the number of cache misses is decreased.

Answer 50

An array a of size n Each item in the array is a d-digit number where each digit can take at most k values and their order. for i = 1 to d DO - Use a stable sort to sort the items

Answer 51

A stable but not in-place algorithm The complexity is theta(n+k) if n>>k, this is linear sorting algorithm if k>>n, there is no meaning for the use of counting sort.

Answer 52

theta (d(n+k)) Where d is the number of digits, and k is the base. If d is constant, Radix runs in linear time Radix is stable, but not in-place and not as general as other spots. Radix sort is unpopular

Answer 53

Let C[0,...,k] for i = 1 to k DO - C[i] = 0 for i = 1 to A.length DO - C[A[i]] = C[A[i]] + 1 for i = 1 to k DO - C[i] = C[i] + C[i-1] for j = A.length to 1 DO - B[C[A[j]]] = A[j] - C[A[j]] = C[A[j]] - 1

Answer 54

The size of the input

Answer 55

Omega is a lower bound for the rate of growth, while O is the upper bound

Answer 56

The average case

Answer 57

Specifies the operations through which a certain data structure can be accessed. Allowing you to declare your intentions.

Answer 58

LIFO Easily implemented using arrays

Answer 59

Push Pop Peek isEmpty isFull

Answer 60

Enqueue Dequeue Peek IsEmpty

Answer 61

They have a simple interface Reduce the access to memory

Answer 62

Arrays or linked lists

Answer 63

Insert findMin deleteMin

Answer 64

Linked list or binary tree (binary tree heap is most efficient)

Answer 65

Add Remove Set Get

Answer 66

A collection of elements where the order of the elements is important

Answer 67

A mathematical set with no ordering or repetition

Answer 68

Add Remove Contains Size isEmpty

Answer 69

HashSet -> Fastest access TreeSet -> For comparisons

Answer 70

A constant addressable memory for pairs, accessed using a key.

Answer 71

Put Get Remove size

Answer 72

Binary Trees Hash tables

Answer 73

Access: theta(1) Insert: O(n) Delete: O(n)

Answer 74

If they become they have to be resized which is slow.

Answer 75

Because you have to create a new array and re-add everything to the new array.

Answer 76

Each element is a node, where a node contains a reference to a value and a reference to the next node.

Answer 77

Each node has only a reference to the one ahead

Answer 78

Each node has a reference to the next and previous node.

Answer 79

Size isEmpty add remove_head Contains get_head get

Answer 80

Create a new node, and set the next of the last node to this new node, and if applicable the previous of this new node to the previous node

Answer 81

All operations are theta(1)

Answer 82

Find: O(n) Add or remove from tail: theta(1) Insert or delete given position: theta(1)

Answer 83

Contain no data, but it tells you where the head and tail are to improve the implementation

Answer 84

Hierarchies of linked lists which support binary search that provide theta(log(n)) search and have similar complexity to a binary tree

Answer 85

We reduce solving a problem to solving one or more smaller problems of the same type. We repeat this until we reach a trivial case we can solve by other means.

Answer 86

Base case: Where solving the problem is trivial Recursive clause: which is a self-referential part driving the problem towards the base case.

Answer 87

When the subproblems overlap, like in the Fibonacci sequence with n-1

Answer 88

An acyclic undirected graph.

Answer 89

A tree with on root node, so each node has one parent except the root node itself.

Answer 90

A subtree plus the root.

Answer 91

Max level + 1

Answer 92

A tree where each node has either 0, 1 or 2 children.

Answer 93

T element <- Actual Value Node left Node right Node parent

Answer 94

Each element in the left subtree is smaller than the root Each element in the right subtree is bigger than the root. Both the left and right subtrees are binary search trees.

Answer 95

Start at the root Go left if less Go right if greater if equal then found if leaf then not in tree

Answer 96

An algorithm to print the elements of a binary search tree in sorted order. print(e) if e != null THEN - print(e.left) - print(e.element) - print(e.right)

Answer 97

Follow two rules: If the right child exists then move right once and go as far left as possible. Otherwise, go up to the left as far as possible and then move on up right.

Answer 98

The case is trivial if it only has one child or no children, but if it has two children then we replace it with it's successor and the repeat this on the successor.

Answer 99

l = log(n+1), which is theta(log(n))

Answer 100

A tree with a height difference between leaf nodes > 1

Answer 101

Left Right Left - right double Right - left double

Answer 102

Single is for when the unbalanced subtree is on the outside Double is for when the unbalanced subtree is on the inside.

Answer 103

The heights of the left and right subtree differ by at most 1. The left and right subtrees are AVL trees, guaranteeing logarithmic depth.

Answer 104

h is O(log(n))

Answer 105

The Fibonacci sequence

Answer 106

Height of the left subtree - the height of the right subtree.

Answer 107

0 - equal -1 - right is deeper than left 1 - left is deeper than right < -1 - we need to rotate > 1 - we need to rotate If there is a change of sign between balance factor values a double rotation is needed.

Answer 108

Rebalancing

Answer 109

A set of vertices and a set of vertices

Answer 110

A sequence of vertices.

Answer 111

If all vertices except the first and last are distinct.

Answer 112

A graph in which from every vertex there is a path to every other vertex.

Answer 113

The number of edges incident on it.

Answer 114

When the graph is sparse

Answer 115

O(|E|+|V|) O(|V^2|)

Answer 116

Start from a node. Then discover all nodes reachable from the current node. The move one to the next node. Noting that we have visited all nodes.

Answer 117

O(|V|+|E|)

Answer 118

As some vertices may not be reachable from S

Answer 119

To calculate the shortest path

Answer 120

A graph which can be partitioned into two sets such that (u,v) is an edge where u is a member of set A and v is a member of set B.

Answer 121

Edges are explored out of the most recently discovered node, we can backtrack to the predecessor to complete the graph.

Answer 122

O(|V|+|E|)

Answer 123

Linear ordering of nodes of a directed graphs. Such that for each edge from u to v, u comes before v.

Answer 124

Call DFS on the graph Every time the processing of a node is finished it is added to the front of a linked list Once finished, the resulting list is the topological sorting.

Answer 125

A directed graph where there is a path from every vertex to every other vertex.

Answer 126

Call DFS to compute finishing time for all nodes Create G^T Call DFS in order of decreasing finishing time of G^T

Answer 127

Tree T subset E that spans G and hast the least weight. T is A tree Spanning Least weight

Answer 128

We build an MST by adding edges one at a time. We can iterate over the edges in the sorted E (by weight) Then using the UNION-FIND data structure to check if adding an edge creates a cycle.

Answer 129

Maintain sets S and V-S Find x where x in V, y where y in V-S such that w(x,y) is less than or equal to w(u,v) for all v and u Remove y from V-S and add it to S and add the edge (x,y) to T Repeat until V-S is empty.

Answer 130

O(m log n) as it Union is log n and that occurs m times

Answer 131

We only need to consider the previous nodes neighbours.

Answer 132

O(m log n)

Answer 133

Maintain a set of edges A. A is a subset of some MST At each step add an edge such that A is still a subset of the MST

Answer 134

P(u,v) = { min w(p) infinite }

Answer 135

When it contains a negative weight cycle

Answer 136

The total cost of the shortest path from s to v

Answer 137

Testing if we can improve the shortest path cost of v by using that edge

Answer 138

Bellman-Ford works with negative cycles.

Answer 139

Relax is applied to all edges in the graph |V| - 1 times

Answer 140

O(|V| x |E|) Slower than Dijkstra's

Answer 141

Maintains a set of S nodes whose paths have been determined. All other nodes are kept in a priority queue, to keep track of the next node to the process. Predecessor stored for recovery of shortest path

Answer 142

O((|V| + |E|) log |V|) Relax is log n due to having percolate up the heap when you change values.

Answer 143

Make locally optimal solutions at each step with the hope that such choices will lead to a globally optimal solution, keyword: hope.

Answer 144

They are efficient

Answer 145

Dijkstra's, Prim's and Kruskal's

Answer 146

We have a set of n jobs, each with a weight w, and length l. They must be run sequentially

Answer 147

f = sum to n from k = 1 of w_k * c_k

Answer 148

Same weighting -> shortest length first Same length -> largest weight first

Answer 149

The ratio between the weight and the length

Answer 150

We have a set of n intervals (s,e) where s and e are the start and ending time. We need to choose a non-overlapping subset of those intervals such that the total number of selected intervals is maximum.

Answer 151

Use the earliest finishing time

Answer 152

We build a min-heap based on finishing times. We the iterate through until Q is empty extract the minimum node from the min heap and if the start is greater than the last finishing time we add it to the solution.

Answer 153

O(n log n)

Answer 154

No A greedy solution has not been found but has not been disproven

Answer 155

Given n items having value v_1,...,v_n and weights w_1,...,w_n and a knapsack of capacity C. We need to maximise sum from i=1 to n of x_i * v_i where that is <= C

Answer 156

Add the item with the biggest ratio of value per weight until the next item does not fit in the knapsack so we add a fraction of it. We can either sort items by ratio or add them to a max-heap with the ratio as key.

Answer 157

We have an alphabet S={s_1,...s_k} and we need to encode a message M consisting of n symbols from S. We need ton use less than n * log(k) bits to store the message

Answer 158

Be using less bits to represent more common characters, but, now we have variable size.

Answer 159

Using prefix codes where no code is a prefix of another. a = 0 then b != 01 but b = 10 There is a 1:1 mapping to binary trees

Answer 160

3, it just the depth of the tree

Answer 161

A = sum from i =1 to k of f_i * d(s_i) Where d(s_i) is the depth of the corresponding to s_i

Answer 162

No, there are many

Answer 163

Writing a solution of a problem in terms of the solutions to its subproblems.

Answer 164

Optimal substructure: a recurrence relation between the optimal solutions to the problem and its subproblems. Overlapping subproblems: otherwise it is divide-and-conquer.

Answer 165

Since subproblems overlap we end up computing the same subproblem multiple times.

Answer 166

Memorisation or bottom-up solutions

Answer 167

Given a graph G=(V,E) an independent set is a subset of vertices such that for all (x,y) in S, (x,y) is not in E We want to maximise |S|

Answer 168

Let O be the optimal solution with the value of opt(n) Consider the last node, either that is in the solution or it is not. If it is, then the node prior is not in the solution, so we compute the optimal for the node two prior. Otherwise, the optimal solution is the same as the one obtained from the node prior

Answer 169

We walk backwards

Answer 170

DP can handle intervals with weights

Answer 171

Let O_n be the optimal solution for the first n intervals and opt(O_n) be the value. Consider the last interval, should we include it? It depends on if the opt become bigger by adding the interval or not.

Answer 172

O(n log n)

Answer 173

If we have two strings X and Y, we have to find the longest common subsequence (characters do not have to be consecutive)

Answer 174

Start from the right: if the characters are not the same we can go up of left, depending on the Max. If they are the same both up and then left, record a letter. Until we reach a zero.

Answer 175

Let S_n be the optimal solution Either an item is or isn't in the sack. If it isn't then there is n-1 remaining items and no change to capacity. If it is then there are n-1 remaining items, but with a decreased capacity. Since we don't prior if adding is the correct option we compute both and take the largest

Answer 176

A company buys rods and cuts them into smaller pieces. The company then sells them. Given a rod of length n, the company would like to cut it to maximise profits.

Answer 177

Either we cut the rod at position i or we don't. We can simplify this into a recurrence relation r(n) = max^n_k=1 [p[k+r(n-k)]

Answer 178

2^n without optimisation

Answer 179

Given an array of numbers, with at least one negative. Find the subarray whose sum of elements is maximum.

Answer 180

We are given an array and a target. Is there a subset that sum is the target.

Answer 181

Start with the last element, either it is in the solution or it is not. If it is, then we find a subset of n-1 elements whose sum is the target - value Else, we find a subset whose sum is the target and is of length n-1 If s = 0 then we return true or if n=0 was true we return false.

Answer 182

If the original is f[e,i[ then we have new options we can go down d[v_1,i] and then w or d[v_2,i] and then w. We should still pick d[e,i] as it is the minimum. To compute the indegree we create an array of linked lsts of length n. foreach v in V for each u in adj[r] inDegree[v] push(u)

Answer 183

O(nm) for the main algorithm O(n^2) for the indegree computation

Answer 184

It is used to find the shortest path between all Paris of vertices in weighted graph.

Answer 185

Given a simple path, p = an intermediate vertex is any vertex of p other than v_1 and v_k. Denoted by pi^k_ij be set of paths from v_i to v_j whose intermediate vertices are drawn from the main set. v_k not in pi in which case pi is in PI Or v_k in pi then pi can be decomposed into two shortest paths pi_1 and pi_2 where pi_1 is in PI^{k-1}_ik and pi_2 in PI^{k-1}_k

Answer 186

The problem of maximising or minimising a linear function subject to a finite number of linear inequalities

Answer 187

Simplex or the interior point method

Answer 188

Maximisation of a linear function subject to linear inequalities

Answer 189

Obtain a new basic feasible solution which has a higher value of z.