CS6515_Exam2 Flashcards

Question

What is the "Cut Property"?

Answer 1

The Key Idea: Any edge which is minimum weight across a cut is going to be part of SOME MST. More Precisely: Considers UNDIRECTED graph G=(V,E) 1. Take a subset of edges X ⊂ E, where X ⊂ T, where T is a partial solution (i.e. it is part of some MST) that we assume is correct so far 2. Take a subset of vertices S ⊂ V where no edge is in the cut (S, S’) 3. Look at all edges of G in cut (S, S’) 4. Let e* be min weight edge in cut (S, S’) 5. Then: X ∪ e* ⊂ T’, where T’ is a new MST See DPV 5.1.2, also lecture video GR3_9-10.

Answer 2

TODO: Start watching around the 1hr 15min mark.

Answer 3

Abstract Type ---> Data Structure * Stack ---> Array, Linked List * Queue ---> Array, Linked List * Priority Queue ---> Heap * Dictionary / Hashmap ---> Array Source: Udemy Algorithms and Data Structures in Python lectures

Answer 4

O(1). This is because items are stored adjacent to one another in RAM.

Answer 5

Anytime we visit a new vertex via an edge, we form a Tree Edge.

Answer 6

An edge that goes from a node to a DESCENDANT in the tree.

Answer 7

An edge that goes from a node to an ANCESTOR in the tree.

Answer 8

Any other edge that isn't a Tree, Forward, or Backward edge, i.e. non-ancestry relationship. (I think thinking of them as siblings or cousins is a helpful way of distinguishing, because neither of the nodes are descendants/ancestors of one another in a cross-edge relationship.)

Answer 9

Tree edge (or Forest Edge, more generally)

Answer 10

Back edge Note: This was actually one of our poll questions.

Answer 11

Tree edge, forward edge, or a cross edge. If the post order number is going down and the relationship is a direct parent/child, then it's a tree edge. Otherwise it must be skipping down generations to a descendant, in which case it is a forward edge. If there's no ancestor relationship (i.e. the nodes are siblings or cousins), then it is a cross edge).

Answer 12

A tree is an undirected graph that is connected and acyclic. | See DPV pg. 129 for good info on tree properties.

Answer 13

A tree. | See DPV pg. 129 for good info on tree properties.

Answer 14

n - 1 | See DPV pg. 129 for good info on tree properties.

Answer 15

A tree. | See DPV pg. 129 for good info on tree properties.

Answer 16

True. In a tree, any two nodes can only have one path between them; for if there were two paths, the union of these paths would contain a cycle.

Answer 17

True. If it has n - 1 edges, then this must be the case. | See DPV pg. 129 for good info on tree properties.

Answer 18

decreasing; post

Answer 19

It's a greedy algorithm. It attempts to find the global optimum by with the aid of the local optimum. Given an input node, Dijkstra's algorithm initially assumes an infinite distance between other nodes, slowly iterates and updates each node with the local best path and the best path found so far.

Answer 20

Bellman-Ford can handle positive and negative edge weights, whereas Dijkstra's can only handle positive edge weights.

Answer 21

A heap (in this class we typically assume min-heap)

Answer 22

The idea of "Relaxation" of edge weights. It does this for all edges in the graph at the same time for V - 1 iterations (and then it does one more iteration to detect negative cycles). This gives us a runtime complexity of O(n*m).

Answer 23

O(n*m), where n = |V| and m = |E|

Answer 24

There MUST be a negative weight cycle present.

Answer 25

greedy; dynamic programming

Answer 26

|V| - 1, i.e. m - 1 if we don't check for negative weight cycles. If we do check, then we simply perform one more iteration, so a total of |V| iterations.

Answer 27

False. It can also handle negative edge weights, which is one of it's principle advantages over Dijkstra's algorithm.

Answer 28

1. Dijkstra's algorithm 2. Bellman-Ford algorithm 3 A* Search 4. Floyd-Warshall algorithm Note: I don't think A* is mentioned in the lectures or textbook, so I don't think it is available as a black box to us.

Answer 29

Queue, i.e. FIFO, like a line at Disney World

Answer 30

Stack, i.e. LIFO, like a stack of dishes

Answer 31

BFS: O(n), because we have to use a queue that ends up upper bounded by the number of leaf nodes. DFS: O(log n), because we use a stack data structure that only has to maintain up to the HEIGHT of the tree

Answer 32

Generally DFS, because it in general has better space complexity and has some other nice properties. This doesn’t mean we should avoid BFS; but I think for this class we’re likely to see more usage of DFS, whether on its own or under the hood of other algorithms (e.g. SCC, etc). AI/robot motion is a good example of an exception to this where we might prefer BFS over DFS. This is because BFS goes layer-by-layer, so it can be better for discovering the LOCAL ENVIRONMENT.

Answer 33

1. Graph G = (V, E), directed or undirected | 2. Source vertex s in V

Answer 34

1. dist[u]: array containing distance from source s to u IF s can reach u; inf otherwise 2. prev[z]: array containing the parent index of vertex z Source: Joves' notes, HW4 practice problems

Answer 35

O(n + m), i.e. linear

Answer 36

False. dist[u] is the given as the NUMBER OF EDGES from s to u, NOT the sum of the weights. Very important to remember this. Source: Joves' notes

Answer 37

1. Graph G=(V, E), directed or undirected, with POSITIVE EDGE WEIGHTS c_e in E 2. Source vertex s in V

Answer 38

1. dist[u]: array containing distance from source s to u IF s can reach u; inf otherwise 2. prev[z]: array containing the parent index of vertex z (can be used to reconstruct the shortest path) Source: Joves' notes, HW4 practice problems

Answer 39

O( (n + m)*log n)

Answer 40

False. Because the "shortest path" given by BFS is in terms of the NUMBER OF EDGES, and NOT the edge weights. We would need to use something like Dijkstra's algorithm for weighted edges (assuming the edge weights are positive).

Answer 41

They can both be used for Single Source Shortest Path (SSSP) determination. However, BFS ONLY works for unweighted graphs, whereas Dijkstra's can handle (positive) weighted edges.

Answer 42

Because BFS can only handle SSSP for UNWEIGHTED graphs, whereas Dijkstra's handles graphs with (positive) edge weights. If you're doing extra work to process weights, then you can't achieve linear O(n + m) runtime, hence the difference between the two.

Answer 43

1. Graph G=(V, E), directed or undirected Source: Joves' Notes

Answer 44

1. array prev[z]: the parent index of vertex z in the DFS visitation 2. array pre[z]: the pre-order number of vertex z in the DFS visitation 3. array post[z]: the post-order number of vertex z in the DFS visitation 4. array ccnum[z]: the connected components number of vertex z. Can also be the strongly connected component number (SCC) IF the vertices are passed in topologically sorted, i.e. as highest post number --> lowest post number AFTER running DFS on a REVERSED DIRECTED GRAPH. For a directed graph that is processed this way, the ccnum will also be the topological sort order in reverse. Source: Joves' Notes

Answer 45

True. This would come from the max value for the ccnum Source: Joves' Notes

Answer 46

False. This only works for UNDIRECTED graphs. For a directed graph, you would need to use something like SCC to do this. Source: Joves' Notes

Answer 47

They give information on how a graph COULD be explored. Some numbers might be interchangeable, others might not (see DPV 3.3(d) for a good example f this). Source: Joves' Notes

Answer 48

They give information on how a graph WOULD be explored GIVEN a particular starting point. A different root node might give an entirely different pre/post order numbering. Source: Joves' Notes

Answer 49

True. Keep in mind that a source vertex will be able to reach all connected vertices and will thus be more or less useless. TODO: Need to review the SCC section again, as this still isn't totally clear to me.

Answer 50

False. See DPV 3.3(d) for a good example of why this is the case. TL;DR version: post order numbering on a DIRECTED graph provides information on how a graph COULD be explored; it might admit multiple valid topological orderings.

Answer 51

1. DAG Source: Joves' Notes

Answer 52

1. array topo[i]: the vertex number of the i'th vertex in topological order from left to right, source to sink, (i.e. in DESCENDING post order number) Source: Joves' Notes

Answer 53

O(n + m) Source: Joves' Notes

Answer 54

It works by running DFS on a DAG and using the post order numbers to sort the vertices from highest to lowest.

Answer 55

In terms of decreasing post order number, i.e. from Source to Sink, which is equivalent to having all edges from left to right in the DAG.

Answer 56

True. No, it doesn't require manually sorting. Do an example by hand to see that this is true. (This is also part of why we can achieve an O(n + m) runtime, because if we had to sort that would be a minimum of O(nlogn) )

Answer 57

1. Graph G = (V, E), directed or undirected 2. vertex v in V to explore from Source: Joves' Notes

Answer 58

1. array visited[u] is set to "True" for all nodes u reachable from v 2. Any other data structure that DFS has, if needed Source: Joves' Notes

Answer 59

1. O(m) if run as part of DFS 2. O(n + m) if run by itself and visited[] array needs to be created Source: Joves' Notes

Answer 60

True, although not a comprehensive answer. This is True IF explore is run by itself which requires the visited[] array to be created. If run as part of DFS, then explore is O(m)

Answer 61

True. (The ccnum, prev, pre, and post, are all actually set in this subroutine; see DPV 3.2.2)

Answer 62

If we only need information about a single source vertex, then it's going to be better to use explore since it provides us with all the same outputs as DFS, but can isolate to only the source vertex of interest instead of running over all vertices)

Answer 63

True. Source: Joves' Notes

Answer 64

All of these are examples of algorithms for FINDING SHORTEST PATH(s) 1. Dijkstra: simple algorithm, and the fastest option at O( (n + m)*log(n)); however, can only use it IF all the edge weights are positive) 2. Bellman-Ford: Good to use if we have negative edge weights and are only interested in the Single Source Shortest Path (SSSP), which gives us a runtime of O(nm). 3. Floyd-Warshall: Good to use if we have negative edge weights and are interested in All Pair Shortest Path (APSP); the tradeoff is that this is the slowest order at O(n^3) Joves' notes also mention that while Bellman-Ford and Floyd-Warshall are both algorithms available to us as black-boxes, they are technically DP solutions, so think twice about whether using them is really warranted before you go off writing a solution using them). Source: Joves' Notes

Answer 65

1. Graph G=(V, E), directed. (Crucial to remember that SCC is for DIRECTED graphs!!!) Source: Joves' Notes

Answer 66

1. G_scc = (V_scc, E_scc), where G_scc is a meta graph (a DAG) with each SCC in G forming a vertex in V_scc and each edge between SCCs in E_scc 2. ccnum[.] comes from DFS used underneath the hood of this algorithm 3. V_scc will look like 1, 2, 3, 4... which are the ccnums 4. E_scc will look like (1, 2), (2, 3), (3, 4), ... which are edges between V_scc Source: Joves' Notes

Answer 67

Look up ccnum[u] and ccnum[v] Source: Joves' Notes

Answer 68

It is checking to see if u and v are part of the same strongly connected component.

Answer 69

DFS. Twice. Because we have to run on both G and the reverse of it G_r

Answer 70

True. See property on DPV 3.4.1, pg. 92.

Answer 71

Two nodes u and v of a directed graph are connected if there is a path from u to v AND a path from v to u. See DPV 3.4.1, pg. 91.

Answer 72

1. Run DFS once with pre/post order numbering on reverse graph G_r 2. Sort V in descending post order number (This gives sink to source) 3. Run DFS again on G with V sorted 4. Output will have ccnum representing SCC with higest = source, lowest = sink 5. Use the ccnum to gather up vertices belonging to each SCC 6. Check all the original edges and their endpoints. If the endpoints are in different SCCs, and a corresponding E_scc between those SCCs does not already exist, we add a E_scc to represent the edge from one SCC to another. Source: Joves' Notes

Answer 73

Source; Sink

Answer 74

1. Conjunctive normal form (CNF) formula f, which contains n variables (x1, x2, ... x_n), m clauses (x1 v x2) ^ (x3 v !x4)... Each clause has at most two literals (and for this course, we can assume that it is EXACTLY two literals). Source: Joves' Notes

Answer 75

``` An assignment (T or F) for each variable in f if it can be satisfied. Otherwise, 'NO' if it cannot be satisfied ``` Source: Joves' Notes

Answer 76

1. Graph G=(V, E), connected, undirected; edge weights w

Answer 77

1. A minimum spanning tree (MST) defined BY THE EDGES E_mst

Answer 78

O(m log m) or simplified to O(m log n)

Answer 79

1. Sort edges by weight 2. Grab lightest available edge THAT WONT CREATE A CYCLE. (If multiple edges with the same weight are available, it doesn't matter which one we pick, so long as it doesn't create a cycle.) 3. Keep doing this until all edges that will not create a cycle have been added (which will happen at exactly n - 1 edges)

Answer 80

When we've added exactly n - 1 edges (which is the definition of a tree)

Answer 81

It is one of the more common MST algorithms, so if we think we need MST, it's a good one to start with. The output gives us a set of edges which is useful to construct the next input for a black box, or to compare to the original G. Unlike Prim’s, it does not give us any path information.

Answer 82

1. Graph G=(V, E), connected, undirected; edge weights w

Answer 83

1. A minimum spanning tree (MST) defined BY THE ARRAY prev

Answer 84

O(m log m) or simplified to O(m log n)

Answer 85

1. Start with arbitrary vertex v and put it into a subtree S of included vertices 2. In each iteration, grow S by adding the LIGHTEST EDGE between a vertex in S and a vertex outside of S 3. Continue until all vertices are in S (this will happen at exactly n - 1 edges)

Answer 86

Dijkstra's algorithm. It's similar in that it can give us path information in the MST output.

Answer 87

Prim's algorithm gives us path information as part of the MST output, so if we need path information, that might be one reason to pick it over Kruskal's. In general, if a problem is asking about MSTs, it's probably best to start by thinking about using Kruskal's; ask yourself if you REALLY need path information. If so, Prim's is a good option, but it's definitely the less common scenario.

Answer 88

False. Because the output of Prim's algorithm is the array "prev" that contains the paths of the MST, whereas Kruskal's algorithms gives us the edges of the MST in the form of E_mst.

Answer 89

Kruskal's outputs the MST EDGES E_mst, whereas Prim's outputs the MST PATHS.

Answer 90

1. Conjunctive Normal Form (CNF) formula "f" that contains: n variables [x1, x2, ..., x_n] m clauses (x1 v x2) ^ (x3 v !x4) ... Each clause has at most 2 literals

Answer 91

True. Because any clause with one literal can be satisfied and removed leaving us with a CNF f' of only clauses with EXACTLY two literals

Answer 92

``` An assignment (T or F) for each variable in f if it can be satisfied; "NO" otherwise ```

Answer 93

two; x and !x

Answer 94

There would be 20 literals, because if there are n boolean variables then there will be 2n literals, for example: [x1, !x1, x2, !x2, ... x_n, !x_n]

Answer 95

False. They are part of the formula. The consequence is that finding all variables is O(m) because we have to look at all clauses and list them all.

Answer 96

It denotes the maximum number of literals in each clause (if there is no limit, then we simply call it a SAT problem)

Answer 97

NP-Complete

Answer 98

P (i.e. Polynomial)

Answer 99

It transforms a 2-SAT Conjunctive Normal Form(CNF) formula into a graph and solves it

Answer 100

1. Create a graph G from f by mapping the CNF to 2n vertices (one for each literal) and 2m edges (one for each implication) 2. Run the SCC algorithm on G 3. If for all x in X, x and !x are in DIFFERENT SCCs then the CNF is satisfiable, else return "NO" 4. Set all SOURCE literals to "FALSE" and remove the SOURCE SCC 5. Set all SINK literals to "TRUE" and remove the SINK SCC 6. Repeat steps 4 and 5 until all literals are set 7. Return the variable assignments

Answer 101

2n vertices (one for each literal) and 2m edges (one for each implication).

Answer 102

For this class, TAs have said we can assume O(m).

Answer 103

An (n x n) matrix) We do this by: 1. Initializing an (n x n) matrix # This is O(1) because cost is only incurred when actually writing values 2. Set each edge weight in the table # This is O(1) * O(m) = O(m) By only setting values where there is an edge, we avoid the typical O(n^2) cost of working with an (n x n) table

Answer 104

1. makeset(x): create a singleton set containing just x # O(1) 2. find(x): to which set does x belong? # O(log n) 3. union(x, y): merge the sets containing x and y # O(log n)

Answer 105

log(n). This fact means that it is also the upper bound on the running time of the "find" and "union" operations used by this data structure, i.e. O(log n)

Answer 106

That that element is the root of the tree

Answer 107

It is the height of the subtree hanging from that node

Answer 108

For any x, the rank(x) < rank(pi(x)) where pi here should be interpreted as a function that returns the parent pointer of x. A better way of looking at this is that the height of the subtree hanging beneath x (which is the definition of "rank") will always be strictly shorter than the height of the subtree hanging from the parent of x. (Which should be a pretty obvious statement - one rung higher up the ladder is, well... one rung higher up on the ladder.)

Answer 109

Any root node of rank k has at least 2^k nodes in its tree

Answer 110

2^k = 2^3 = 8 This is from property 2 for this data structure (see DPV 5.1.4 and Joves' notes)

Answer 111

That it was created by the merger of two trees with roots of rank k - 1

Answer 112

If there are n elements overall, there can be at most n / 2^k nodes of rank k Crucially, this implies that the max rank is log(n), therefore all the trees have height <= log(n), which is the upper bound on the running time of the "find" and "union" operations.

Answer 113

Since we're interested in how many nodes there could be with rank 3, this means k = 3. Using property 3, we know that n_k <= n / 2^k, i.e.: 16 / 2^3 = 16 / 8 = 2, therefore final answer is there could be at most 2 nodes with rank 3.

Answer 114

1. Dijkstra's algorithm: If we need simple algorithm for non-negative, weighted SSSP Pros: Fast, simple Cons: Does NOT work for negative weights 2. Bellman-Ford: if we need to get the Single Source Shortest Path (SSSP) on a graph that includes negative weights Pros: works with negative weights, faster than Floyd-Warshall [O(nm) vs O(n^3)] Cons: only provides answer for single source 3. Floyd-Warshall: if we need to get the All Pair Shortest Path (APSP) on a graph that includes negative weights Pros: works with negative weights, gives info for all vertices Cons: slowest option of the three [O(n^3)]

Answer 115

Greedy; dynamic programming

Answer 116

True. This is specific to explore without any additional details. We (the TAs) know this is an output and we simply read it as such. Source: Joves' notes

Answer 117

False. Without additional details, this is NOT something that DFS gives us. The visited array DFS returns is going to be all 'True' by the end of each run. VERY IMPORTANT TO REMEMBER THIS.

Answer 118

False. Without additional details, this is NOT something that DFS gives us. The visited array DFS returns is going to be all 'True' by the end of each run. VERY IMPORTANT TO REMEMBER THIS.

Answer 119

True. For explore, we simply run 'explore(s)' For DFS, we "start" it from s by putting s at the head of the list, so we would run DFS(G, s)

Answer 120

for v in V, find all v with the same ccnum as s. These are reachable by s. This is a detail you need to give, because it is NOT a free output of DFS! Source: Joves' notes

Answer 121

lower See DPV 3.3.2 pg. 90

Answer 122

If the explore subroutine is started at node u, then it will terminate precisely when all nodes reachable from u have been visited See DPV 3.4.2 pg. 92

Answer 123

The node that receives the HIGHEST post number in a DFS must lie in a SOURCE SCC See DPV 3.4.2 pg. 92

Answer 124

If C and C' are SCCs, and there is an edge from a node in C to a node in C', then the HIGHEST post number in C is bigger than the HIGHEST post number in C'. This can be restated as saying that the strongly connected components can be linearized by arranging them in DECREASING order of their highest post numbers. See DPV 3.4.2 pg. 92

Answer 125

True. This about it: in a DAG, you can ONLY go one direction. You can never go backwards to a previous vertex, so each vertex must be its own SCC.

Answer 126

True. With back edges, the post numbers go UP. For all other types of edges (tree, forward, cross), the post numbers go DOWN.

Answer 127

down (and the edge is part of the DFS tree)

Answer 128

down (and the edge points down to some non-child descendant in the tree)

Answer 129

down (and the nodes are not descendants of each other, i.e. they are siblings or cousins)

Answer 130

That there are no back edges in the graph (This comes from the cycle property - see GR1_9)

Answer 131

That a cycle is present in the graph.

Answer 132

False. Because of the cycle property. If a back edge is present, then there MUST be a cycle in the graph. Since DAGs are by definition acyclic, a DAG will never have a back edge.

Answer 133

False. This is because when we run DFS for the topological sort algorithm, the clock starts at 1, so the minimum value the post number can take on will be one. The maximum value will be 2n. This means we just initialize an array of size 2n, and stick the vertices in the array as we see them, which just gives us the vertices in decreasing post number. This only takes linear time to do, i.e. O(n + m) TODO: not 100% sure why the size is 2n and not just n

Answer 134

source; sink

Answer 135

It will always have the HIGHEST post number

Answer 136

It will always have the LOWEST post number

Answer 137

Two vertices v and w are connected if there is a path from v --> w AND a path from w --> v

Answer 138

An SCC is the maximal set of strongly connected vertices in a directed graph. This differs from a CC on an undirected graph because a CC in that graph is just the maximal set of connected vertices (i.e. the 'strong' part doesn't make sense in the context of an undirected graph. In an undirected graph there is always a path from u --> v and v --> u (obviously by just traversing the same edge in forward/reverse)

Answer 139

It is a DAG. Proof by Contradiction: Consider the alternative. If there was a path between an SCC S and another SCC S’ and also a path from S’ → S, then we would have a cycle, and it would mean that we could always get from anywhere in S to anywhere in S’, and vice-versa. This means that union(S, S’) would be an SCC, i.e. they would be in the same SCC, which obviously contradicts that they are in different SCCs.

Answer 140

True. This is because you could never have a cycle between two SCCs S and S', because if there was, then they would actually be part of the same SCC.

Answer 141

1. Find a sink SCC S 2. Output it, 3. Remove it 4. Repeat

Answer 142

Sink SCCs. Because the explore algorithm when run from a sink SCC will ONLY explore the vertices in that SCC and NOTHING ELSE.

Answer 143

False. This has important implications for our SCC algorithm, namely it is why we reverse the graph G to invert sources and sinks.

Answer 144

True. This has important implications for our SCC algorithm, namely it is why we reverse the graph G to invert sources and sinks.

Answer 145

True. This is what allows our SCC algorithm to work by inverting sources/sinks.

Answer 146

True. This is what allows our SCC algorithm to work by inverting sources/sinks.

Answer 147

O(n + m). Because under the hood SCC is simply running DFS twice, so O(n + m) + O(n + m) = 2*O(n + m) --> O(n + m)

Answer 148

The proof is based on the claim that the vertex with the highest post number always lies in a source SCC. We can prove this by making the simpler claim that for two SCCs S and S' if there is a path from vertex v in S to vertex w in S', then max(post(S)) > max(post(S')), therefore if we topologically sort by max post number, the highest number is guaranteed to be in the source SCC. It's easier to think of this visually. If there is a path from SCC S to S' and we start DFS from some vertex z in S', then we would visit all of S' and none of S. Conversely, if we start DFS from S, then the subtree of vertex z would contain the union(S, S'). This is what forces the condition that max(post(S)) > max(post(S')),

Answer 149

False. If there was such a bidirectional path, then both S and S' would be in the same SCC, which is a contradiction! Remember: a metagraph drawn on strongly connected components will always be a DAG i.e. no cycles will exist.

Answer 150

for all v in V, it returns dist(v) = length of shortest path from s --> v

Answer 151

(a) the min # of edges from s to u (or inf if a path doesn't exist) (b) that there isn't a path from s to u (c) True. It would return the previous vertex on the path from s to u.

Answer 152

For this class we assume O((n + m)*log n) using the "min-heap" data structure

Answer 153

^ == AND (remember that it looks kind of like an "A" | v == OR

Answer 154

For all k >= 3

Answer 155

False. There is indeed such a polynomial time algorithm. k-SAT problems are NP-Complete for all k >= 3

Answer 156

A clause with only one literal in it, e.g. (!x1) ^ (x1 v x4), the (!x1) is an example of a unit clause. The important thing about these clauses is that for a unit clause in a CNF, a unit clause MUST be satisfied in order for a solution to the formula to exist. This means that the unit clause imposes a constraint. So in the above example of the unit clause (!x1), this means that it MUST be that x1=False in order for the unit clause to evaluate to True.

Answer 157

We can simplify by doing: 1. Take a unit clause, say literal a_i 2. Satisfy it (set a_i = T) 3. Remove clauses containing a_i and drop !a_i 4. Let f’ be the resulting formula 5. Repeat the process on f' until we get down to clauses of size=2

Answer 158

2n vertices, which correspond to all the literals x1, !x1, x2, !x2, ..., x_n, !x_n

Answer 159

2m edges, which correspond to 2 "implications" per clause

Answer 160

Directed. This is a somewhat hand-wavy answer, but we're using the SCC algorithm on the graph, which only makes sense in the context of directed graphs.

Answer 161

That f is not satisfiable. Because we end up with a contradiction where with have to set x1=True and x1=False, which obviously can't be done. (It's easiest to see this by looking at one of the example problems from the lectures, like GR2_7)

Answer 162

That f is not satisfiable. Because we end up with a contradiction where with have to set x1=True and x1=False, which obviously can't be done. (It's easiest to see this by looking at one of the example problems from the lectures, like GR2_7)

Answer 163

(a) 2n = 2(3) = 6 total literals | (b) Yes, it is satisfiable because all x_i and !x_i lie in different SCCs.

Answer 164

No, it is not. In order for f to be satisfiable all the x_i and !x_i must lie in DIFFERENT SCCs.

Answer 165

True; False

Answer 166

False; True

Answer 167

That S is a sink SCC and !S is a source SCC (and vice-versa)

Answer 168

That for all x in X, x and !x lie in DIFFERENT SCCs.

Answer 169

We can conclude that f is not satisfiable. This must be because there is AT LEAST one x in X for which x and !x lie in the SAME SCC.

Answer 170

DFS would restart the explore subroutine once it explored each of the different subgraphs. BFS, on the other hand, would simply ignore nodes that aren't reachable from the start vertex s (in other words, the distance to nodes that aren't reachable from s in the rest of the forest will all be inf).

Answer 171

1. Tree on n vertices has n - 1 edges 2. Exactly one path between every pair of vertices 3. Any connected graph G=(V,E) with |E|=|V| - 1 is a tree. (This one is especially important for the cut property)

Answer 172

Disjoint Set (Union-by-Rank)

Answer 173

We can take a tree T and if we add an edge into that tree, e.g. T’ = T ∪ e* - e’, by first adding the edge e* that creates a cycle (because a tree always has n - 1 edges, so if I add an edge into an existing tree, we are guaranteed to get a cycle.) If we then remove any edge from that cycle, we get a new tree T’

Answer 174

A minimum weight edge across the cut is part of a MST

Answer 175

Forward edge

Answer 176

Cross edge

Answer 177

"You should consider the output of SCC to be just another graph. But each vertex is a ccnum, and each edge uses ccnums." Source: https://edstem.org/us/courses/22308/discussion/1589332?comment=3603438

Answer 178

False. This is in contrast to the shortest path problem, where having cycles were problematic (at least in the case of things like Dijkstra's algorithm).

Answer 179

True. Source: MF1_13 at end of video

Answer 180

Use DFS or BFS. If the path doesn't exist, then the algorithm just outputs the current flow f. Source: MF1_14

Answer 181

It assumes all capacities are integers. More sophisticated max-flow algorithms like Edmunds-Karp eliminate this requirement.

Answer 182

If we assume that all capacities are integers, then that means flow increases by >= 1 unit per round. So if C = size of the max flow, then we know there will be <= C rounds of the algorithm run.

Answer 183

Finding a path from s --> t using DFS or BFS in each round of the algorithm. Since for Ford-Fulkerson we worst case run <= C rounds, where C=size of the max flow, we end up with a final runtime of O(mC)

Answer 184

False. It is pseudo polynomial. This is because the runtime is O(mC), where C=size of the max flow. Making the runtime dependent on the input value for C makes the problem pseudo-polynomial (like the knapsack problem).

Answer 185

When it can no longer augment the path in the residual graph G_f*, i.e. when there is no longer a path from s --> t

Answer 186

That there is no longer an augmenting path in the residual graph G_f*. Based on this we know that f* is a max-flow. Source: See MF2_2

Answer 187

1. Flow network: directed graph G=(V,E), with start and end vertices s,t in V and with capacities c_e > 0 Note that for Ford-Fulkerson it requires that all capacities are integer values as well.

Answer 188

1. Flow f* of max size.

Answer 189

It's the amount of flow either out of start vertex s or in to end vertex t (which have to be equal because of conservation of flow, i.e. size(f) = f_out(s) = f_in(t) )

Answer 190

It is a cut that partitions the vertices V into two sets such that s is in L and t is in R

Answer 191

The capacity(L,R) = sum of the edge capacities from L--> R

Answer 192

False. We sum from L --> R

Answer 193

False. Checking for the path is actually the second part of the proof for the max-flow = min-cut theorem, so any algorithm that uses this property is guaranteed to be correct because of that general theorem.

Answer 194

Ford-Fullkerson: 1. Find augmenting paths using DFS or BFS 2. Runs in O(mC) time, where C is size of max flow 3. Assumes integer capacities Edmonds-Karp: 1. Find augmenting paths using BFS 2. Runs in O(n*m^2) time 3. No assumptions on capacities

Answer 195

Edmonds-Karp uses BFS to search for an st-path in the residual flow network graph, whereas Ford-Fulkerson can use DFS or BFS.

Answer 196

We would know for certain that at a minimum, the max flow would be upper bounded by the flow OUT of s' or by the flow IN to t'.

Answer 197

If we've fully capacitated all the edges out of s' and into t', then the flow is saturated. They are important because if G' has a saturating flow, then G has a FEASIBLE flow, and VICE-VERSA.

Answer 198

What has happened is that you've found a saturating flow, which implies that G has a feasible flow.

Answer 199

That G' has a SATURATING flow.

Answer 200

O(n*m^2) time

Answer 201

G=(V,E), c, s, t c: capacities such that c(u,v) > 0 for (u,v) in E s: source vertex t: sink vertex

Answer 202

f* f is a function or vector such that f(u, v) gives a flow of (u,v). It gets updated as the algorithm runs f* is the MAX-FLOW across G such that sum[f*(..., t)] is maximized size(f) is the numerical value of flow f calculated as sum[f*(..., t)]

Answer 203

That is is saturated. Because the current flow across the line is equal to the max capacity of the line.

Answer 204

f* is the max flow across G such that ∑f*(., t) is maximized; f* is what gets returned from our max-flow algorithms. Key Idea: All this is saying is that this function of all flow into the sink vertex is what gets maximized.

Answer 205

tl;dr: size(f) = max-flow into the sink t, which is the overall capacity C of the flow network. size(f) is numerical value of flow f calculated as ∑f*(., t); equivalently, size(f) = C, i.e. we define the max-flow into the sink t as the overall capacity C of the flow network

Answer 206

False. Ford-Fulkerson can be thought of as more of a template algorithm for solving max-flow problems, so how we check for that path is left undefined. Options for path checking then could be: DFS, Explore, or BFS.

Answer 207

It means that there must still be SPARE CAPACITY along the path from s --> t. This means that our algorithm won't terminate yet, because as long as there is a path from s --> t, then we haven't yet found a max-flow for the original flow network G.

Answer 208

True. As a result, there can be multiple valid solutions to a max-flow problem.

Answer 209

The algorithm might not terminate.

Answer 210

False. The flow is guaranteed to increase by >= 1 per round. The implication then is there are at most C rounds, where C is the size of the max flow.

Answer 211

Because it runs in O(mC) time, but C (the size of the max flow) has NOTHING to do with the size of the input. Think about it. The value for C is arbitrary; specifically it is a magnitude. But it doesn't have anything to do with the size of the input flow network G. Since the runtime isn't strictly a function of the input size then, we say it is pseudo-polynomial. Source: OH Week #6, 1:03:30

Answer 212

False, because Ford-Fulkerson is pseudo-polynomial. One situation where it could be faster is if C is bounded by n or m. In that case you could argue that since C is at most n or m, then C=n=m, so instead of the usual runtime of O(mC), it would be O(mn) or O(m^2), as opposed to O(nm^2) for Edmonds-Karp. This is one of those really subtle things that I suspect could be tested on an exam, so remember this one!

Answer 213

It depends. Edmonds-Karp is generally considered faster, FOR UNBOUNDED CAPACITIES. This is because of the pseudo-polynomial nature of Ford-Fulkerson. In the case where C is bounded to be at most n or m, then Ford-Fulkerson could be faster.

Answer 214

That the size of the max flow C is bounded by n or m. Because of the pseudo-polynomial nature of Ford-Fulkerson. In the case where C is bounded by n or m, we've fixed C so that it is no longer a magnitude and is instead directly bounded by the size of the input.

Answer 215

Any edge that disconnects the graph. They apply to undirected graphs.

Answer 216

Run DFS on the graph and then order the post nums in DESCENDING order (i.e. largest to smallest). Note: In practice, just say "Run topological sort on G". I really just created this question to reinforce the conceptual connection between topological sorting and post numbers from DFS runs.

Answer 217

False. There may be multiple sources and sinks.

Answer 218

False. It is only O(n + m) if the problem can be reduced to where each clause has at most 2 literals.

Answer 219

If the number of SCCs = n in the original graph, then there must not have been any cycles present in the graph. Hence every vertex in the original graph has now become a singleton SCC in G_scc.

Answer 220

There must have been at least one cycle present such that at least or more of the nodes were compressed into the same SCC.

Answer 221

O(n). Because in the worst case if no cycles were present in the original graph then every vertex in G would become a singleton SCC in G_scc, so in worst case you would have to count n vertices.

Answer 222

True. You would run, BFS(G, s) and then check if dist[v] = inf. If dist[v] != inf, then a path exists, otherwise False. (We check for inf because this is how the distances are initialized in BFS.)

Answer 223

The heaviest edge of any cycle can not be part of any MST.

CS6515_Exam2 Flashcards

(270 cards)