Section Twelve: Algorithms Flashcards

1
Q

Chapter 59 – Analysis and design of algorithms
Comparing algorithms

A

Algorithms may be compared on how much time they need to solve a particular problem. This is referred to as the time complexity of the algorithm. The goal is to design algorithms which run quickly while using the minimum of resources such as memory.

In order to compare the efficiency of different algorithms in terms of execution time, we need to quantify the number of basic operations or steps that the algorithm will need, in terms of the number of items to be processed.

2
Q

Chapter 59 – Analysis and design of algorithms
A linear function

A

A linear function is expressed in general terms as f(x) = ax + c

3
Q

Chapter 59 – Analysis and design of algorithms
A polynomial function

A

A polynomial function is expressed in general terms as f(x) = axᵐ + bx + c

4
Q

Chapter 59 – Analysis and design of algorithms
An exponential function

A

An exponential function takes the form f(x) = abˣ.
This function grows very large, very quickly!

5
Q

Chapter 59 – Analysis and design of algorithms
A logarithmic function

A

A logarithmic function takes the form f(x) = a logₙ x

6
Q

Chapter 59 – Analysis and design of algorithms
Permutations

A

The permutation of a set of objects is the number of ways of arranging the objects. For example, if you have 3 objects A, B and C you can choose any of A, B or C to be the first object. You then have two choices for the second object, making 3 x 2 = 6 different ways of arranging the first two objects, and then just one way of placing the third object. The six permutations are ABC, ACB, BAC, BCA, CAB, CBA.

The formula for calculating the number of permutations of four objects is 4 x 3 x 2 x 1, written 4! and spoken as “four factorial”. (Note that 10! = 3.6 million so don’t try getting 10 students to line up in all possible ways!)
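
If you want to check these figures, Python's standard library can do both calculations (math.factorial and itertools.permutations are standard library functions, not part of the textbook):

import math
from itertools import permutations

print(math.factorial(4))     # 24 ways of arranging 4 objects
print(math.factorial(10))    # 3628800, roughly 3.6 million

# Generate the six permutations of A, B and C
for p in permutations("ABC"):
    print("".join(p))        # ABC, ACB, BAC, BCA, CAB, CBA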

7
Q

Chapter 59 – Analysis and design of algorithms
Big-O notation

A

Having covered the necessary maths, we can now study the so-called Big-O notation, which is used to express the time complexity, or performance, of an algorithm. (‘O’ stands for ‘Order’.)

8
Q

Chapter 59 – Analysis and design of algorithms
O(1) (Constant time)

A

O(1) describes an algorithm that takes constant time (the same amount of time) to execute regardless of the size of the input data set.
Suppose array a has n items. The statement
length = len(a)
will take the same amount of time to execute however many items are held in the array.

9
Q

Chapter 59 – Analysis and design of algorithms
O(n) (linear time)

A

O(n) describes an algorithm whose performance will grow in linear time, in direct proportion to the size of the data set. For example, a linear search of an array of 1000 unsorted items will take 1000 times longer than searching an array of 1 item.

10
Q

Chapter 59 – Analysis and design of algorithms
O(n²) (Polynomial time)

A

O(n²) describes an algorithm whose performance is directly proportional to the square of the size of the data set. A program with two nested loops, each performed n times, will typically have an order of time complexity O(n²). The running time of the algorithm grows in polynomial time.
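
As an illustration of the nested-loop pattern described above, here is a minimal sketch (the function name is illustrative) that counts the basic operations performed for n items:

def count_pairs(data):
    # Two nested loops, each performed n times: n * n comparisons
    operations = 0
    for x in data:
        for y in data:
            operations += 1
    return operations

print(count_pairs(list(range(10))))    # 100 operations for 10 items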

11
Q

Chapter 59 – Analysis and design of algorithms
O(2ⁿ) (Exponential time)

A

O(2ⁿ) describes an algorithm where the time taken to execute will double with every additional item added to the data set. The execution time grows in exponential time and quickly becomes very large.

12
Q

Chapter 59 – Analysis and design of algorithms
O(log n) (Logarithmic time)

A

The time taken to execute an algorithm of order O(log n) (logarithmic time) will grow very slowly as the size of the data set increases. A binary search is a good example of an algorithm of time complexity O(log₂ n). Doubling the size of the data set has very little effect on the time the algorithm takes to complete.

13
Q

Chapter 59 – Analysis and design of algorithms
O(n!) (Factorial time)

A

The time taken to execute an algorithm of order O(n!) will grow very quickly, faster than O(2ⁿ).
Suppose that the problem is to find all the permutations of n letters. If n = 2, there are 2 permutations to find. If n = 6, there are 720 permutations – far more than 2ⁿ, which is only 64.

14
Q

Chapter 59 – Analysis and design of algorithms
Calculating the time complexity of an algorithm

A

Here are two different algorithms for finding the smallest element in an array called arrayX of size n. Assume the index starts at 0.

The first algorithm assigns the first value in the array to a variable called minimum. It then compares each subsequent item in the array with minimum and, if it is smaller, replaces minimum with the new lowest value.

minimum = arrayX[0]
for k = 1 to n - 1
    if arrayX[k] < minimum then
        minimum = arrayX[k]
    endif
next k

To calculate the time complexity of the algorithm in Big-O notation, we need to count the number of basic operations it performs. There is one initial statement and n - 1 if statements, so roughly 1 + n steps. However, as we have already discussed, the constant term is insignificant compared to n, so this algorithm executes in linear time and has time complexity O(n).
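
A runnable Python version of the same algorithm might look like this (Python lists are 0-indexed, matching the pseudocode):

def find_minimum(arrayX):
    minimum = arrayX[0]              # one initial statement
    for k in range(1, len(arrayX)):  # the loop body runs n - 1 times
        if arrayX[k] < minimum:
            minimum = arrayX[k]
    return minimum

print(find_minimum([7, 2, 9, 4]))    # 2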

15
Q

Chapter 60 – Searching algorithms
Linear search

A

Sometimes it is necessary to search for items in a file, or in an array in memory. If the items are not in any particular sequence, the data items have to be searched one by one until the required one is found or the end of the list is reached. This is called a linear search.
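
A minimal Python sketch of a linear search; the function name and return convention (index of the item, or -1 if absent) are illustrative assumptions rather than the textbook's:

def linear_search(items, target):
    for i in range(len(items)):
        if items[i] == target:
            return i          # found: return its position
    return -1                 # reached the end without finding it

print(linear_search([14, 2, 3, 11, 7], 11))   # 3
print(linear_search([14, 2, 3, 11, 7], 5))    # -1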

16
Q

Chapter 60 – Searching algorithms
Time complexity of linear search

A

We can determine the algorithm’s efficiency in terms of execution time, expressed in Big-O notation. To do this, you need to compute the number of operations that the algorithm will require for n items. The loop is performed n times for a list of length n, and there are two steps in the loop (an IF statement and an assignment statement), giving a total of 3 + 2n steps (including 3 steps at the start). The constant term and the coefficient of n become insignificant as n increases in size, and the time complexity of the algorithm basically depends on how often the loop has to be performed in the worst-case scenario. The linear search is therefore O(n).

17
Q

Chapter 60 – Searching algorithms
Binary search

A

The binary search is a much more efficient method of searching a list for an item than a linear search but, crucially, the items in the list must be sorted. If they are not sorted, a linear search is the only option. The algorithm works by repeatedly dividing in half the portion of the list that could contain the required data item, until either the item is found or only one item remains.

18
Q

Chapter 60 – Searching algorithms
Binary search algorithm

A

The ordered array is divided into three parts: a middle item, the first part of the array from aList[0] up to the middle item, and the second part starting after the middle item and ending with the final item in the list. The middle item is examined to see if it is equal to the sought item.

If it is not, and the middle item is greater than the sought item, the second half of the array is of no further interest; similarly, if it is smaller, the first half can be discarded. The number of items being searched is therefore halved, and the process is repeated until the sought item is found or no items remain, with either the first or the second half of the array being eliminated at each pass.
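
An iterative Python sketch of this process, assuming a sorted list aList (the variable names are illustrative):

def binary_search(aList, item):
    first = 0
    last = len(aList) - 1
    while first <= last:
        midpoint = (first + last) // 2
        if aList[midpoint] == item:
            return midpoint               # found the sought item
        elif aList[midpoint] > item:
            last = midpoint - 1           # discard the second half
        else:
            first = midpoint + 1          # discard the first half
    return -1                             # item not in the list

print(binary_search([2, 5, 8, 12, 16, 23, 38], 16))   # 4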

19
Q

Chapter 60 – Searching algorithms
Time complexity of binary search

A

The binary search halves the search area with each execution of the loop – an excellent example of a divide and conquer strategy. If we start with n items, there will be approximately n/2 items left after the first comparison, n/4 after 2 comparisons, n/8 after 3 comparisons, and n/2ⁱ after i comparisons. The number of comparisons needed to end up with a list of just one item is i, where n/2ⁱ = 1. One further comparison would be needed to check if this item is the one being searched for or not.

Solving this equation for i: n = 2ⁱ
Taking the logarithm of each side: log₂ n = i log₂ 2, giving i = log₂ n (since log₂ 2 = 1)
Therefore, the binary search is O(log n).

20
Q

Chapter 60 – Searching algorithms
A recursive algorithm

A

The basic concept of the binary search is in fact recursive, and a recursive algorithm is given below. The procedure calls itself, eventually “unwinding” when the procedure ends. When recursion is used there must always be a condition that, if true, terminates the recursive procedure, or the recursion will continue forever.

Once again, first, last and midpoint are integer variables used to index elements of the array, with first starting at 0 and last starting at the upper limit of the array index.
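
The textbook’s own listing is not reproduced on this card, but a recursive sketch consistent with the description (first starting at 0, last at the upper limit of the array index) might be:

def binary_search_rec(aList, item, first, last):
    if first > last:
        return -1                         # terminating condition: not found
    midpoint = (first + last) // 2
    if aList[midpoint] == item:
        return midpoint                   # terminating condition: found
    elif aList[midpoint] > item:
        return binary_search_rec(aList, item, first, midpoint - 1)
    else:
        return binary_search_rec(aList, item, midpoint + 1, last)

aList = [2, 5, 8, 12, 16, 23, 38]
print(binary_search_rec(aList, 23, 0, len(aList) - 1))   # 5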

21
Q

Chapter 60 – Searching algorithms
Binary tree search

A

The recursive algorithm for searching a binary tree is similar to the binary search algorithm above, except that instead of looking at the midpoint of a list, or a subset of the list, on each pass, half of the tree or subtree is eliminated each time its root is examined.

22
Q

Chapter 60 – Searching algorithms
Time complexity of binary tree search

A

Like the binary search, the number of items to be searched is halved with each pass through the algorithm. The time complexity is the same as the binary search, i.e. O(log n).

23
Q

Chapter 61 – Bubble sort and insertion sort
Sorting algorithms

A

Sorting is a very common task in data processing, and frequently the number of items may be huge, so using a good algorithm can considerably reduce the time spent on the task. There are many different sorting algorithms and we will start by looking at a simple but inefficient example.

24
Q

Chapter 61 – Bubble sort and insertion sort
Bubble sort

A

The Bubble sort is one of the most basic sorting algorithms and the simplest to understand. The basic idea is to bubble up the largest (or smallest) item to the end of the list, then the second largest, then the third largest and so on until no more swaps are needed.

Suppose you have an array of n items:
- Go through the array, comparing each item with the one next to it. If it is greater, swap them.
- The last element of the array will be in the correct place after the first pass
- Repeat n-2 times, reducing by one on each pass the number of elements to be examined (see the sketch below)
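
A straightforward Python sketch of these steps, stopping early once a pass makes no swaps:

def bubble_sort(items):
    n = len(items)
    for i in range(n - 1):
        swapped = False
        # The last i elements are already in their correct places
        for j in range(n - 1 - i):
            if items[j] > items[j + 1]:
                items[j], items[j + 1] = items[j + 1], items[j]
                swapped = True
        if not swapped:
            break            # no swaps on this pass: list is sorted
    return items

print(bubble_sort([5, 1, 4, 2, 8]))   # [1, 2, 4, 5, 8]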

25
Q

Chapter 61 – Bubble sort and insertion sort
Insertion Sort

A

This is a sorting algorithm that sorts one data item at a time. It is rather similar to how you might sort a hand of cards. The algorithm takes one data item from the list and places it in the correct location in the list. This process is repeated until there are no more unsorted data items in the list. Although more efficient than the bubble sort, it is not as efficient as the merge sort or quick sort.
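
A minimal Python sketch mirroring the “hand of cards” description above (the function name is illustrative):

def insertion_sort(items):
    for i in range(1, len(items)):
        current = items[i]          # the next "card" to place
        j = i - 1
        # Shift larger sorted items right to open a gap
        while j >= 0 and items[j] > current:
            items[j + 1] = items[j]
            j -= 1
        items[j + 1] = current      # insert into the correct location
    return items

print(insertion_sort([8, 3, 5, 1]))   # [1, 3, 5, 8]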

26
Q

Chapter 61 – Bubble sort and insertion sort
Time complexity of bubble and insertion sorts

A

The bubble sort requires close to n passes through the list, with each pass requiring a maximum of n – 1 swaps. It is of order O(n²).

The insertion sort also has two nested loops and so has time complexity O(n²). However, if the list is already almost sorted, the time complexity is reduced to close to O(n).

27
Q

Chapter 62 – Merge sort and quick sort
Merge sort

A

The merge sort uses a divide and conquer approach. The list is successively divided in half, forming two sublists, until each sublist is of length one. Pairs of sublists are then merged into larger sorted sublists until they are recombined into a single sorted list.

The basic steps are:
- Divide the unsorted list into n sublists, each containing one element
- Repeatedly merge sublists to produce new sorted sublists until there is only one sublist remaining (see the sketch below).
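
A recursive Python sketch of these steps; the merging logic is an illustrative implementation, not the textbook’s listing:

def merge_sort(items):
    if len(items) <= 1:
        return items                     # a one-item list is sorted
    mid = len(items) // 2
    left = merge_sort(items[:mid])       # sort each half recursively
    right = merge_sort(items[mid:])
    # Merge the two sorted halves into one sorted list
    merged = []
    i = j = 0
    while i < len(left) and j < len(right):
        if left[i] <= right[j]:
            merged.append(left[i])
            i += 1
        else:
            merged.append(right[j])
            j += 1
    merged.extend(left[i:])              # append whatever remains
    merged.extend(right[j:])
    return merged

print(merge_sort([38, 27, 43, 3, 9, 82, 10]))   # [3, 9, 10, 27, 38, 43, 82]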

28
Q

Chapter 62 – Merge sort and quick sort
Time complexity of merge sort

A

The merge sort is another example of a divide and conquer algorithm. Successively halving the list gives log n levels of sublists, but in this case all n items have to be merged at each level, so the log n has to be multiplied by a factor of n.
The time complexity is therefore O(n log n).

29
Q

Chapter 62 – Merge sort and quick sort
Space complexity

A

The amount of resources such as memory that an algorithm requires, known as the space complexity, is also a consideration when comparing the efficiency of algorithms. The bubble sort, for example, requires n memory locations for a list of size n. The merge sort, on the other hand, requires additional memory to hold the left half and right half of the list, so takes twice the amount of memory space.

30
Q

Chapter 62 – Merge sort and quick sort
Quick sort

A

The quick sort algorithm, like the merge sort, uses a divide and conquer approach to quickly reduce the size of the problem, but without using the additional storage required by the merge sort.

The steps in the quick sort are as follows (a Python sketch follows the list):
1. Select a value called the pivot value. There are different ways to choose the pivot value, but we will choose the first item in the list. The actual position where the pivot value belongs in the final sorted list, called the split point, will be used to divide the list for subsequent calls. In the chapter’s example list, 9 is the first pivot value.
2. Divide the remainder of the list into two partitions:
  - all elements less than the pivot value must be in the first partition
  - all elements greater than the pivot value must be in the second partition
3. 3 and 15 are now the pivots in the left and right partitions. Recursively repeat the process.
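
A compact Python sketch of the scheme, taking the first item as the pivot as described above. Building new lists keeps the sketch short; the usual textbook treatment partitions in place around a split point, and placing items equal to the pivot in the second partition is an assumption made here:

def quick_sort(items):
    if len(items) <= 1:
        return items
    pivot = items[0]                                # first item as pivot
    first = [x for x in items[1:] if x < pivot]     # first partition
    second = [x for x in items[1:] if x >= pivot]   # second partition
    return quick_sort(first) + [pivot] + quick_sort(second)

print(quick_sort([9, 5, 3, 15, 12, 7]))   # [3, 5, 7, 9, 12, 15]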
31
Q

Chapter 62 – Merge sort and quick sort
Advantages and disadvantages of the quick sort algorithm

A

The quicksort algorithm is extremely fast. If the partition always occurs in the middle of the list, there will be log n divisions in a list of length n, and each of the n items needs to be checked against the pivot value to find the split point. It therefore has time complexity O(n log n).

Another advantage is that, unlike the merge sort, it does not need additional memory.

A disadvantage is that if the split points are not near the middle of the list, but are close to the start or end of the list, the division will be very uneven. If the split point is, for example, the first item in the sequenced list, the division results in a list of 0 items and a list of n-1 items. The list of n-1 items divides into 0 items and n-2 items and so on. The resulting time complexity is O(n²).

If the list is very large, and recursion continues too long, it may cause stack overflow and the program will crash.

32
Q

Chapter 62 – Merge sort and quick sort
Summary of sort algorithms

A
  • Bubble sort is the slowest of the sorts, with time complexity O(n²)
  • Insertion sort is O(n²) but if the list is already almost sorted, this reduces to O(n)
  • Merge sort is O(n log n) but requires additional memory space for the merging process
  • Quick sort is generally the fastest sort, but is dependent on using a pivot that is not close to the smallest or largest elements of the list. There are several methods for selecting a pivot to ensure this does not happen. It has average time complexity O(n log n). It does not require additional memory space.
33
Q

Chapter 63 – Graph-traversal algorithms
Graph traversals

A

There are two ways to traverse a graph so that every node is visited. Each of them uses a supporting data structure to keep track of which nodes have been visited, and which node to visit next.
- A depth-first traversal uses a stack, which is implemented automatically during execution of a recursive routine to hold local variables, parameters and return addresses each time a subroutine is called. Alternatively, a non-recursive routine could be written and the stack maintained as part of the routine.
- A breadth-first traversal uses a queue.

34
Q

Chapter 63 – Graph-traversal algorithms
Depth-first traversal

A

In this traversal, we go as far down one route as we can before backtracking and taking the next route.

The following recursive subroutine dfs is called initially from the main program, which passes it a graph, defined here as an adjacency list (see Chapter 38) and implemented as a dictionary with nodes A, B, C, … as keys, and the neighbours of each node as data. Thus, in the chapter’s example graph, if "A" is the current vertex, graph["A"] returns the list ["B", "D", "E"].
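
The listing itself does not appear on this card, so here is a recursive sketch consistent with the description; the graph beyond A’s neighbours is an assumed example, not the textbook’s diagram:

def dfs(graph, current_vertex, visited=None):
    if visited is None:
        visited = []
    visited.append(current_vertex)
    for neighbour in graph[current_vertex]:
        if neighbour not in visited:
            dfs(graph, neighbour, visited)    # recursive call uses the stack
    return visited

# Assumed example graph: A's neighbours are B, D and E, as on the card
graph = {"A": ["B", "D", "E"], "B": ["A", "C"], "C": ["B"],
         "D": ["A"], "E": ["A"]}
print(dfs(graph, "A"))    # ['A', 'B', 'C', 'D', 'E']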

35
Q

Chapter 63 – Graph-traversal algorithms
Breadth-first traversal

A

With a breadth-first traversal, starting at A we first visit all the nodes adjacent to A before moving to B and repeating the process for each node at this ‘level’, before moving to the next level. Instead of a stack, a queue is used to keep track of nodes that we still have to visit. In the chapter’s diagrams, nodes are coloured pale blue when queued and dark blue when dequeued and added to the list of nodes that have been visited.

36
Q

Chapter 63 – Graph-traversal algorithms
Pseudocode algorithm for breadth-first traversal

A

The breadth-first traversal is an iterative, rather than recursive, routine. The first node (‘A’ in this example) is appended to the empty queue as soon as the subroutine is entered. A Python definition of the graph as a dictionary is given below for interest, but it is not directly used in the pseudocode, as implementations will vary in different languages.
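
The original pseudocode and Python dictionary are not reproduced on this card; a minimal iterative sketch consistent with the description (using collections.deque as the queue, and the same assumed example graph as before) could be:

from collections import deque

def bfs(graph, start):
    visited = []
    queue = deque([start])        # first node queued on entry
    discovered = {start}          # 'pale blue': queued but not yet visited
    while queue:
        vertex = queue.popleft()  # dequeue: 'dark blue', now visited
        visited.append(vertex)
        for neighbour in graph[vertex]:
            if neighbour not in discovered:
                discovered.add(neighbour)
                queue.append(neighbour)
    return visited

graph = {"A": ["B", "D", "E"], "B": ["A", "C"], "C": ["B"],
         "D": ["A"], "E": ["A"]}
print(bfs(graph, "A"))    # ['A', 'B', 'D', 'E', 'C']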

37
Q

Chapter 63 – Graph-traversal algorithms
Applications of depth-first search

A

Applications of the depth-first search include the following:
- In scheduling jobs where a series of tasks is to be performed, and certain tasks must be completed before the next one begins.
- In solving problems such as mazes, which can be represented as a graph.

38
Q

Chapter 63 – Graph-traversal algorithms
Applications of breadth-first search

A

Breadth-first searches are used to solve many real-life problems.
For example:
- A major application of a breadth-first search is to find the shortest path between two points A and B, and this will be explained in detail in the next chapter. Finding the shortest path is important in, for example, GPS navigation systems and computer networks.
- Facebook. Each user profile is regarded as a node or vertex in the graph, and two nodes are connected if they are each other’s friends. This example is considered in more depth in Chapter 72, Big Data.
- Web crawlers. A web crawler can analyse all the sites you can reach by following links out from a particular website.

39
Q

Chapter 64 – Optimisation algorithms
Optimisation problems

A

We increasingly rely on computers to find the optimum solution to a range of different problems.
For example:
- scheduling aeroplanes and staff so that air crews always have the correct minimum rest time between flights
- finding the best move in a chess problem
- timetabling classes in schools and colleges
- finding the shortest path between two points – for building circuit boards, route planning, communications networks and many other applications

Finding the shortest path from A to B has numerous applications in everyday life and in computer-related problems. For example, if you visit a site like Google Maps to get directions from your current location to a particular destination, you probably want to know the shortest route. The software that finds it for you will use representations of street maps or roads as graphs, with estimated driving times or distances as edge weights.

40
Q

Chapter 64 – Optimisation algorithms
Dijkstra’s shortest path algorithm

A

Dijkstra (pronounced dike-stra) lived from 1930 to 2002. He was a Dutch computer scientist who received the Turing award in 1972 for fundamental contributions to developing programming languages. He wrote a paper in 1968 which was published under the heading “GO TO Statement Considered Harmful” and was an advocate of structured programming.

Dijkstra’s algorithm is designed to find the shortest path between one particular start node and all other nodes in a weighted graph. Its approach is similar to a breadth-first search.

The weights could represent, for example, distances or time taken to travel between towns, or the cost of travel between airports.
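
The chapter’s worked example is not shown on this card; a compact sketch of the algorithm using a priority queue (Python’s heapq), with an assumed example graph and weights, might look like this:

import heapq

def dijkstra(graph, start):
    # Shortest known distance from start to every node
    distances = {node: float("inf") for node in graph}
    distances[start] = 0
    heap = [(0, start)]
    while heap:
        dist, node = heapq.heappop(heap)
        if dist > distances[node]:
            continue                  # stale entry: a shorter route was found
        for neighbour, weight in graph[node]:
            new_dist = dist + weight
            if new_dist < distances[neighbour]:
                distances[neighbour] = new_dist
                heapq.heappush(heap, (new_dist, neighbour))
    return distances

# Assumed example: each node maps to (neighbour, edge weight) pairs
graph = {"A": [("B", 4), ("C", 2)],
         "B": [("C", 1), ("D", 5)],
         "C": [("B", 1), ("D", 8)],
         "D": []}
print(dijkstra(graph, "A"))   # {'A': 0, 'B': 3, 'C': 2, 'D': 8}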

41
Q

Chapter 64 – Optimisation algorithms
The A* algorithm

A

Dijkstra’s algorithm is a special case of a more general path-finding algorithm called the A* algorithm. Dijkstra’s algorithm has one cost function, which is the real cost value (e.g. distance) from the source node to every other node.

The A* algorithm has two cost functions:
1. g(x) – as with Dijkstra’s algorithm, this is the real cost from the source to a given node.
2. h(x) – this is the approximate cost from node x to the goal node. It is a heuristic function, meaning that it gives a good or adequate estimate, but not necessarily an exact one. The algorithm stipulates that the heuristic function should never overestimate the cost; therefore the real cost should be greater than or equal to h(x).

The total cost of each node is calculated as f(x) = g(x) + h(x).

The A* algorithm focusses only on reaching the goal node, unlike Dijkstra’s algorithm which finds the lowest cost or shortest path to every node. It is used, for example, in video games to enable characters to navigate the world.
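
To make the relationship concrete, here is a minimal sketch in which the only change from Dijkstra’s algorithm is that nodes are prioritised by f(x) = g(x) + h(x) rather than by g(x) alone; the graph and the heuristic estimates are assumptions for illustration, chosen so that h never overestimates the real cost:

import heapq

def a_star(graph, h, start, goal):
    # Priority is f(x) = g(x) + h(x); using g alone would give Dijkstra
    heap = [(h(start), 0, start, [start])]
    best_g = {start: 0}
    while heap:
        f, g, node, path = heapq.heappop(heap)
        if node == goal:
            return path, g            # goal reached with lowest f(x)
        for neighbour, weight in graph[node]:
            new_g = g + weight
            if new_g < best_g.get(neighbour, float("inf")):
                best_g[neighbour] = new_g
                heapq.heappush(heap, (new_g + h(neighbour), new_g,
                                      neighbour, path + [neighbour]))
    return None, float("inf")

# Assumed example: estimates play the role of h(x) and never overestimate
graph = {"A": [("B", 4), ("C", 2)], "B": [("D", 5)],
         "C": [("B", 1), ("D", 8)], "D": []}
estimates = {"A": 6, "B": 4, "C": 5, "D": 0}
path, cost = a_star(graph, estimates.get, "A", "D")
print(path, cost)   # ['A', 'C', 'B', 'D'] 8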