Part 1 (All but hashing) Flashcards by Afan Ahmed

What is an ADT?

An abstract data type is a specification of a data structure in terms of what we can do with it (its functionality) rather than what it actually is (its specific representation)

How well did you know this?

Not at all

Perfectly

The implementation affects (often dramatically) the … of the solution method, but not the …of the solution.

Efficiency

correctness

How well did you know this?

Not at all

Perfectly

What is encapsulation?

Ideally the user of an ADT will not need to know, nor be able to influence, the actual representation of data

(hiding data from user and thus preventing alteration)

How well did you know this?

Not at all

Perfectly

What are some example ADTs?

List
Queue
Stack
Dictionary
Tree
Graph

How well did you know this?

Not at all

Perfectly

What are some example data structures?

array
linked list
binary search tree
hash table

How well did you know this?

Not at all

Perfectly

What are 3 main characteristics of an algorithm designed?

Domain of definition: set of legal inputs
Correctness: it gives correct output for each legal input. This depends on the problem we are trying to solve, and can be tricky to prove
Resource use: usually computing time and memory space.
- This depends on the input, and on the implementation (hardware, programmer skill, compiler, language, …).
- It usually grows as the input size grows.
- There is a tradeoff between resources (for example, time vs space).
- Running time is usually more important than space use.

How well did you know this?

Not at all

Perfectly

How running time scales with problem size

Table: Relative growth of running time T(n) when the input size increases from n = 10 to n = 1000 provided that T(10) = 1.

How well did you know this?

Not at all

Perfectly

How problem size scales with running time

Table: The largest data sizes n that can be processed by an algorithm with time complexity Θ(f(n)) provided that T(10) = 1 minute

How well did you know this?

Not at all

Perfectly

Estimating running time: harder example

How well did you know this?

Not at all

Perfectly

worst-case versus average-case analysis

a good worst-case bound is always useful, but it is just a first step and we should aim to refine the analysis for important algorithms. Average-case analysis is often more practically useful, provided the algorithm will be run on “random” data

How well did you know this?

Not at all

Perfectly

For large enough n, a linear time algorithm will always beat a quadratic time one

true/false?

true

How well did you know this?

Not at all

Perfectly

Why do constants need to be taken into account at times?

An algorithm with running time 10⁻¹⁰n² is probably better in practice than one with running time 10¹⁰n, since the latter will eventually beat the former, but only on instances of size at least 10²⁰, which is rarely met in practice

How well did you know this?

Not at all

Perfectly

What can be said about f is O(g)?

“f is Big-Oh of g”

if there is some C > 0 and some n0 ∈ N such that for all n ≥ n₀, f(n) ≤ Cg(n)

f grows at most as fast as g

How well did you know this?

Not at all

Perfectly

What can be said about f is Ω(g)?

“f is big-Omega of g”

if g is O(f)

f grows at least as fast as g

How well did you know this?

Not at all

Perfectly

What can be said about f is Θ(g)?

“f is big-Theta of g”

if f is O(g) and g is O(f)

f grows at the same rate as g

How well did you know this?

Not at all

Perfectly

Rules for asymptotic notation

(irrelevance of constant factors)

if c > 0 is constant then cf is Θ(f)

(transitivity)

If f is O(g)/Ω(g)/Θ(g) and g is O(h)/Ω(h)/Θ(h), then f is O(h)/Ω(h)/Θ(h)

(sum rule)

If f1 is O(g₁) and f₂ is O(g₂) then f₁ + f₂ is O(max{g₁, g₂})

(product rule)

If f1 is O(g₁) and f₂ is O(g₂) then f₁f₂ is O(g₁g₂)

(limit rule)

Suppose that L := lim_n→∞ f(n)/g(n) exists

Then

if L = 0 then f is O(g) and f is not Ω(g);
if 0 < L < ∞ then f is Θ(g);
if L = ∞ then f is Ω(g) and f is not O(g)

How well did you know this?

Not at all

Perfectly

Write f ≺ g if f is O(g) but f is not Θ(g), so g grows at a faster rate than f.

List runtimes

Study These Flashcards

log n ≺ (log n)² ≺ √ n ≺ n ≺ n log n ≺ n(log n)² ≺ n²≺ n³ ≺ · · · ≺ (1.5)ⁿ ≺ 2ⁿ ≺ n! ≺ nⁿ

f(n) occurring are log n, n, (n log n), n² , n³ , 2ⁿ , and each grows considerably faster than the previous one

which 2 have the smallest gap?

Study These Flashcards

gap between n and n log n is the smallest

What is a list and what are some of its basic operations?

Study These Flashcards

A list (or sequence) is an ordered collection of items

basic operations:

return the size of the list; if this is zero, list is empty;
retrieve the item in a given position;
delete an item in a given position;
insert an item at a given position;
given the position of an item, find the position of the next item, if it exists (this gives an iterator over the data structure).

How do size and capacity differ?

Study These Flashcards

size is the number of elements in an array

capacity is the total number of places in an array

In what runtime does retrieval, deletion and insertion occur, for a simple array?

Study These Flashcards

Retrieval takes time Θ(1) - (this is notation for constant time)

Deletion takes time Θ(n − i) if the deleted element was in position i

Insertion takes time Θ(n − i) if the inserted element ends up in position i, not counting the case where we need to increase capacity

statically allocated memory is from the…?

and

automatically allocated memory is from the…?

Study These Flashcards

memory heap

and

memory stack (smaller)

what does malloc do?

Study These Flashcards

dynamically allocates memory from the heap

what does realloc do?

Study These Flashcards

resizes the memory required for an object

what does free do?

prevents memory leak

Suppose that we start from empty and insert n items into a list, at the end (so there is no moving of items needed) what is the time complexity if we do not account for capacity increase?

Θ(n)

If we increase capacity by 1 every time the list is full and we need to insert another element what is the time complexity?

we perform work in Θ(1 + 2 + · · · + n), namely **Θ(n² ).** The same is true for any (“arithmetic”) increase by a constant

If we double each time the list is full and we need to insert another element what is the time complexity?

we perform work in Θ(1 + 2 + 4 + · · · + 2^k ), where 2 ^k is the largest power less than n. This simplifies to Θ(n) (“the amortized cost of insertion is Θ(1)”) *note: The worst case for an insertion is Θ(n), but we can do n in a row at average cost Θ(1)*

Finding an item in an array-based list. if the array elements are general, we must simply search every element starting from the beginning. Thus what is worst case and average case runtime?

It takes Θ(n) work in the worst case, and also on average if the data is random

If the array elements are kept sorted (this is a different ADT, the SortedList) what type of search could be we use? What is it's worst case runtime?

binary search Θ(log n) in the worst case

When we delete we should decrease the size of the list. But the capacity may stay much bigger than the size unless we explicitly reduce it. What is one way in which we can do this reduction?

One way is to reduce the capacity by half when the list is less than 1/4 full. This increases the (amortized) cost of deletion by only a constant factor

What are pros and cons of using a linked list?

**Pros:** no need to change array size; insert and delete operations are fast **Cons:** random access to an element is slow, so retrieve and find operations take longer

Linked list operations and how they're performed

To retrieve/find an element, we must follow pointers from the beginning until we get to it To insert an element at position i, we must reset the pointer from item i − 1 to point to the new element instead of to the former occupant x of position i, and make the new object point to x (which is now at position i + 1) To delete an element, make the incoming pointer point to its successor instead (how to find that pointer?). If the node is at the beginning or end, a slight variation of this is needed *Remember to free the memory, or our old node may take up space forever*

**Performance of basic list operations**

What is a stack?

A **stack** is a linear container that allows us to insert, delete and retrieve only at one end. *There is no general find operation* Intuitive idea: *a stack of plates in a cafeteria. A stack is LIFO (last in, first out)*

What are the names of the operations for stacks?

The insert operation is usually called push delete is pop retrieve is peek isempty tells us whether the stack is empty new that creates an empty stack *note: Stacks are inherently tied to the idea of recursion*

What is a queue?

A **queue** is a linear container that allows us to insert at one end, and delete and retrieve only at the other end. *There is no general find operation* Intuitive idea: *a queue of people waiting to buy tickets. A queue is FIFO (first in, first out)*

what is a priority queue?

A **priority queue** is a container ADT where each element has a key (usually a real number) called its priority. There are operations allowing us to insert an element, and to find and delete the element of highest priority. A general find or delete or retrieve is not available

Priority queues can be implemented in many ways what are some ways?

unsorted list, sorted list, binary heap, binomial heap

convert this infix form to postfixform 1 + 2 − 3 ∗ 4

1 2 + 34 ∗ − postfix Note: To evaluate a postfix expression, we can use a stack. Scan from left to right, and push each operand onto the stack. When we encounter an operator, pop the correct number of arguments from the stack, evaluate, and push the result back in. If this fails, an error has been made in the postfix notation

Consider all possible outputs orders formed by pushing each of 1, 2, . . . , n onto a stack, and popping them all of When n = 3 there is exactly one of the 6 possible orders that is impossible which one is it?

312 not possible push 1 push 2 push 3 pop 3 cannot pop 2 without popping 1

Part 1 (All but hashing) Flashcards

(41 cards)