Lecture 19 - Dynamic Programming 1: Fibonacci, Shortest Paths

Dynamic programming is a very powerful algorithm design paradigm. It can be thought of as a smart way of applying brute force: it breaks a problem into smaller subproblems, solves each subproblem, and reuses those solutions to solve the larger problem. It is well suited to optimization problems.

Dynamic programming = recursion + memoization + guessing

The guessing part works like this: when you don't know which choice leads to the best solution, try all the possible choices and take the best one.

Fibonacci Numbers

A naive recursive algorithm to calculate Fibonacci numbers is -

def fibonacci(n):
    if n <= 2:
        f = 1    # base cases: F(1) = F(2) = 1
    else:
        f = fibonacci(n-1) + fibonacci(n-2)
    return f
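
As a rough sanity check, even a modest input takes noticeable time with the naive recursion; exact timings are machine-dependent, but the snippet below illustrates the idea -

import time

start = time.time()
print(fibonacci(35))          # 9227465
print(time.time() - start)    # a few seconds with the naive recursion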

This is a very inefficient algorithm because the recursion recomputes the same values exponentially many times. The time complexity of this algorithm can be written as -

\[T(n) = T(n-1) + T(n-2) + \Theta(1)\]

The solution to this recurrence is exponential: $T(n) = \Omega(2^{n/2})$, which is pretty bad. A better way to do this is by dynamic programming.
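
One way to see this bound: since $T(n-1) \ge T(n-2)$, the recurrence gives $T(n) \ge 2T(n-2)$, and unrolling that for $n/2$ levels yields -

\[T(n) \ge 2T(n-2) \ge 4T(n-4) \ge \dots \ge 2^{n/2}\,\Theta(1) = \Omega(2^{n/2})\]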

Memoization

Memoization means storing the solutions to subproblems in a data structure (usually a hash table), trading memory for computation speed.

Calculating Fibonacci numbers using memoization can be implemented as -

memo = {}
def fibonacci(n):
    if n in memo:
        return memo[n]    # already computed: constant-time lookup
    if n <= 2:
        f = 1
    else:
        f = fibonacci(n-1) + fibonacci(n-2)
        memo[n] = f       # remember the result for future calls
    return f
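
With the memo table in place, even much larger inputs return essentially instantly; a quick check -

print(fibonacci(100))    # 354224848179261915075, using only linearly many calls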

Looking at the dictionary first makes sure that we don't repeat calculations we have already done. You can do this with any recursive algorithm. You may not be able to do this in some contexts because of memory limits, but you can still store a few of the computed results to help your computation.
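
As an aside, Python's standard library offers this pattern ready-made: functools.lru_cache memoizes any function, and its maxsize argument caps how many results are kept (evicting the least recently used entries), which addresses exactly that memory concern. A minimal sketch -

from functools import lru_cache

@lru_cache(maxsize=1000)    # keep at most 1000 cached results
def fibonacci(n):
    if n <= 2:
        return 1
    return fibonacci(n-1) + fibonacci(n-2)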

It is easy to see how this optimization helps our computation time. For example, if we draw a recursion tree, it would look something like this -

[Figure: memoization recursion tree]

To compute $F_n$, we have to compute $F_{n-1}$ and $F_{n-2}$. However, computing $F_{n-1}$ already computes $F_{n-2}$ and $F_{n-3}$ along the way, so once the $F_{n-1}$ call finishes, $F_{n-2}$ is just a constant-time memo lookup. This means we can completely drop one half of the recursion tree (the right half).

Another way of thinking about it is that $F_n$ recurses only the first time it is called; every other time it returns in constant time after the dictionary lookup. So we make $n$ calls to compute $F_n$, and each call takes constant time (because we ignore the recursion). Therefore, the time complexity of computing $F_n$ is now $\Theta(n)$. This is a general technique used to calculate the total cost of any DP with memoization.

DP Cost With Memoization

To see why each call takes constant time and why we can ignore the time taken by recursive calls, imagine an execution of the algorithm.

If you want to compute $F_n$, you need to compute $F_{n-1} + F_{n-2}$. We recurse once (on the $F_{n-1}$ call) and now we need to compute $F_{n-2} + F_{n-3}$. We recurse again (on the $F_{n-2}$ call) and now we need to compute $F_{n-3} + F_{n-4}$, and so on. We eventually get to the base case of $n = 2$, which immediately returns 1. Now we move up one level in the recursion tree, which requires us to compute $F_3$; this returns in constant time since we have $F_2$ and $F_1$ in the memo table (and also by definition). We move up one more level and now we need to compute $F_4$, which also returns in constant time because we have $F_3$ and $F_2$ in the memo table. Similarly, $F_5, F_6, \dots, F_n$ all return in constant time. Therefore, we had $n$ recursive calls or subproblems, and each subproblem takes $O(1)$ to solve.

This is a general method to calculate the running time of a dynamic program.

Running Time = #subproblems * time per subproblem

When calculating the time per subproblem, we ignore all recursive calls and assume they take constant time, as shown above.
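
For the memoized Fibonacci algorithm, this formula works out to -

\[\text{Running Time} = n \text{ subproblems} \times \Theta(1) \text{ per subproblem} = \Theta(n)\]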

Another paradigm for writing the Fibonacci algorithm (and all dynamic programming algorithms) is to start from the bottom up.

def fibonacci(n):
    fib = {}
    for k in range(1, n+1):    # solve subproblems in increasing order
        if k <= 2:
            f = 1
        else:
            f = fib[k-1] + fib[k-2]
        fib[k] = f
    return f

Here, we start calculating from the bottom and go up, but this algorithm performs exactly the same calculations as the memoized algorithm. The bottom-up version also lets you save space compared to the memoized approach: once you calculate a Fibonacci number, you only need the last two values to compute the next one, so you don't need a big dictionary holding every subproblem solved so far. It's also clear that the running time of this algorithm is linear.
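
A minimal sketch of that space optimization, keeping only the last two values (the function and variable names here are my own) -

def fibonacci_constant_space(n):
    prev, curr = 1, 1                  # F(1) and F(2)
    for _ in range(n - 2):             # roll forward until curr = F(n)
        prev, curr = curr, prev + curr
    return curr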

Shortest Paths

Suppose we are trying to solve the single-source shortest path problem from vertex s to vertex v. We can use naive recursion and reduce our current problem down to a smaller subproblem -

  1. Consider all edges going into v. The shortest path has to end with one of them.
  2. Each edge going into v comes from some vertex u. A shortest path ending with the edge (u, v) has weight $\delta(s,u) + w(u,v)$.
  3. The problem (s, v) is now reduced to the smaller subproblems (s, u): $\delta(s,v) = \min_{(u,v) \in E}(\delta(s, u) + w(u, v))$.

But this algorithm does not work on graphs with cycles (we get stuck in an infinite loop).

The intuition is this - suppose you want to calculate the shortest path from s to v, $\delta(s,v)$. We know that the shortest path has to end with one of the edges coming into v. So we pick an edge (u, v) coming into v and assume that it is the last edge on the shortest path. Then, we calculate the shortest path from s to u instead of from s to v. We do this for every edge coming into v and take the minimum value.

\[\delta(s,v)=\min_{(u, v) \in E}(\delta(s, u) + w(u,v))\]

def shortest_path(s, v):
    if s == v:
        return 0    # base case: the empty path has weight 0
    return min(shortest_path(s, u) + weight(u, v) for u in v.incoming_neighbours)

With memoization, and on a DAG (so the recursion terminates), this is an $O(V+E)$ algorithm: there are $V$ subproblems, and the subproblem for v takes time proportional to the indegree of v, which sums to $O(E)$ over all vertices. The memoized recursion is essentially doing a depth-first search.
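
A minimal sketch of that memoized version on a DAG, assuming the graph is represented as a dict mapping each vertex to a list of (predecessor, edge weight) pairs (this representation and the names are my own choices) -

from math import inf

def shortest_paths_dag(incoming, s):
    memo = {}
    def delta(v):
        if v == s:
            return 0                      # base case: empty path
        if v not in memo:                 # solve each subproblem only once
            memo[v] = min((delta(u) + w for u, w in incoming.get(v, [])),
                          default=inf)    # inf if v is unreachable from s
        return memo[v]
    return delta

# Small DAG: edges a->b (weight 2), a->c (weight 5), b->c (weight 1)
incoming = {'b': [('a', 2)], 'c': [('a', 5), ('b', 1)]}
delta = shortest_paths_dag(incoming, 'a')
print(delta('c'))    # 3, via a -> b -> c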