def len_of_string(s):
    if s == '':
        return 0
    return 1 + len_of_string(s[1:])

len_of_string('hello')
5
Recursion is a method of solving problems that involves breaking a problem down into smaller and smaller subproblems until you get to a small enough problem that it can be solved trivially. Usually recursion involves a function calling itself. While it may not seem like much on the surface, recursion allows us to write elegant solutions to problems that may otherwise be very difficult to program.
\begin{align} \text{length}(s) = \begin{cases} 0 & \text{if } s = \text{''} \\ 1 + \text{length}(s[1:]) & \text{otherwise} \end{cases} \end{align}
Let’s execute this function on the string “hello”.
\begin{align} \text{length}(\text{"hello"}) &= 1 + \text{length}(\text{"ello"}) \\ &= 1 + 1 + \text{length}(\text{"llo"}) \\ &= 1 + 1 + 1 + \text{length}(\text{"lo"}) \\ &= 1 + 1 + 1 + 1 + \text{length}(\text{"o"}) \\ &= 1 + 1 + 1 + 1 + 1 + \text{length}(\text{""}) \\ &= 1 + 1 + 1 + 1 + 1 + 0 \\ &= 5 \end{align}
n! = \begin{cases} 1 & \text{if } n = 0 \quad \text{(base case)}\\ (n-1)! \times n & \text{if } n > 0 \quad \text{(recursive case)} \end{cases}
Let’s execute the factorial function for n=3: \begin{align} 3! &= 3 \times 2! \\ &= 3 \times 2 \times 1! \\ &= 3 \times 2 \times 1 \times 0! \\ &= 3 \times 2 \times 1 \times 1 \\ &= 6 \end{align}
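The factorial definition above translates directly into Python — one branch per case, a minimal sketch:

```python
def factorial(n):
    # Base case: 0! = 1
    if n == 0:
        return 1
    # Recursive case: n! = n * (n-1)!
    return n * factorial(n - 1)

print(factorial(3))  # 6
```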
\text{isPalindrome}(s) = \begin{cases} \text{True} & \text{if } s = \text{''} \quad \text{(base case)}\\ \text{False} & \text{if } s[0] \neq s[-1] \quad \text{(base case)}\\ \text{isPalindrome}(s[1:-1]) & \text{if } s[0] = s[-1] \quad \text{(recursive case)} \end{cases}
Executing the function for s = \text{'racecar'}: \begin{align} \text{isPalindrome}(\text{'racecar'}) &= \text{isPalindrome}(\text{'aceca'}) \\ &= \text{isPalindrome}(\text{'cec'}) \\ &= \text{isPalindrome}(\text{'e'}) \\ &= \text{isPalindrome}(\text{''}) \\ &= \text{True} \end{align}
For s = \text{'oello'}: \begin{align} \text{isPalindrome}(\text{'oello'}) &= \text{isPalindrome}(\text{'ell'}) \\ &= \text{False} \end{align}
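The three cases of the definition map one-to-one onto a Python function; a direct transcription:

```python
def is_palindrome(s):
    # Base case: the empty string is a palindrome
    if s == '':
        return True
    # Base case: first and last characters differ
    if s[0] != s[-1]:
        return False
    # Recursive case: strip both ends and recurse
    return is_palindrome(s[1:-1])

print(is_palindrome('racecar'))  # True
print(is_palindrome('oello'))    # False
```

Note that a single-character string also terminates correctly: `s[0] == s[-1]` holds (they are the same character), and `s[1:-1]` is the empty string.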
How to move all disks from the leftmost peg to the rightmost peg?
\text{hanoi}(n, \text{from}, \text{to}, \text{via}) = \begin{cases} \text{do nothing} & \text{if } n = 0 \quad \text{(base case)}\\ \begin{aligned}[t] &\text{hanoi}(n-1, \text{from}, \text{via}, \text{to});\\ &\text{move one disk from } \text{from} \text{ to } \text{to};\\ &\text{hanoi}(n-1, \text{via}, \text{to}, \text{from}) \end{aligned} & \text{if } n > 0 \quad \text{(recursive case)} \end{cases}
def hanoi(n, source, to, via):
    if n == 0:
        return
    else:
        hanoi(n-1, source, via, to)
        print('Move disk from {} to {}'.format(source, to))
        hanoi(n-1, via, to, source)

hanoi(3, 'A', 'C', 'B')
Move disk from A to C
Move disk from A to B
Move disk from C to B
Move disk from A to C
Move disk from B to A
Move disk from B to C
Move disk from A to C
The greatest common divisor (GCD) of two integers is the largest positive integer that divides both of them without a remainder.
It can be computed using the following recursive algorithm (Euclid’s algorithm): \text{gcd}(a, b) = \begin{cases} a & \text{if } b = 0 \quad \text{(base case)}\\ \text{gcd}(b, a \bmod b) & \text{if } b > 0 \quad \text{(recursive case)} \end{cases}
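Euclid's algorithm is a one-liner in Python; a direct transcription of the definition above:

```python
def gcd(a, b):
    # Base case: gcd(a, 0) = a
    if b == 0:
        return a
    # Recursive case: gcd(a, b) = gcd(b, a mod b)
    return gcd(b, a % b)

print(gcd(48, 18))  # 6
```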
How to calculate x^n efficiently?
The naive approach is to multiply x by itself n times, which takes O(n) multiplications.
\text{power}(x, n) = \begin{cases} 1 & \text{if } n = 0 \quad \text{(base case)}\\ x \times \text{power}(x, n-1) & \text{if } n > 0 \quad \text{(recursive case)} \end{cases}
But we can do better in O(\log n) time. \text{power}(x, n) = \begin{cases} 1 & \text{if } n = 0 \quad \text{(base case)}\\ \text{power}(x^2, n/2) & \text{if } n \text{ is even} \quad \text{(recursive case)}\\ x \times \text{power}(x^2, (n-1)/2) & \text{if } n \text{ is odd} \quad \text{(recursive case)} \end{cases}
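The fast-exponentiation definition above can be sketched as follows; each call halves n, giving O(log n) multiplications:

```python
def power(x, n):
    # Base case: x^0 = 1
    if n == 0:
        return 1
    # Recursive case (n even): x^n = (x^2)^(n/2)
    if n % 2 == 0:
        return power(x * x, n // 2)
    # Recursive case (n odd): x^n = x * (x^2)^((n-1)/2)
    return x * power(x * x, (n - 1) // 2)

print(power(2, 10))  # 1024
```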
Given a sorted array A and a value x, how to find if x is in A?
If we check every element in A, we can find x in O(n) time.
But we can do better in O(\log n) time.
\text{binarySearch}(A, x, \text{low}, \text{high}) = \begin{cases} \text{False} & \text{if } \text{low} > \text{high} \quad \text{(base case)}\\ \text{True} & \text{if } A[\text{mid}] = x \quad \text{(base case)}\\ \text{binarySearch}(A, x, \text{low}, \text{mid}-1) & \text{if } A[\text{mid}] > x \quad \text{(recursive case)}\\ \text{binarySearch}(A, x, \text{mid}+1, \text{high}) & \text{if } A[\text{mid}] < x \quad \text{(recursive case)} \end{cases} where \text{mid} = \lfloor (\text{low} + \text{high}) / 2 \rfloor.
def binary_search(a, x, low, high):
    if low > high:
        return False
    else:
        mid = (low + high) // 2
        if a[mid] == x:
            return True
        elif a[mid] < x:
            return binary_search(a, x, mid + 1, high)
        else:
            return binary_search(a, x, low, mid - 1)

a = [1, 2, 3, 4, 5, 6, 7, 8, 9]
print(binary_search(a, 5, 0, len(a)-1))
print(binary_search(a, 10, 0, len(a)-1))
True
False
\text{mergeSort}(A) = \begin{cases} A & \text{if } \text{len}(A) \leq 1 \quad \text{(base case)}\\ \text{merge}(\text{mergeSort}(A[:\text{mid}]), \text{mergeSort}(A[\text{mid}:])) & \text{if } \text{len}(A) > 1 \quad \text{(recursive case)} \end{cases}
\text{merge}(A, B) = \begin{cases} A & \text{if } \text{len}(B) = 0 \quad \text{(base case)}\\ B & \text{if } \text{len}(A) = 0 \quad \text{(base case)}\\ A[0] \text{ + merge}(A[1:], B) & \text{if } A[0] < B[0] \quad \text{(recursive case)}\\ B[0] \text{ + merge}(A, B[1:]) & \text{if } A[0] \geq B[0] \quad \text{(recursive case)} \end{cases}
def merge(a, b):
    # recursive merge
    if len(a) == 0:
        return b
    elif len(b) == 0:
        return a
    else:
        if a[0] < b[0]:
            return [a[0]] + merge(a[1:], b)
        else:
            return [b[0]] + merge(a, b[1:])

def merge_sort(a):
    if len(a) <= 1:
        return a
    else:
        mid = len(a) // 2
        return merge(merge_sort(a[:mid]), merge_sort(a[mid:]))

a = [1, 3, 5, 7, 9, 2, 4, 6, 8, 10]
print(merge_sort(a))
[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
Depth-first search (DFS) is an algorithm for traversing or searching tree or graph data structures. The algorithm starts at the root node (selecting some arbitrary node as the root node in the case of a graph) and explores as far as possible along each branch before backtracking.
The pseudocode for DFS is as follows:
- Mark the current vertex as visited.
- Explore each adjacent vertex that is not in the visited set.
\text{dfs}(G, v) = \begin{cases} \text{do nothing} & \text{if } v \text{ is visited} \quad \text{(base case)}\\ \text{mark } v \text{ as visited and } \text{dfs}(G, w) & \text{for each } w \text{ adjacent to } v \quad \text{(recursive case)} \end{cases}
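The recursive definition above can be sketched in Python. This assumes the graph is given as an adjacency dict mapping each vertex to its neighbors (a representation chosen for illustration; any adjacency structure works):

```python
def dfs(graph, v, visited=None):
    # graph: adjacency dict, e.g. {'A': ['B', 'C'], 'B': ['D'], ...}
    if visited is None:
        visited = set()
    # Base case: v was already visited -> do nothing
    if v in visited:
        return visited
    # Mark v as visited, then recurse on each adjacent vertex
    visited.add(v)
    for w in graph.get(v, []):
        dfs(graph, w, visited)
    return visited

g = {'A': ['B', 'C'], 'B': ['D'], 'C': [], 'D': []}
print(dfs(g, 'A'))  # all four vertices are reachable from 'A'
```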
Problem: Given a chess board of size n \times n and a starting position (x, y), find a sequence of moves of a knight that visits every square exactly once.
\text{knightTour}(n, x, y, \text{visited}) = \begin{cases} \text{True} & \text{if } \text{visited} = n^2 \quad \text{(base case)}\\ \text{False} & \text{if } x \text{ or } y \text{ is out of bounds} \quad \text{(base case)}\\ \text{False} & \text{if } (x, y) \text{ is already visited} \quad \text{(base case)}\\ \text{True} & \text{if } \text{knightTour}(n, x+1, y+2, \text{visited}+1) \quad \text{(recursive case)}\\ \text{True} & \text{if } \text{knightTour}(n, x+1, y-2, \text{visited}+1) \quad \text{(recursive case)}\\ \text{True} & \text{if } \text{knightTour}(n, x-1, y+2, \text{visited}+1) \quad \text{(recursive case)}\\ \text{True} & \text{if } \text{knightTour}(n, x-1, y-2, \text{visited}+1) \quad \text{(recursive case)}\\ \text{True} & \text{if } \text{knightTour}(n, x+2, y+1, \text{visited}+1) \quad \text{(recursive case)}\\ \text{True} & \text{if } \text{knightTour}(n, x+2, y-1, \text{visited}+1) \quad \text{(recursive case)}\\ \text{True} & \text{if } \text{knightTour}(n, x-2, y+1, \text{visited}+1) \quad \text{(recursive case)}\\ \text{True} & \text{if } \text{knightTour}(n, x-2, y-1, \text{visited}+1) \quad \text{(recursive case)} \end{cases}
def knight_tour(n, x, y, visited):
    # print the path
    if len(visited) == n * n:
        print(visited)
        return True, visited
    else:
        for dx, dy in [(1, 2), (2, 1), (-1, 2), (-2, 1),
                       (-1, -2), (-2, -1), (1, -2), (2, -1)]:
            nx, ny = x + dx, y + dy
            if 0 <= nx < n and 0 <= ny < n and (nx, ny) not in visited:
                visited.append((nx, ny))
                res, visited = knight_tour(n, nx, ny, visited)
                if res:
                    return True, visited
                visited.pop()  # backtrack
        return False, visited

res, path = knight_tour(5, 0, 0, [(0, 0)])
print()
# visualize the path just by putting numbers on the board
board = [[0] * 5 for _ in range(5)]
for i, (x, y) in enumerate(path):
    board[x][y] = i + 1
for row in board:
    print(row)
[(0, 0), (1, 2), (2, 4), (0, 3), (1, 1), (3, 0), (4, 2), (3, 4), (2, 2), (4, 3), (3, 1), (1, 0), (0, 2), (1, 4), (3, 3), (4, 1), (2, 0), (0, 1), (1, 3), (2, 1), (4, 0), (3, 2), (4, 4), (2, 3), (0, 4)]
[1, 18, 13, 4, 25]
[12, 5, 2, 19, 14]
[17, 20, 9, 24, 3]
[6, 11, 22, 15, 8]
[21, 16, 7, 10, 23]
Problem: Given a partially filled sudoku board, fill the empty cells.
def is_valid(board, x, y):
    n = len(board)
    # check row
    for j in range(n):
        if j != y and board[x][j] == board[x][y]:
            return False
    # check column
    for i in range(n):
        if i != x and board[i][y] == board[x][y]:
            return False
    # check 3x3 block (cells sharing a row or column are already covered above)
    for i in range(3):
        for j in range(3):
            nx, ny = x // 3 * 3 + i, y // 3 * 3 + j
            if nx != x and ny != y and board[nx][ny] == board[x][y]:
                return False
    return True

def sudoku_solver(board):
    n = len(board)
    for i in range(n):
        for j in range(n):
            if board[i][j] == 0:
                for k in range(1, 10):
                    board[i][j] = k
                    if is_valid(board, i, j) and sudoku_solver(board):
                        return True
                    board[i][j] = 0  # backtrack
                return False
    return True
# realistic board
board = [[0, 0, 0, 0, 0, 0, 0, 0, 0],
[0, 8, 0, 0, 0, 0, 0, 0, 0],
[0, 0, 3, 6, 0, 0, 0, 0, 0],
[0, 7, 0, 0, 9, 0, 2, 0, 0],
[0, 5, 0, 0, 0, 7, 0, 0, 0],
[0, 0, 0, 0, 4, 5, 7, 0, 0],
[0, 0, 0, 1, 0, 0, 0, 3, 0],
[0, 0, 1, 0, 0, 0, 0, 6, 8],
[0, 0, 8, 5, 0, 0, 0, 1, 0]]
sudoku_solver(board)
for row in board:
    print(row)
[1, 2, 4, 7, 3, 8, 6, 5, 9]
[6, 8, 5, 9, 1, 2, 3, 7, 4]
[7, 9, 3, 6, 5, 4, 8, 2, 1]
[3, 7, 6, 8, 9, 1, 2, 4, 5]
[4, 5, 9, 2, 6, 7, 1, 8, 3]
[8, 1, 2, 3, 4, 5, 7, 9, 6]
[5, 4, 7, 1, 8, 6, 9, 3, 2]
[2, 3, 1, 4, 7, 9, 5, 6, 8]
[9, 6, 8, 5, 2, 3, 4, 1, 7]
Further reading on recursive fractals: https://web.stanford.edu/class/archive/cs/cs106b/cs106b.1208/lectures/fractals/Lecture10_Slides.pdf
Some recursive algorithms have overlapping subproblems.
F(n) = \begin{cases} 0 & \text{if } n = 0 \quad \text{(base case)}\\ 1 & \text{if } n = 1 \quad \text{(base case)}\\ F(n-1) + F(n-2) & \text{if } n > 1 \quad \text{(recursive case)} \end{cases}
Let’s compute F(5): \begin{align} F(5) &= F(4) + F(3) \\ &= (F(3) + F(2)) + (F(2) + F(1)) \\ &= ((F(2) + F(1)) + (F(1) + F(0))) + ((F(1) + F(0)) + F(1)) \\ &= (((F(1) + F(0)) + F(1)) + (F(1) + F(0))) + ((F(1) + F(0)) + F(1)) \\ &= (((1 + 0) + 1) + (1 + 0)) + ((1 + 0) + 1) \\ &= 5 \end{align}
How many times do we compute F(2)? F(1)? F(0)?
To solve this, there are a few approaches:
- Memoization: store the results of the subproblems in a table
- Solve the problem iteratively (bottom-up) or using dynamic programming (not covered now)
def fibonacci(n):
    if n <= 1:
        return n
    else:
        return fibonacci(n-1) + fibonacci(n-2)
%time fibonacci(40)
CPU times: user 11.5 s, sys: 47.1 ms, total: 11.5 s
Wall time: 11.6 s
102334155
def fibonacci_with_memoization(n):
    memo = [None] * (n + 1)
    def fib(n):
        if n <= 1:
            return n
        if memo[n] is None:
            memo[n] = fib(n-1) + fib(n-2)
        return memo[n]
    return fib(n)
%time fibonacci_with_memoization(40)
CPU times: user 14 µs, sys: 1e+03 ns, total: 15 µs
Wall time: 16 µs
102334155
11.6 s vs 16 µs!
A recursive function is tail recursive when the recursive call is the last thing the function executes.
Non-tail-recursive version: \text{factorial}(n) = \begin{cases} 1 & \text{if } n = 0 \quad \text{(base case)}\\ n \times \text{factorial}(n-1) & \text{if } n > 0 \quad \text{(recursive case)} \end{cases}
Tail-recursive version:
\text{factorial}(n, \text{acc}) = \begin{cases} \text{acc} & \text{if } n = 0 \quad \text{(base case)}\\ \text{factorial}(n-1, n \times \text{acc}) & \text{if } n > 0 \quad \text{(recursive case)} \end{cases}
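The accumulator version can be sketched in Python as follows. The recursive call is the final action, so nothing remains to be done in the current frame when it returns (note, however, that CPython does not actually exploit this — see below):

```python
def factorial(n, acc=1):
    # Base case: return the accumulated product
    if n == 0:
        return acc
    # Tail call: the multiplication happens *before* the call,
    # so the call itself is the last thing executed.
    return factorial(n - 1, n * acc)

print(factorial(5))  # 120
```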
Why is tail recursion important?
Some compilers and interpreters perform tail call optimization (TCO): the recursive call is replaced with a jump back to the start of the function, so no new stack frame is pushed. This saves space on the call stack and improves performance.
In other words, in languages that perform TCO, tail recursion is as efficient as iteration. (CPython, notably, does not perform TCO, so deep recursion can still overflow the stack there.)
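Because the accumulator carries all the state, the tail-recursive version converts mechanically into a loop — the accumulator becomes a loop variable, and the recursive call becomes the next iteration. A sketch of that transformation:

```python
def factorial_iter(n):
    # acc plays the role of the accumulator in the tail-recursive version;
    # each loop iteration corresponds to one recursive call.
    acc = 1
    while n > 0:
        acc = n * acc
        n -= 1
    return acc

print(factorial_iter(5))  # 120
```

This is exactly the rewriting a TCO-capable compiler performs automatically.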