Generating permutations, several approaches with Python

This small notebook implements, in Python 3, several algorithms aiming at a simple task: given a certain list, generate all the permutations of the list.

For instance, for [1, 2], it should give [1, 2] and [2, 1].

References

About


This notebook should be compatible with both Python versions, 2 and 3.


In [2]:
from __future__ import print_function, division  # Python 2 compatibility if needed

1. Reference implementation: from itertools

The itertools module, from the Python standard library, contains a function itertools.permutation:


In [3]:
# Builtin implementation, as a reference
from itertools import permutations as itertools_permutations

This will obviously be the quickest implementation, and there is no hope of beating it with pure Python (in terms of computational speed), as it is written in C and not in Python.

Let's check that it works as wanted:


In [4]:
itertools_permutations([1, 2])


Out[4]:
<itertools.permutations at 0x7fb18581c9e8>

What's that weird result? In fact, itertools.permutations() does not return the list of all permutations, but rather an iterator. It can be looped on in a similar way, and can be converted to a list easily:


In [5]:
for p in itertools_permutations([1, 2]):
    print(p)

list(itertools_permutations([1, 2, 3]))


(1, 2)
(2, 1)
Out[5]:
[(1, 2, 3), (1, 3, 2), (2, 1, 3), (2, 3, 1), (3, 1, 2), (3, 2, 1)]

So, what's the advantage of returning an iterator and not a list of lists? Memory and time efficiency !

The $n!$ permutations (if the list is of size $n$) take both a lot of time to compute and a lot of memory to store, so it's very clever if we can generate one after another, on demand, instead of having to compute all of them and storing them.

The first two algorithms presented below are not iterators, but the last one will be.

Permutations of the list are given as tuples, but there is no difference.

We can check how quick is this first function:


In [13]:
%time len(list(itertools_permutations(list(range(4)))))
%time len(list(itertools_permutations(list(range(8)))))
%time len(list(itertools_permutations(list(range(9)))))


CPU times: user 0 ns, sys: 0 ns, total: 0 ns
Wall time: 29.1 µs
CPU times: user 44 ms, sys: 12 ms, total: 56 ms
Wall time: 114 ms
CPU times: user 172 ms, sys: 32 ms, total: 204 ms
Wall time: 273 ms
Out[13]:
362880

In [15]:
%timeit len(list(itertools_permutations(list(range(10)))))


1 loop, best of 3: 1.17 s per loop

There is $n!$ permutations to generate, so obviously any algorithm is running in $\Omega(n!)$ time to generate all of them, and that is approximately the behavior observed above.

This claim should need better measurements to be really empirically supported!


2. First algorithm : The insert-into-all-positions solution

The basic idea is to separate the first element $x$ of the list, and the rest $xs$.

  • For instance, for $l = [1, 2, 3]$, $x = 1$ and $xs = [2, 3]$.
  • Then the permutations of $l$ are obtained by inserting $x$ in every possible positions of every permutations of $xs$.
  • Here, the permutations of $xs$ are $[2, 3]$ and $[3, 2]$. Inserting $x = 1$ in the first one give $[1, 2, 3]$ (first position), $[2, 1, 3]$ and $[2, 3, 1]$. Similarly, we obtain the last permutations: $[1, 3, 2]$, $[3, 1, 2]$ and $[3, 2, 1]$.

So we first need a function that insert an element $x$ in every possible index of a list $l$.


In [16]:
def ins_all_positions(x, l):
    """Return a list of lists obtained from l by inserting x at every possible index."""
    res = []
    for i in range(0, len(l) + 1):
        res.append(l[:i] + [x] + l[i:])
    return res

Then we write a recursive function, following the description of the algorithm:


In [21]:
from functools import reduce
# reduce(lambda acc, p: acc + f(p), l, []) is the same as the concatenation of list f(p) for p in l

In [22]:
# Now the main permutations generator.
def first_permutations(iterable):
    """Second algorithm, insert-into-all-positions solution."""
    if len(iterable) == 0:
        return []
    # we must specify this edge case
    elif len(iterable) == 1:
        return [[iterable[0]]]
    else:
        x, xs = iterable[0], iterable[1:]
        # reduce is needed instead of a simple sum(...) as sum() only works for numerical values
        return reduce(lambda acc, p: acc + ins_all_positions(x, p), first_permutations(xs), [])

We can try it out, but only on small list as it is not efficient.


In [23]:
first_permutations([1, 2, 3])


Out[23]:
[[1, 2, 3], [2, 1, 3], [2, 3, 1], [1, 3, 2], [3, 1, 2], [3, 2, 1]]

And let's measure its efficiency on small lists of size $4,5,6,7,8$:


In [35]:
%time len(list(first_permutations(list(range(4)))))
%time len(list(first_permutations(list(range(5)))))
%time len(list(first_permutations(list(range(6)))))
%time len(list(first_permutations(list(range(7)))))
%time len(list(first_permutations(list(range(8)))))


CPU times: user 0 ns, sys: 0 ns, total: 0 ns
Wall time: 89.2 µs
CPU times: user 0 ns, sys: 0 ns, total: 0 ns
Wall time: 392 µs
CPU times: user 0 ns, sys: 0 ns, total: 0 ns
Wall time: 16.2 ms
CPU times: user 72 ms, sys: 4 ms, total: 76 ms
Wall time: 168 ms
CPU times: user 4.58 s, sys: 96 ms, total: 4.68 s
Wall time: 8.11 s
Out[35]:
40320

$\implies$ This implementation take about $8 s$ for a list with $n = 8$ elements: it's crazily slow!


3. Second algorithm : The fixed-head solution

The second algorithm will not be more efficient, but it is different in his design.

Instead of inserting an element at every possible index, this second algorithm rather generate the permutations by considering that every element of the list will be the head of some of the permutation.

With a fixed head, ie an element $x$, removed from the list $xs = l \setminus x$, permutations of $l$ are obtained by simply adding $x$ as the head of every permutation of $xs$.

As for the first algorithm, this one is also recursive.

One limitation of its simple implementation below is that it requires all elements in the list to be different, as it will compute the list $xs = l \setminus x$ with this very simple function rm(x, l) :


In [30]:
def rm(x, l):
    """List l without element x."""
    return [y for y in l if x != y]

Note that with comparisons on indexes instead of comparisons on values, we could treat the general case not much harder.

Then, we need, as before, a function to add $x$ as the head of all lists $p$ in a certain list of lists $l$.


In [31]:
def head_of_all(x, l):
    """List of lists from l where x is the head of all the lists."""
    return [[x] + p for p in l]

And finally, the fixed-head algorithm is easy to implement, as a recursive function.

  • The case of en empty list or a list with only one element are easy,
  • The recursion case uses, again, a call to reduce(fun acc, x: acc + f(x), list, []) to permforms the concatenation of all lists f(x) for x in l.

In [32]:
def second_permutations(iterable):
    """Second algorithm, fixed-head solution."""
    if len(iterable) == 0:
        return []
    # we must specify this edge case
    elif len(iterable) == 1:
        return [[iterable[0]]]
    else:
        return reduce(lambda acc, x: acc + head_of_all(x, second_permutations(rm(x, iterable))), iterable, [])

Let's try it out:


In [36]:
second_permutations([1, 2, 3])


Out[36]:
[[1, 2, 3], [1, 3, 2], [2, 1, 3], [2, 3, 1], [3, 1, 2], [3, 2, 1]]

And let's measure its efficiency on small lists of size $4,5,6,7,8$:


In [38]:
%time len(list(second_permutations(list(range(4)))))
%time len(list(second_permutations(list(range(5)))))
%time len(list(second_permutations(list(range(6)))))
%time len(list(second_permutations(list(range(7)))))
%time len(list(second_permutations(list(range(8)))))
%time len(list(second_permutations(list(range(9)))))


CPU times: user 0 ns, sys: 0 ns, total: 0 ns
Wall time: 180 µs
CPU times: user 0 ns, sys: 0 ns, total: 0 ns
Wall time: 869 µs
CPU times: user 4 ms, sys: 0 ns, total: 4 ms
Wall time: 30.1 ms
CPU times: user 44 ms, sys: 0 ns, total: 44 ms
Wall time: 53 ms
CPU times: user 472 ms, sys: 12 ms, total: 484 ms
Wall time: 582 ms
CPU times: user 6.78 s, sys: 88 ms, total: 6.86 s
Wall time: 7.38 s
Out[38]:
362880

$\implies$ this second algorithm is more efficient, as it requires only $0.6 s$ to generate the $8! = 40320$ different permutations of the list $[0, 1, 2, 3, 4, 5, 6, 7]$.


In [40]:
from math import factorial
factorial(8)


Out[40]:
40320

3. Third algorithm : the Johnson-Trotter algorithm

This algorithm is more complicated to explain, I will let you refer to its Wikipedia page, or for more details, this blog post.

We use simple values to denote directions, left or right:


In [41]:
left = False
right = True

We will need a first function to attach a direction to every element of an array t, and then to remove them.


In [51]:
def attach_direction(t, d=left):
    """Attach the direction d to all elements of array t."""
    return [(x, d) for x in t]

In [43]:
def remove_direction(t):
    """Remove the attached direction d to all elements of array t."""
    return [y for y, _ in t]

This classical function swap(t, i, j) exchange the position of the elements t[i] and t[j]:


In [44]:
def swap(t, i, j):
    """Swap t[i] and t[j] in array t."""
    t[i], t[j] = t[j], t[i]

We first need to know if the element a[i] can be moved, according to its attached direction, to the left or right. The rule is that an element can only be swapped to a small element.


In [45]:
def is_movable(a, i):
    """Can a[i] be moved?"""
    x, d = a[i]
    if d == left:
        return i > 0 and x > a[i - 1][0]
    elif d == right:
        return i < len(a) - 1 and x > a[i + 1][0]
    else:
        raise ValueError("unknown direction d = {}".format(d))

Then the function move(a, i) simply swaps a[i] to the left or right, if it is possible.

It raises a ValueError exception if it cannot swap, to be general, but of course the algorithm will never be in such a undesirable state.


In [46]:
def move(a, i):
    """Move it if possible."""
    x, d = a[i]
    if is_movable(a, i):
        if d == left:
            swap(a, i, i - 1)
        elif d == right:
            swap(a, i, i + 1)
        else:
            raise ValueError("unknown direction d = {}".format(d))
    else:
        raise ValueError("not movable")

Then we need a function to scan the array a, from its beginning, to find the largest movable element. This can cost upto a time of $O(n)$ (if $n = \#a$), but it could hardly be improved.


In [47]:
def scan_largest_movable(a):
    """Find the largest movable element."""
    def aux(acc, i):
        if i >= len(a):
            return acc
        else:
            if not is_movable(a, i):
                return aux(acc, i + 1)
            else:
                x, _ = a[i]
                if acc is None:
                    return aux(i, i + 1)
                else:
                    j = acc if x < a[acc][0] else i
                    return aux(j, i + 1)
    return aux(None, 0)

Directions will be flipped, alternating left and right, with flip(d):


In [52]:
def flip(d):
    """Flip direction d : left -> right, right -> left"""
    return not d

Then the list will need to be scanned, and flip the directions of all elements larger than some x:


In [49]:
def scan_flip_larger(x, a):
    """Scan to flip larger."""
    for i, (y, d) in enumerate(a):
        if y > x:
            a[i] = y, flip(d)

We finally have all the pieces needed to implement the Johnson-Trotter algorithm:


In [50]:
def third_permutations(iterable):
    """Third algorithm, Johnson-Trotter algorithm."""
    i = sorted(list(iterable))  # Required by the algorithm
    # We attach directions, and we will only use the array a
    a = attach_direction(i)
    # First permutation
    r = list(iterable)[:]
    while True:
        yield r[:]  # A copy of the current permutation is yielded
        i = scan_largest_movable(a)
        if i is None:  # No more permutation!
            raise StopIteration
        else:
            x, _ = a[i]
            move(a, i)
            scan_flip_larger(x, a)
            # The next permutation should not have direction information attached to it
            r = remove_direction(a)

Yeay, we finally have an iterator on permutations of a list, instead of generating all of them.

Let's try it out:


In [53]:
third_permutations([1, 2, 3])


Out[53]:
<generator object third_permutations at 0x7fb1857f2360>

In [54]:
list(third_permutations([1, 2, 3]))


Out[54]:
[[1, 2, 3], [1, 3, 2], [3, 1, 2], [3, 2, 1], [2, 3, 1], [2, 1, 3]]

And let's measure its efficiency on small lists of size $4,5,6,7,8$:


In [55]:
%time len(list(second_permutations(list(range(4)))))
%time len(list(second_permutations(list(range(5)))))
%time len(list(second_permutations(list(range(6)))))
%time len(list(second_permutations(list(range(7)))))
%time len(list(second_permutations(list(range(8)))))
%time len(list(second_permutations(list(range(9)))))


CPU times: user 0 ns, sys: 0 ns, total: 0 ns
Wall time: 177 µs
CPU times: user 0 ns, sys: 0 ns, total: 0 ns
Wall time: 904 µs
CPU times: user 4 ms, sys: 0 ns, total: 4 ms
Wall time: 6.25 ms
CPU times: user 44 ms, sys: 0 ns, total: 44 ms
Wall time: 47.9 ms
CPU times: user 484 ms, sys: 0 ns, total: 484 ms
Wall time: 752 ms
CPU times: user 6.18 s, sys: 36 ms, total: 6.21 s
Wall time: 6.61 s
Out[55]:
362880

$\implies$ the Johnson-Trotter algorithm is, as expected, quicker than the previous naive implementations, but it's still pretty slow compared to the reference implementation itertools.permutation.


4. Comparing our Johnson-Trotter implementation and itertools.permutation

To compare them fairly, we need to run them several times:


In [63]:
%timeit len(list(itertools_permutations([1, 2, 3])))
%timeit len(list(third_permutations([1, 2, 3])))


The slowest run took 7.44 times longer than the fastest. This could mean that an intermediate result is being cached.
100000 loops, best of 3: 2.13 µs per loop
10000 loops, best of 3: 51.9 µs per loop

In [64]:
%timeit len(list(itertools_permutations([1, 2, 3, 4, 5])))
%timeit len(list(third_permutations([1, 2, 3, 4, 5])))


The slowest run took 5.71 times longer than the fastest. This could mean that an intermediate result is being cached.
10000 loops, best of 3: 14.5 µs per loop
1000 loops, best of 3: 1.52 ms per loop

However, IPython's %timeit function warns us that itertools.permutation could use caching, and that could bias the result.

One easy way to remove any caching is to test on different input lists, and that can be done, for instance, with random lists.


In [61]:
from numpy.random import choice

def random_list_of_size_n(n=5, N=1000):
    return list(choice(list(range(1, N + 1)), size=n, replace=False))

random_list_of_size_n(5)


Out[61]:
[384, 283, 979, 482, 388]

In [65]:
%timeit len(list(itertools_permutations(random_list_of_size_n(5))))
%timeit len(list(third_permutations(random_list_of_size_n(5))))


1000 loops, best of 3: 318 µs per loop
100 loops, best of 3: 2.1 ms per loop

In [66]:
%timeit len(list(itertools_permutations(random_list_of_size_n(6))))
%timeit len(list(third_permutations(random_list_of_size_n(6))))


1000 loops, best of 3: 313 µs per loop
100 loops, best of 3: 10.6 ms per loop

5. Testing our $3$ implementations

Additionnally to comparing the speed efficiency, we would also like to simply check that all the functions we wrote are working as expected!


In [72]:
def test(list_of_f, iterable):
    """ Test that all functions in list_of_f give the same list of permutation on this iterable."""
    print("Testing for the list of functions {} ...".format([f.__name__ for f in list_of_f]))  # DEBUG
    result = True
    print("Testing for the iterable {} ...".format(iterable))  # DEBUG
    i = iterable
    allperms = []
    for f in list_of_f:
        allperms.append(sorted([list(p) for p in f(iterable)]))
    for i, pi in enumerate(allperms):
        for j in range(i + 1, len(allperms)):
            pj = allperms[j]
            if pi != pj:
                print(" - Function #{} ({.__name__}) gave a different list of permutations as function #{} ({.__name__}) ...".format(i, list_of_f[i], j, list_of_f[j]))  # DEBUG
                result = False
            else:
                print(" - Function #{} ({.__name__}) gave the same list of permutations as function #{} ({.__name__}) ...".format(i, list_of_f[i], j, list_of_f[j]))  # DEBUG
    return result

We will test and compare the reference implementation, itertools.permutation, with the three other implementations given above.


In [73]:
list_of_f = [itertools_permutations, first_permutations, second_permutations, third_permutations]

In [74]:
iterable = [1, 2, 3]
test(list_of_f, iterable)


Testing for the list of functions ['permutations', 'first_permutations', 'second_permutations', 'third_permutations'] ...
Testing for the iterable [1, 2, 3] ...
 - Function #0 (permutations) gave the same list of permutations as function #1 (first_permutations) ...
 - Function #0 (permutations) gave the same list of permutations as function #2 (second_permutations) ...
 - Function #0 (permutations) gave the same list of permutations as function #3 (third_permutations) ...
 - Function #1 (first_permutations) gave the same list of permutations as function #2 (second_permutations) ...
 - Function #1 (first_permutations) gave the same list of permutations as function #3 (third_permutations) ...
 - Function #2 (second_permutations) gave the same list of permutations as function #3 (third_permutations) ...
Out[74]:
True

In [75]:
iterable = [1, 2, 3, 4, 5]
test(list_of_f, iterable)


Testing for the list of functions ['permutations', 'first_permutations', 'second_permutations', 'third_permutations'] ...
Testing for the iterable [1, 2, 3, 4, 5] ...
 - Function #0 (permutations) gave the same list of permutations as function #1 (first_permutations) ...
 - Function #0 (permutations) gave the same list of permutations as function #2 (second_permutations) ...
 - Function #0 (permutations) gave the same list of permutations as function #3 (third_permutations) ...
 - Function #1 (first_permutations) gave the same list of permutations as function #2 (second_permutations) ...
 - Function #1 (first_permutations) gave the same list of permutations as function #3 (third_permutations) ...
 - Function #2 (second_permutations) gave the same list of permutations as function #3 (third_permutations) ...
Out[75]:
True

In [76]:
iterable = [1, 2, 3, 4, 5, 6]
test(list_of_f, iterable)


Testing for the list of functions ['permutations', 'first_permutations', 'second_permutations', 'third_permutations'] ...
Testing for the iterable [1, 2, 3, 4, 5, 6] ...
 - Function #0 (permutations) gave the same list of permutations as function #1 (first_permutations) ...
 - Function #0 (permutations) gave the same list of permutations as function #2 (second_permutations) ...
 - Function #0 (permutations) gave the same list of permutations as function #3 (third_permutations) ...
 - Function #1 (first_permutations) gave the same list of permutations as function #2 (second_permutations) ...
 - Function #1 (first_permutations) gave the same list of permutations as function #3 (third_permutations) ...
 - Function #2 (second_permutations) gave the same list of permutations as function #3 (third_permutations) ...
Out[76]:
True

That's it for today, folks!

More notebooks can be found on my GitHub page.