In [1]:

    
from IPython.core.display import HTML
css_file = './custom.css'
HTML(open(css_file, "r").read())









    Out[1]:

Content provided under a Creative Commons Attribution license, CC-BY 4.0; code under MIT License. (c)2015 David I. Ketcheson

Version 0.1 - March 2016



In [ ]:

    
import numpy as np
%matplotlib inline
import matplotlib
import matplotlib.pyplot as plt
import matplotlib.animation
from JSAnimation import IPython_display # if you don't have this, get it from: https://github.com/jakevdp/JSAnimation

The FFT, aliasing, and filtering

Welcome to lesson 3. Here we'll learn about the Fast Fourier Transform, which we've been using all along. We'll also learn about a numerical pathology of pseudospectral methods (known as aliasing) and one way to avoid it (known as filtering or dealiasing).

The fast Fourier transform

We won't go into great detail regarding the FFT algorithm, since there is already an excellent explanation of the Fast Fourier Transform in Jupyter Notebook form available on the web:

Understanding the FFT Algorithm by Jake Vanderplas

Suffice it to say that the FFT is a fast algorithm for computing the discrete Fourier transform (DFT):

$$ \hat{u}_\xi = \sum_{j=0}^{m-1} u_j e^{-2\pi i \xi j/m} $$

or its inverse. The DFT, as we know, is linear and can be computed by multiplying $u$ by a certain $m\times m$ dense matrix $F$. Multiplication by a dense matrix requeires ${\mathcal O}(m^2)$ operations. The FFT is a shortcut to compute that matrix-vector product in just ${\mathcal O}(m \log m)$ operations by taking advantage of the special structure of $F$.

This is very important for pseudospectral methods, since most of the computational work occurs in computing the Fourier transform and its inverse. It's also important that we make use of a compiled version of the FFT, since native Python code is relatively slow. The np.fft function is an interface to a compiled FFT library.

Ordering of wavenumbers in FFT output

The vector returned by np.fft contains the Fourier coefficients of $u$ corresponding to the wavenumbers

$$ \frac{2\pi}{L} \{-m/2, -m/2 + 1, \dots, m/2 - 1\}. $$

However, for computational efficiency the output vector does not use the natural ordering above. The ordering it uses can be obtained with the following command.



In [ ]:

    
m = 16
L = 2*np.pi

xi=np.fft.fftfreq(m)*m/(L/(2*np.pi))
print xi

As you can see, the return vector starts with the nonnegative wavenumbers, followed by the negative wavenumbers. It may seem strange to you that the range of wavenumbers returned is not symmetric; in the case above, it includes $-8$ but not $+8$. This apparent asymmetry can be explained once one understands the phenomenon known as aliasing.

Aliasing

A numerical grid has a limited resolution. If you try to represent a rapidly-oscillating function with relatively few grid points, you will observe an effect known as aliasing. This naturally limits the range of frequencies that can be modelled on a given grid. It can also lead to instabilities in pseudospectral simulations, when generation of high frequencies leads to buildup of lower-frequency energy due to aliasing.

The code below plots a sine wave of a given frequency, along with its representation on a grid with $m$ points. Try changing $p$ and notice how for $m<2p$ the function looks like a lower-frequency mode.



In [ ]:

    
from IPython.display import display, HTML
from IPython.html import widgets
from IPython.html.widgets import interact, interactive

def plot_sine(wavenumber=4,grid_points=12,plot_sine='On'):
    "Plot sin(2*pi*p), sampled at m equispaced points."
    x  = np.linspace(0,1,grid_points+2); # grid
    xf = np.linspace(0,1,1000) # fine grid
    y  = np.sin(wavenumber*np.pi*x)
    yf = np.sin(wavenumber*np.pi*xf)
    fig = plt.figure(figsize = (8, 6));
    ax = fig.add_subplot(1,1,1);
    if plot_sine == 'On':
        ax.plot(xf, yf, 'r-', linewidth=2);
    ax.plot(x,  y,  'o-', lw=2)

interact(plot_sine, wavenumber=(1,44,1), grid_points=(10, 16, 1), plot_sine=(('On','Off')));

Exercise

Try to answer the questions below with pencil and paper; then check them by modifying the code above.

For a given number of grid points $m$, which wavenumbers $p$ will be aliased to the $p=0$ mode? Which will be aliased to $p=1$? Can you explain why?
What is the highest frequency mode that can be represented on a given grid?

After completing the exercise, explain why the sequence of wavenumbers given by np.fft.fftfreq is not, in fact, asymmetric after all.

Aliasing as a source of numerical instability

As we have seen, aliasing means that wavenumbers of magnitude greater than $\pi m/L$ are incorrectly represented as lower wavenumbers on a grid with $m$ points. This suggests that we shouldn't allow larger wavenumbers in our numerical solution. For linear problems, this simply means that we should represent the initial condition by a truncated Fourier series containing modes with wavenumbers less than $\pi m/L$. This happens naturally when we sample the function at the grid points. As we evolve in time, higher frequencies are not generated due to the linearity of the problem.

Nonlinear problems are a different story. Let's consider what happens when we have a quadratic term like $u^2$, as in Burgers' equation. In general, if the grid function $u$ contains wavenumbers up to $\pi m/L$, then $u^2$ contains frequencies up to $2 \pi m/L$. So each time we compute this term, we generate high frequencies that get aliased back to lower frequencies on our grid. Clearly this has nothing to do with the correct mathematical solution and will lead to errors. Even worse, this aliasing effect can, as it is repeated at every step, lead to an instability that causes the numerical solution to blow up.

$$ \newcommand{\F}{\mathcal F} \newcommand{\Finv}{{\mathcal F}^{-1}} $$

An illustration of aliasing instability: the Korteweg-de Vries equation

To see aliasing in practice, we'll consider the KdV equation, which describes certain kinds of water waves:

$$ u_t = -u u_x - u_{xxx} $$

A natural pseudospectral discretization is obtained if we compute the spatial derivatives via

\begin{align} u_x & = \Finv(i\xi \F(u)) \\ u_{xxx} & = \Finv(-i\xi^3 \F(u)). \end{align}

This gives $$ U'(t) = -D[U] \Finv(i\xi \F(U)) - \Finv(-i\xi^3 \F(U)). $$

This is identical to our discretization of Burgers' equation, except that now we have a third-derivative term. explain dispersion here

The largest-magnitude eigenvalues of the Jacobian for this semi-discretization are related to the 3rd-derivative term. If we consider only that term, the eigenvalues are

$$-i \xi^3$$

where $\xi$ lies in the range $(-m/2, m/2)$. So we need the time step to satisfy $\Delta t (m/2)^3 \in S$, where $S$ is the region of absolute stability of a given time integrator.

For this example we'll use a 3rd-order Runge-Kutta method:



In [ ]:

    
def rk3(u,xi,rhs):
    y2 = u + dt*rhs(u,xi)
    y3 = 0.75*u + 0.25*(y2 + dt*rhs(y2,xi))
    u_new = 1./3 * u + 2./3 * (y3 + dt*rhs(y3,xi))
    return u_new

Let's check the size of the imaginary axis interval contained in this method's absolute stability region:



In [ ]:

    
from nodepy import rk
ssp33 = rk.loadRKM('SSP33')
print ssp33.imaginary_stability_interval()

Now we'll go ahead and implement our solution, making sure to set the time step according to the condition above.



In [ ]:

    
def rhs(u, xi, equation='KdV'):
    uhat = np.fft.fft(u)
    if equation == 'Burgers': 
        return -u*np.real(np.fft.ifft(1j*xi*uhat)) + np.real(np.fft.ifft(-xi**2*uhat))
    elif equation == 'KdV':
        return -u*np.real(np.fft.ifft(1j*xi*uhat)) - np.real(np.fft.ifft(-1j*xi**3*uhat))
    
# Grid
m = 256
L = 2*np.pi
x = np.arange(-m/2,m/2)*(L/m)
xi = np.fft.fftfreq(m)*m*2*np.pi/L

dt = 1.73/((m/2)**3)

A = 25; B = 16;
u = 3*A**2/np.cosh(0.5*(A*(x+2.)))**2 + 3*B**2/np.cosh(0.5*(B*(x+1)))**2
tmax = 0.006

uhat2 = np.abs(np.fft.fft(u))

num_plots = 50
nplt = np.floor((tmax/num_plots)/dt)
nmax = int(round(tmax/dt))

fig = plt.figure()
axes = fig.add_subplot(211)
axes2 = fig.add_subplot(212)
line, = axes.plot(x,u,lw=3)
line2, = axes2.semilogy(xi,uhat2)
xi_max = np.max(np.abs(xi))
axes2.semilogy([xi_max/2.,xi_max/2.],[1.e-3,25000],'--r')
axes2.semilogy([-xi_max/2.,-xi_max/2.],[1.e-3,25000],'--r')

frames = [u.copy()]
tt = [0]
uuhat = [uhat2]

for n in range(1,nmax+1):
    u_new = rk3(u,xi,rhs)

    u = u_new.copy()
    t = n*dt
    # Plotting
    if np.mod(n,nplt) == 0:
        frames.append(u.copy())
        tt.append(t)
        uhat2 = np.abs(np.fft.fft(u))
        uuhat.append(uhat2)
        
def plot_frame(i):
    line.set_data(x,frames[i])
    line2.set_data(np.sort(xi),uuhat[i][np.argsort(xi)])
    axes.set_title('t= %.2e' % tt[i])
    axes.set_xlim((-np.pi,np.pi))
    axes.set_ylim((-100,3000))
    
matplotlib.animation.FuncAnimation(fig, plot_frame, frames=len(frames), interval=20)

In the output, we're plotting the solution (top plot) and its Fourier transform (bottom plot). There are a lot of interesting things to say about the solution, but for now let's focus on the Fourier transform. Notice how the wavenumbers present in the solution remain in the lower half of those representable on the grid (this region is delimited by the dashed red lines). Because of this, no aliasing occurs.

Now change the code above to use only $m=128$ grid points. What happens?

Explanation

Here we will give a somewhat simplified explanation of the blow-up just observed. First, this blowup has nothing to do with the absoute stability condition -- when we change $m$, the time step is automatically changed in a way that will ensure absolute stability. If you're not convinced, try taking the time step even smaller; you will still observe the blowup.

By taking $m=128$, we cut by half the wavenumbers that can be represented on the grid. As you can see from the plots, this means that some of the wavenumbers present in the initial data are in the upper half of the representable range (i.e., outside the dashed red lines). That means that the highest wavenumbers generated by the quadratic term will be aliased -- and they will be aliased back into that upper-half range. This leads to a gradual accumulation of high-wavenumber modes, easily visible in both plots. Eventually the high-wavenumber modes dominate the numerical solution and lead to blowup.

For a detailed discussion of aliasing instabilities, see Chapter 11 of John Boyd's "Chebyshev and Fourier Spectral Methods".

Filtering

How can we avoid aliasing instability? The proper approach is to ensure that the solution is well resolved, so that the instability never appears. However, this may entail a very substantial computational cost. One way to ensure stability even if the solution is underresolved is by filtering, which is also known as dealiasing. In general it is unwise to rely on filtering, since it can mask the fact that the solution is not resolved (and hence not accurate). But understanding filtering can give a bit more insight into aliasing instability itself.

At the most basic level, filtering means removing the modes that lead to aliasing. This can be done by damping the high wavenumbers or simply zeroing them when computing the $(u^2)_x$ term. The obvious approach would be to filter the upper half of all wavenumbers, but this is overkill. In fact, it is sufficient to filter only the uppermost third. To see why, notice that the aliased modes resulting from the lower two-thirds will appear in the uppermost third of the range of modes, and so will be filtered at the next step.

A simple 2/3 filter is implemented in the code below.



In [ ]:

    
def rhs(u, xi, filtr, equation='KdV'):
    uhat = np.fft.fft(u)
    if equation == 'Burgers': 
        return -u*np.real(np.fft.ifft(1j*xi*uhat)) + np.real(np.fft.ifft(-xi**2*uhat))
    elif equation == 'KdV':
        return -u*np.real(np.fft.ifft(1j*xi*uhat*filtr)) - np.real(np.fft.ifft(-1j*xi**3*uhat))
    
def rk3(u,xi,rhs,filtr):
    y2 = u + dt*rhs(u,xi,filtr)
    y3 = 0.75*u + 0.25*(y2 + dt*rhs(y2,xi,filtr))
    u_new = 1./3 * u + 2./3 * (y3 + dt*rhs(y3,xi,filtr))
    return u_new
    
# Grid
m = 128
L = 2*np.pi
x = np.arange(-m/2,m/2)*(L/m)
xi = np.fft.fftfreq(m)*m*2*np.pi/L
filtr = np.ones_like(xi)
xi_max = np.max(np.abs(xi))
filtr[np.where(np.abs(xi)>xi_max*1./2)] = 0.

dt = 1.73/((m/2)**3)

A = 25; B = 16;
u = 3*A**2/np.cosh(0.5*(A*(x+2.)))**2 + 3*B**2/np.cosh(0.5*(B*(x+1)))**2
tmax = 0.006

uhat2 = np.abs(np.fft.fft(u))

num_plots = 50
nplt = np.floor((tmax/num_plots)/dt)
nmax = int(round(tmax/dt))

fig = plt.figure()
axes = fig.add_subplot(211)
axes2 = fig.add_subplot(212)
line, = axes.plot(x,u,lw=3)
line2, = axes2.semilogy(xi,uhat2)
axes2.semilogy([xi_max/2.,xi_max/2.],[1.e-3,25000],'--r')
axes2.semilogy([-xi_max/2.,-xi_max/2.],[1.e-3,25000],'--r')

frames = [u.copy()]
tt = [0]
uuhat = [uhat2]

for n in range(1,nmax+1):
    u_new = rk3(u,xi,rhs,filtr)

    u = u_new.copy()
    t = n*dt
    # Plotting
    if np.mod(n,nplt) == 0:
        frames.append(u.copy())
        tt.append(t)
        uhat2 = np.abs(np.fft.fft(u))
        uuhat.append(uhat2)
        
def plot_frame(i):
    line.set_data(x,frames[i])
    line2.set_data(np.sort(xi),uuhat[i][np.argsort(xi)])
    axes.set_title('t= %.2e' % tt[i])
    axes.set_xlim((-np.pi,np.pi))
    axes.set_ylim((-100,3000))
    
matplotlib.animation.FuncAnimation(fig, plot_frame, frames=len(frames), interval=20)

Notice how the solution remains stable, but small wiggles appear throughout the domain. These are a hint that something is not sufficiently resolved.