Import standard modules:


In [ ]:
import numpy as np
import matplotlib.pyplot as plt
%matplotlib inline
from IPython.display import HTML 
HTML('../style/course.css') #apply general CSS

In [ ]:
HTML('../style/code_toggle.html')

4.5 The Fourier Approximation & van Cittert-Zernike Theorem

An interferometer is sampling the sky, but the visibility function associated with the sky. Assuming no distortion of the measurements we still need to simplify the measurement frame to enable imaging by aperture synthesis. The Van Cittert-Zernike theorem will enable us to use simple tools, such a the Fourier Transform to link the visibility function to the sky. In $\S$ 4.5.1, we will start by taking some approximation which will simplifies the Eq. 4.3.2 in $\S$ 4.3 ➞.

4.5.1 Fourier approximations

Before stating the theorem, we need to take several approximations to enable its formulation. Sections $\S$ 4.5.1.A to $\S$ 4.5.1.G cover the various apprixmations from the remote location of the source to the local conditions near the observer.

4.5.1.A Mutual incoherence of the source

In this approximation, we assume that the different part of the sky are statistically independent to each other, meaning that the mutual coherency between two direction is 0. The sky can be described as a collection of point sources. Therefore, the visibility function is linear regarding the directions in the sky as the correlation between $\mathbf{s}_i$ and $\mathbf{s}_j$ is non zero only when $i=j$.

4.5.1.B Far-field approximation

In the ideal case, we are observing sources that lie on the celestial sphere at an infinite distance. The antenna patterns are the Far-Field pattern. The expression of the visibility in a spherical reference is therefore independent of $r$, the distance from the observer to the source. The distance $R >> \frac{|\mathbf{b}_\text{max}|^2}{\lambda}$ so that the dependence of Eq. xx are simplified. All extended sources are confined to the celestial sphere and are seen through their 2D projection on it.

4.5.1.C Homogeneity of the propagation medium

In the simple case of classical radio sources, we assume no propagation effects on the signal (absorption, refraction, distortion). The latter will be invalid when observing:

  • a polarized signal through a magnetized medium (e.g. the Faraday rotation of polarized radio emission through the magnetized IGM/ISM)

  • a pulsar through the IGM medium which cause signal distortions (spectral dispersion and temporal broadening of the pulses)

4.5.1.D Small-field approximation

Because real-life antennas are not isotropic, they are sensitive to a limited portion of the sky defined by their Field of View (FoV). Close to the phase center, the celestial sphere can be approximated by its tangent plane on which the source falling in the FoV will contribute to the measured visibilities.

Assuming that the observing region of the sky is small, we will consider that:

  • the antenna gain is considered constant over the FoV: $A(l,m) \approx cte$
  • the sky is observed as a 2D projection onto the (l,m) plane: $(\alpha_0,\delta_0)\leftrightarrow(l,m)$
  • the extended sources are small relative to the FoV: $\Delta\theta_\text{source} << \Delta\theta_\text{FoV}$
  • $w$ is small and $n$ is close to unity (see $\S$ 4.5.1.E)

4.5.1.E Coplanar baselines

In Eq. 4.3.2 in $\S$ 4.3 ➞, the exponent in factor of the Intensity distribution in the expression of the visibility function is $K(l,m)$ such as $$K(l,m)=\exp \left[ {-\imath 2\pi (ul+vm+w(\sqrt{1-l^2-m^2}-1)) }\right]$$

For a tracking interferometer (compensating for the phase delay of reception of the signal at the different antennas), in the regime of small-field approximation, we are receiving incoherent emission from the ($l$,$m$) plane close to the phase center at ($l=0$,$m=0$).

if ($l$,$m$)<<1 then $w(\sqrt{1-l^2-m^2}-1) \rightarrow 0$.

In that case, the $w$-term can be ignored and $K(l,m)$ then reduces to $$K(l,m)=\exp \left[ {-\imath 2\pi (ul+vm) }\right]$$

Note: $\S$ 4.5.1.D and $\S$ 4.5.1.E will be addressed for thoroughly in [$\S$ 5.5 ➞](../5_Imaging/5_5_widefield_effect.ipynb).

4.5.1.F Narrowband approximation

The correlator is operated in a narrow-bandwith $\Delta \nu$ (condition). The bandwidth will modulate the width of the fringe damping factor taking a form of a sinc function. By assuming a narrow-band approximation, we assume that the width at which the fringe contrast is dampened is infinitely large.

However, under normal circonstances over some bandwidth $\Delta \nu$, the effect of the bandwidth pattern can be limited on the interferometer response if we restrained the observed field within a region around the phase center where the spatial coherency is maintained: $$ \frac{\Delta\nu}{\nu} < \frac{1}{l_\text{max} u}, \frac{1}{m_\text{max} v} $$

This is equivalent to broadlight interferometry where we see the fringes disappear few orders away from the fringe center.

4.5.1.G Continuous sampling approximation

We assume that he sampling of the complex visibility function is continuous, and therefore we do not suffer from sampling effect in the direct space (i.e. convolution with the FT of the sampling function). More details about the effect of the sampling function are given in the imaging chapter ($\S$ 5. ➞).

4.5.2 Van Cittert-Zernike Theorem

Coming from optics, this theorem links the intensity distribution of the sky with the cross-correlation of the signals received by two receiver $R_1$ and $R_2$. Under the assumptions taken in $\S$ 4.5.1, we can now formulate the theorem.

4.5.2.A Formulation

Given a extended, monochromatic and incoherent intensity distribution $I_\nu$ and the complex visibility measured by baseline $pq$, the theorem states the complex visibility function associated with baseline $pq$ is linked to the intensity distribution by a 2D Fourier Transform so that:

$$\boxed{\boxed{V_{pq}(u,v,0)=\int_{-\infty}^\infty\int_{-\infty}^\infty{I_\nu e^{-2\imath\pi (ul+vm)}dldm}}}$$

4.5.2.B Demonstration

Let's assume an monochromatic, extended incoherence intensity distribution as the sky.

The mutual coherence between the signal measured at location of $R_1$ and $R_2$ over the time T, from a distance source is: $$\Gamma_\text{12}(u,v,\tau)= \lim_{T \rightarrow \infty} \frac{1}{2T} \int_{-T}^{T} E_1(t)E_{2}^{*}(t-\tau) dt$$

where the electric fields expresses as:

$$E_1(l,m,t)= E_{0}(l,m,t-\frac{r_1}{c}) \frac{e^{-2\imath\pi \nu(t-\frac{r_1}{c})}}{r_1}$$$$E_2(l,m,t)= E_{0}(l,m,t-\frac{r_2}{c}) \frac{e^{-2\imath\pi \nu(t-\frac{r_2}{c})}}{r_2}$$

where $r_1$ and $r_2$ are the distance of each receiver from the source.

Correlation of $\mathbf{E_1}$ and $\mathbf{E_2}$

\begin{eqnarray} \langle E_1(l_i,m_i,t) E_2^*(l_j,m_j,t) \rangle &=& \langle E_{0} (l_i,m_i,t-\frac{r_1}{c}) E_{0}(l_j,m_j,t-\frac{r_2}{c}) \rangle \frac{e^{-2\imath\pi \nu(t-\frac{r_1}{c})} e^{+2\imath\pi \nu(t-\frac{r_2}{c})}}{r_1 r_2} \\ &=& \langle E_{0}(l_i,m_i,t') E_{0}(l_j,m_j,t'-\frac{r_2-r_1}{c}) \rangle \frac{e^{2\imath\pi \nu \frac{r_1 - r_2}{c}}}{r_1 r_2}\\ &\approx& \langle E_{0}(l,m,t')E_{0}(l,m,t') \rangle \frac{e^{2\imath\pi \nu \frac{r_1 - r_2}{c}}}{r_1 r_2}\\ \end{eqnarray}
  • We used the incoherent sky approximation to consider that $\langle E_{0}(l_i,m_i,t) E_{0}(l_j,m_j,t-\frac{r_2-r_1}{c}) \rangle = 0$ for $i \neq j$.

  • Assuming the far-field approximation and the small field approximation, we have $r_1 \approx r_2 \approx r$. Therefore $(r_1 - r_2)/c$ can be neglected.

We can now write the expression of the mutual coherence $\Gamma_{12}$:

$r_1-r_2$ represents the OPD between the two receivers, and can be expressed as a function of ($u$, $v$) and ($l$, $m$): $$r_1-r_2= \frac{c}{\nu} (u_{12}l+v_{12}m)$$

$$\Gamma_{12}(u,v,\tau=0)= \int_{\text{Source}} { \frac{I_\nu e^{2\imath\pi \nu \frac{(r_1-r_2)}{c}}}{r_1 r_2} }ds = \int_{\text{Source}}{\frac{I_\nu e^{2\imath\pi (u_{12}l+v_{12}m)}}{r^2}} ds$$

As ($l$,$m$) are direction cosines, they can be linked to spherical coordinates. On the projected sphere, the surface element $ds$ can be expressed as: $$ds = r^2 \; dl \; dm$$

The integrand is bounded to zero at the border of the Field of View (due to the antenna response and the limits of temporal coherency, restricting the FoV). Therefore, we can extend the bounds of the integral to infinite bounds without loosing generality.

$\Gamma_{12}$ becomes the complex visibility $V_\nu$

$$ \Gamma_{12}(u,v,\tau=0) = \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} { I_\nu(l,m) e^{-2\imath\pi(u_{12}l+v_{12}m)}} dl dm =V_{12}(u,v,0) $$

Under this form, the complex visibility $V_{12}$ is the Fourier transform of the intensity distribution $I_{\nu}(l,m)$.

More development about this theorem and the links with the classic Fraunhofer diffraction framework can be found in Thompson, Moran and Swenson $\S$ 14, p. 594.

In reality, this theorem can be derived directly from the RIME framework $\S$ 7.2 ➞. Moreover, no all points in the $uv$ plane are sampled therefore a lot of information is missing. But in the scope of these simplifications, the recovery of the intensity distribution $I_{\nu}$ from partial measurement can be done and reduces to the solving of an inverse problem. This is on this partial information that deconvolution techniques are requires.

4.5.3 Chapter concluding remarks

Interferometry in radio consists in the combinations by cross-correlation of the measured signals converted into voltages from incoming incoherent EM waves. In order to make these signals "interfere", we need to devise a system to combine them in phase. We introduced the concept of baseline, projected baseline and phase center. We started in $\S$ 4.1 ➞, by applying the tools introduced in $\S$ 3 ➞ to express the baseline in a convenient reference frame attached to the celestial sphere. In $\S$ 4.2 ➞, we studied the simple case of the 2-element interferometer for which we derived the relevant interferometric quantities (phase center, coherence, visibility).

In $\S$ 4.3 ➞, we extended the 1D case to a 3D definition of the visibility function. To give a physical interpretation on how relevant is the visibility function to study object in the sky, we linked the projected baseline measurement to the result of a spatial filter applied to the sky. Using this information, we demonstrate how the shape and the position of a source could be accurately measured by interferometry ($\S$ 4.3.2).

An interferometer samples the visibility function of the sky in a non-random manner. From the geometry of the interferometer with respect to the observed source, it was possible in $\S$ 4.4.1 ➞ to derive the actual sampling path given by a physical baseline during the course of an observation. They take the form of elliptical path which can potentially enrich the $uv$ plane. The more diversified and numerous are the samples in the $uv$ plane, the better the knowledge of the visibility function. In order to improves the $uv$ coverage, we presented in $\S$ 4.4.2 ➞ three different ways that increase the number of samples.

The collection of samples can be used to retrieve an approximation of the sky. Under a series of approximations $\S$ 4.5.1 ⤵, the definition of the visibility function can be linked to the intensity distribution through a Fourier transformation $\S$ 4.5.2 ⤵.

Once this relationship has been established, it is possible to cast the imaging problem as the inverse problem of recovering the sky $\mathcal{I}_\nu$ from a partial knowledge of its Fourier Transform, described by the sampled visibility function $\mathcal{V}_\nu$ in the scope of the Van Cittert-Zernike theorem.

In $\S$ 5, we address all the principle of imaging with the Fourier Transform and in $\S$ 6, we address all the techniques used to perform the image deconvolution that solve the inverse problem of imaging by aperture synthesis.

Future Additions: