In [ ]:
import numpy as np
import matplotlib.pyplot as plt
%matplotlib inline
from IPython.display import HTML 
HTML('../style/course.css') #apply general CSS

Import section specific modules:


In [ ]:
pass

In [ ]:
HTML('../style/code_toggle.html')

Fundamentals of Radiative Transfer

All information about astrophysical sources is obtained through measuring the electromagnetic radiation received from them. Therefore, to determine the properties of astrophysical systems, it is essential to understand the physical processes that contribute to the emitted radiation and how it gets modified via propagation through the intervening space and matter before getting detected. This chapter introduces basic physical quantities used to represent the strength of the radiation field in the radio wave part of the electromagnetic spectrum and describes its interaction with the intervening media.

Specific Intensity and its moments

The radiative energy ($dE$) crossing an area $dA$ with unit normal $\vec{n}$ in a direction within solid angle $d\Omega$, in time $dt$ and with frequency in the range [$\nu$, $\nu$ + d$\nu$] is,

$$ dE = I_{\nu} \, dA \, dt \, d\Omega \, d\nu $$

The quantity $I_\nu$($\Omega$) is called monochromatic specific intensity or brightness and has units of $\rm{ergs} \, \rm{s}^{-1} \, \rm{cm}^{-2} \, \rm{Hz}^{-1} \, \rm{Sr}^{-1}$. It is the fundamental quantity in classical radiative transfer and is conserved along a ray in empty space (however, see next section).

When evaluating the radiation field, it is often satisfactory to integrate out the dependence on photon propagation direction and consider only the distribution in space, time and frequency. This averaging over solid angle $d \Omega$ leads to following two useful angular moments of $I_\nu$($\nu$):

Zeroth angular moment or mean intensity: $J_\nu$ = $\frac{1}{4\pi} \int$ $I_{\nu} \, d \Omega$, and

First angular moment or flux density: $F_\nu$ = $\frac{1}{4\pi} \int$ $I_\nu$ $\vec{n} \, d \Omega$.

$J_{\nu}$ is related to specific energy density, $U_{\nu}$ as: $U_{\nu}$ = $\frac{4\pi}{c}$ $J_\nu$.

It represents the energy per unit volume per unit frequency range, measured in units of $\rm{ergs} \, \rm{cm}^{-3} \, \rm{Hz}^{-1}$.

Note that for an isotropic radiation field, the first angular moment i.e. flux is zero. The units of flux density are $\rm{ergs} \, \rm{s}^{-1} \, \rm{cm}^{-2} \, \rm{Hz}^{-1}$. In radio astronomy, it is generally convenient to measure it in units of Jansky (symbol Jy), where $1 \, \rm{Jy} = 10^{-26} \, \rm{ergs} \, \rm{s}^{-1} \, \rm{cm}^{-2} \, \rm{Hz}^{-1}$ or $1 \, \rm{Jy} = 10^{-26} \, \rm{W} \, \rm{cm}^{-2} \, \rm{Hz}^{-1}$.

Equation of Radiative Transfer

The equation of radiative transfer describes how the specific intensity changes as it propagates through matter. Consider a beam of radiation with specific intensity $I_\nu$ incident on a slab of material and let s be the path length through the slab. In the absence of any source or sink, the specific intensity remains constant i.e. $dI_\nu/ds$ = 0. However, in general, the specific intensity along the path will increase because of emission and decrease because of absorption. The intensity may also change due to scattering. Assuming no scattering, the change in $I_\nu$ along the path can be described as

$$ \frac{dI_\nu}{d\nu} = -\alpha_\nu I_\nu + j_\nu $$

where $\alpha_\nu$ and $j_\nu$ are absorption and emission coefficients, respectively. The radiative transfer equation incorporates macroscopic aspects of emission and absorption due to different physical processes through these coefficients. The emission coefficient, $j_\nu$, has units of $\rm{ergs} \, \rm s^{-1} \, \rm{cm}^{-3} \, \rm{Hz}^{-1} \, \rm{Sr}^{-1}$ and represents energy emitted per unit time per unit solid angle per unit frequency range and per unit volume. The absorption coefficient, $\alpha_\nu$, has units of $\rm{cm}^{-1}$ and represents the loss of intensity along the path length ds.

For a medium with particle density $n$ ($\rm{cm}^{-3}$) and $\sigma_\nu$ ($\rm{cm}^2$) as the effective absorbing area or cross section offered by the particles, the absorption coefficient can be defined as, $\alpha_\nu (\rm{cm}^{-1}) = n \sigma_\nu$. The inverse of the absorption coeffcient represents the mean free path i.e. mean physical distance travelled by the photon in the medium before it is absorbed. For mathematical simplification, it is further convenient to define another variable $\tau_\nu$ called optical depth defined as, $d\tau_\nu$ = $\alpha_\nu ds$ and rewrite the radiative transfer equation as,

$$ \frac{dI_\nu}{d\tau_\nu} = -I_\nu + S_\nu $$

where $S_\nu$ is defined as $\frac{j_\nu}{\alpha_\nu}$. The optical depth is a particularly useful quantity in identifying regions with significant absorption and is one of the key observables. Specifically, a medium is said to be optically thick when $\tau_\nu \geq 1$ and optically thin when $\tau_\nu \leq 1$. In a medium that is optically thick, the photon at frequency $\nu$ cannot traverse the entire medium without being absorbed.

The equation of radiative transfer can be formally solved to obtain the solution,

$$ I_\nu(\tau_\nu) = I_\nu(0) e^{-\tau_\nu} + \int e^{-(\tau_\nu - \tau^{'}_\nu)} S(\tau^{'}_\nu)d\tau^{'}_\nu. $$

The solution can be interpreted as the sum of two terms: (i) incident intensity reduced by absorption and (ii) integrated source reduced by absorption. Further insights can be obtained by considering the case of a constant source function and using Rayleigh-Jeans approximation ($h\nu << kT$) to rewrite the equation in terms of brightness temperature $T_B$ which is related to specific intensity through, $I_\nu = 2(\frac{\nu^2}{c^2}) \, k \, T_B$. In this case the solution obtained above reduces to

$$ T_B(\tau_\nu)=T_B(0) e^{-\tau_\nu} + T(1- e^{-\tau_\nu}).$$

Here $T_B$(0) is the brightness temperature of the incident radiation and $T$ is the temperature of the cloud. In case of optically thick cloud, $T_B$ approaches $T$, i.e. the temperature of the cloud.

Radiative transport through a Hydrogen cloud

A hydrogen cloud, under certain conditions that will be elaborated in subsequent sections, can be conveniently detected via the 21-cm line of HI. In an HI atom, the electron in the ground state 1s can have its spin either parallel or antiparallel to the spin of the proton. The coupling of the electron’s magnetic moment to the magnetic moment of the proton leads to hyperfine splitting of the parallel and anti-parallel states. The anti-parallel spin state has a lower energy. Relative to this, the anti-parallel spin state has an energy $E = 5.87 \times 10^{-6} \,\rm{eV}$ and is the origin of HI 21-cm line at $\nu= 1420.405752 \, \rm{MHz}$ (evaluating Einstein’s equality $E = m \, c^2$). The Einstein's A coefficient $A_{21}$ for an atom to spontaneously decay from the excited hyperfine state by emitting a 21-cm line photon is $A_{21} = 2.85 \times 10^{-15} \, \rm{s}^{-1}$. This implies that an HI atom in the excited hyperfine state will typically take about $\frac{1}{A_{21}} \sim 10^7$ years to spontaneously decay into the ground state.

This low value of $A_{21}$ is the direct consequence of 21-cm line being due to magentic dipole transition and occuring at a very low frequency. Contrast this with the Lyman $\alpha$ excitation (rest wavelength $\lambda = 1216 \dot{A}$, i.e. in the ultraviolet) which is an electric dipole transition and has $A_{21} = 6 \times 10^8 \, \rm{s}^{-1}$. This implies that when excited to the $n=2$ state, the atom can stay there only for about $10^{-9} \, \rm{s}$ !

Consider a cloud of neutral hydrogen in front of a radio source of brightness temperature $T_B$. Let $n_1$ and $n_2$ be the number density of atoms ($\rm{cm}^{-3}$) in the lower and upper hyperfine states of the atom. Now there are a number of ways an atom or this two-level system can make transition from one state to another. The transitions from ground to excited state would give rise to absorption and the reverse would give rise to emission. Let, $R_{12}$ be the rate of change of number of atoms transiting from levels 1 to 2. Then,

$$ R_{12} = B_{12} \, n_1 \, U_\nu, $$

where the constant of proprtionality $B_{12}$ is Einstein's B coefficient. An atom excited to level 2 can deexcite to level 1 in two possible ways: (i) spontaneous emission (at a rate $A_{21} \, n_2$) and (ii) simulated emission (at a rate $B_{21} \, n_2 \, U_\nu$). Therefore, the total rate of change of number of atoms from 2 to 1 is

$$ R_{21} = A_{21} \, n_{2} + B_{21} \, n_{2} \, U_\nu. $$

Thus, we have three coefficients characterizing radiative transitions between levels 1 and 2: $A_{21}$ , $B_{21}$ and $B_{12}$. In fact, these three are related to each other. Considering thermodynamic equilibrium in which (i) the number of upward transitions equals the number of downward transitions (i.e. $R_{12} = R_{21}$), (ii) the upper and lower levels are populated according to the Boltzmann equation (i.e. $\frac{n_2}{n_1} = \frac{g_2}{g_1}e^\frac{-(E_2 - E_1)}{kT}$, and (iii) $ U_\nu = \frac{8\pi h\nu^3}{c^3} \frac{1}{e^{\frac{h\nu}{kT}}-1} $, it can be shown that

$$ \frac{B_{12}}{B_{21}} = \frac{g_2}{g_1};\, \, \, B_{21} = \frac{c^3}{8\pi h\nu^3} A_{21} \, \, \, .$$

Even when the system is not in thermodynamic equilibrium, we can use Boltzmann equation to relate the level populations as,

$$ \frac{n_2}{n_1} = \frac{g_2}{g_1}e^\frac{-(E_2 - E_1)}{kT_s} $$

where $T_s$ referred to as the spin temperature, and is nothing but a convenient shorthand for the relative populations of two state. For HI 21-cm line, $(E_2 – E_1)/k = 0.068 \, \rm{K}$. Therefore, for all conditions of interest for a neutral hydrogen cloud we expect $T_s \gg 0.068 \, \rm{K}$, implying $n_2/n_1 \sim 3$; that is, the upper level contains $\sim75$% of the atoms. In contrast, for HI Ly$\alpha$ all the atoms are in the $n=1$ ground state.

Now, it is interesting to ask what determines level population of hydrogen cloud i.e. its spin temperature. A detailed solution to this can be found in the classic paper by Field (1958) Field (1958).

The spin temperature ($T_s$) is determined by three processes: (1) ambient radio background, characterized by the temperature, $T_R$, (2) collisions with other hydrogen atoms and with electrons, characterized by the gas kinetic temperature $T_K$ and (3) resonant scattering of $\rm{Ly} \alpha$ photons, characterized by the color temperature, $T_L$. Under the (reasonable) assumption that the rate of these processes is fast compared with the de-excitation time of the line, the spin temperature can be approximated as,

$$ T_s = \frac{ T_R + y_cT_K + y_L \, T_L}{1 + y_c + y_L}. $$

Here, $y_c = \frac{T_* \, R^{c}_{21}}{T_K \, A_{21}}$, $y_L = \frac{T_* \, R^{L}_{21}}{T_L \, A_{21}}$, $T_* = \frac{h \, \nu_{21}}{k}$ and $R_{21}$ is the rate of transitions caused by radio background, collisions and $\rm{Ly} \alpha$ respectively (see Field (1958) Field 1958 for details).

Under normal conditions in the interstellar medium (ISM), especially for the cold (100 K) phase, collisions dominate the spin temperature of the neutral hydrogen. Only under special circumstances is the spin temperature affected by any of the other two excitation mechanisms. For example, very close to an AGN the ambient radiation field may become strong enough to have make appreciable contribution to the spin temperature. In the very early universe, even before the epoch of re-ionization (EoR), the universe was dense enough that resonant scattering of $\rm{Ly} \alpha$ photons can contribute significantly.

Observable Quantities

The next step is to relate physical properties of the emitting or absorbing neutral hydrogen to observable quantities such as the brightness temperature or flux density. Synthesis images provide measured intensities as flux densities in units of Jy/beam. In order to convert these to brightness temperatures, which can be directly linked to physically meaningful parameters such as the spin temperature $T_s$, we need to take into account the beam or point spread function with which the image was made. Mathematically the conversion can be expressed as:

$$ T_B= \frac{\lambda^2 \, S}{2 \, k_B \, \Omega} $$

where $T_B$ is the brightness temperature, $\lambda$ the observing wavelength, $S$ the flux density (in mJy), $k_B$ Bolzmann’s constant and $\Omega$ the beam solid angle. This equation can be simplified for the case of a pure Gaussian beam with full width at half maximum (FWHM) $a$ and $b$ (in arcsecs) and a wavelength of 21.1 cm (the rest wavelength of the HI line) into:

$$ T_B = \frac{606 \times S}{a \times b} $$

The brightness temperature is related to the HI spin temperature as:

$$ T_B = T_c e^{-\tau_\nu} + T_s (1 - e^{-\tau_\nu}). $$

Where $T_c$ is the brightness temperature of a background continuum source, located behind the HI emitting region which has optical depth $\tau_\nu$. In the absence of background continuum radiation this simplifies into: $ T_B = \tau_\nu \, T_s $ for $\tau_\nu \ll 1$, and $T_B = T_s$ for $\tau_\nu \gg 1$. So, in the case of large optical depth one directly measures the spin temperature. The usual situation in the ISM is the former, small optical depth.

The optical depth $\tau_\nu$ of neutral hydrogen is the integration of the absorption coefficient $\alpha_\nu$ over the line of sight and can be expressed as:

$$ \tau_\nu = \int\alpha_\nu ds = \int\frac{h \, \nu}{4\pi} \, (n_1 \, B_{12} - n_2 \, B_{21}) \, \phi(\nu) ds = \int\frac{h \, \nu}{4\pi} \, n_1 \, B_{12}(1 - e^{-\frac{h \nu}{kT_s}})\phi(\nu)ds $$

where $\phi$($\nu$) is the normalized profile shape. In general, the line profile shape is given by Voigt function which is the convolution of a Lorenztian (due to natural and collisional broadening) and a Gaussian (due to thermal broadening) function. In the low optical depth regime, the line profile shape is dominated by a Gaussian whereas in the high optical depth regime the damping wings due to the Lorentzian dominate (see the section on Damped $\rm{Ly}\alpha$ systems [LINK HERE]).

For long wavelengths, we are in the regime where $h \, \nu << k \, T_s$ so that the above equation can be rewritten in terms of the column density $N_{HI}(\nu)$, in $\rm{atoms} \, \rm{cm}^{-2}$ along the line of sight:

$$ \tau_\nu = \int \frac{h^2 \, \nu^2}{4\pi k \, T_s} \, n_1 \, B_{12} ds \approx \frac{N_{HI}(\nu)}{T_s}. $$

So, in the case of optically thin HI the measured brightness tempretaure is directly proportional to the HI column density via:

$$ T_B \approx \tau_\nu \, T_s = C^{'} \, N_{HI} \, (\nu). $$

To get the total column density we need to integrate over the line, i.e. frequency:

$$ N_{HI} = C \int \tau_\nu \, T_s = 1.823 \times 10^{18} \int T_B(v)dv \, \, \, \rm{cm}^{-2}, $$

where the last integral is over velocity ($v$) and not frequency ($\nu$).

With the assumption that the HI emitting regions are optically thin, the total mass of a galaxy is related to the observed total integrated flux density:

$$ M(HI) = 2.343\times 10^5 M_\odot \, (1 + z) \, (\frac{D_L}{\rm{Mpc}})^2 (\frac{F_{obs}}{\rm{Jy} \, \rm{km} \, \rm{s}^{-1}}). $$

Here, $z$ is the redshift of the galaxy, $D_L$ is the luminosity distance and $F_{obs}$ is the total integrated flux density.

One can rewrite the original equation in terms of a continuum subtracted line signal $T_L$:

$$ T_L = T_B - T_c = (1 - e^{-\tau_\nu})(T_s - T_c). $$

In the case of strong background continuum (i.e. $T_s \ll T_c$) the signal is dominated by the first term: $T_L \approx (e^{-\tau_\nu} -1) T_c$ and since we can determine $T_c$ independently at a frequency which is free of line emission, it is possible to determine $\tau_\nu$. If in addition we have a measure of the line emission along a sight line just off the continuum source we also have a measure of $(1 - e^{-\tau_\nu}) T_s$ and hence can determine $T_s$ assuming the column density in the sight lines on and off the continuum is the same.

Measurements of absorption and emission along very similar lines of sight, performed with the Parkes radio telescope in the early seventies led V. Radhakrishnan to propose that there is a two component ISM (Interstellar Medium) consisting of cool (~100 K), dense gas and warm (1000-10,000 K), tenuous gas, in pressure equilibrium. It turned out that the emission profiles had broad emission wings, which were not found in the absorption measurements. Radhakrishnan interpreted the broad emission wings as evidence for a warm component, which has very low optical depth and is the ubiquitous medium in which the cool, absorbing HI clouds are embedded. The figure below illustrates the observations that were used to draw this conclusion. An additional theoretical argument was that the tenous warm medium is needed to prevent the cool clouds from disruption and dispersion.

Figure taken from Radhakrishnan, V. 1974, IAU Symp. 60, 3-12. HI emission and absorption spectra along four sightlines in the galaxy. The wings of the emission profiles are not seen in absorption. The left panel are sightlines at intermediate latitude, the right panel sightlines in the galactic plane. Here the HI column is so long that it goes into self absorption.

The theoretical support for this interpretation was a detailed study of the heating and cooling of the ISM by Field, Goldsmith & Habing (1969) Field, Goldsmith & Habing (1969) and Goldsmith, Habing & Field (1969) Goldsmith, Habing & Field (1969) who made a self-consistent model of the ISM assuming that it is in pressure equilibrium. They showed that there are two stable phases, a cool and a warm phase with temperatures consistent with the observations. Recent observations of HI 21-cm emission and absorption in the Milky Way continue to identify bulk of HI to be distributed in two distinct phases: Cold Neutral Medium (CNM; T~100 K) and Warm Neutral Medium (WNM; T~8000 K) but find that as much as 50% of the WNM could be in the thermally unstable regime (500 - 5000 K; Heiles & Troland (2003) Heiles & Troland 2003).