Computing Galactic Orbits of Stars with Gala

Authors

Adrian Price-Whelan, Stephanie T. Douglas

Learning Goals

Query the Gaia data release 2 catalog to retrieve data for a sample of well-measured, nearby stars
Define high-mass and low-mass stellar samples using color-magnitude selections
Calculate orbits of the high-mass and low-mass stars within the Galaxy to show that the typically younger stars (high-mass) have smaller vertical excursions

Keywords

coordinates, astroquery, gala, galactic dynamics, astrometry, matplotlib, scatter plot, histogram

Companion Content

Astropy Docs: Description of the Galactocentric frame in astropy coordinates

Summary

We will use data from the Gaia mission to get sky positions, distances (parallaxes), proper motions, and radial velocities for a set of stars that are close to the Sun. We will then transform these observed, heliocentric kinematic measurements to Galactocentric Cartesian coordinates and use the positions and velocities as initial conditions to compute the orbits of these stars in the galaxy using the gala Python package. We will compare the orbits of high-mass main sequence (i.e. young) stars to the orbits of lower-mass main sequence stars to show that young stars have smaller vertical amplitudes.

Installing Dependencies

This tutorial depends on the Astropy affiliated packages gala and astroquery. Both of these packages can be pip-installed with:

pip install astro-gala astroquery

Imports



In [ ]:

    
# astropy imports
import astropy.coordinates as coord
from astropy.table import QTable
import astropy.units as u
from astroquery.gaia import Gaia

# Third-party imports
import matplotlib as mpl
import matplotlib.pyplot as plt
import numpy as np
%matplotlib inline

# gala imports
import gala.coordinates as gc
import gala.dynamics as gd
import gala.potential as gp
from gala.units import galactic

Scientific Background

The Gaia mission is an ESA mission that aims to measure the 3D positions and velocities of a large number of stars throughout the Milky Way. The primary mission objective is to enable studying the formation, structure, and evolutionary history of our Galaxy by measuring astrometry (sky position, parallax, and proper motion) for about 2 billion stars brighter than the Gaia $G$-band photometric magnitude $G \lesssim 21$. By end of mission (~2022), Gaia will also provide multi-band photometry and low-resolution spectrophotometry for these sources, along with radial or line-of-sight velocities for a subsample of about 100 million stars.

In April 2018, Gaia publicly released its first major catalog of data — data release 2 or DR2 — which provides a subset of these data to anyone with an internet connection. In this tutorial, we will use astrometry, radial velocities, and photometry for a small subset of DR2 to study the kinematics of different types of stars in the Milky Way.

Using `astroquery` to retrieve Gaia data

We'll start by querying the Gaia science archive to download astrometric and kinematic data (parallax, proper motion, radial velocity) for a sample of stars near the Sun. We'll use data exclusively from data release 2 (DR2) from the Gaia mission. For the demonstration here, let's grab data for a random subset of 4096 stars within a distance of 100 pc from the Sun that have high signal-to-noise astrometric measurements.

To perform the query and to retrieve the data, we'll use the Gaia module in the astroquery package, astroquery.gaia. This module expects us to provide an SQL query to select the data we want (technically it should be an ADQL query, which is similar to SQL but provides some additional functionality for astronomy; to learn more about ADQL syntax and options, this guide provides an introduction). We don't need all of the columns that are available in DR2, so we'll limit our query to request the sky position (ra, dec), parallax, proper motion components (pmra, pmdec), radial velocity, and magnitudes (phot_*_mean_mag). More information about the available columns is in the Gaia DR2 data model.

To select stars that have high signal-to-noise parallaxes, we'll use the filter parallax_over_error > 10 to select stars that have small fractional uncertainties. We'll also use the filter radial_velocity IS NOT null to only select stars that have measured radial velocities.



In [ ]:

    
query_text = '''SELECT TOP 4096 ra, dec, parallax, pmra, pmdec, radial_velocity,
phot_g_mean_mag, phot_bp_mean_mag, phot_rp_mean_mag
FROM gaiadr2.gaia_source
WHERE parallax_over_error > 10 AND
    parallax > 10 AND
    radial_velocity IS NOT null
ORDER BY random_index
'''

We now pass this query to the Gaia.launch_job() class method to create an anonymous job in the Gaia science archive to run our query. To retrieve the results of this query as an Astropy Table object, we then use the job.get_results() method. Note that you may receive a number of warnings (output lines that begin with WARNING:) from the astropy.io.votable package — these are expected, and it's OK to ignore these warnings (the Gaia archive returns a slightly invalid VOTable).



In [ ]:

    
# Note: the following lines require an internet connection, so we have 
# provided the results of this query as a FITS file included with the 
# tutorials repository. If you have an internet connection, feel free
# to uncomment these lines to retrieve the data with `astroquery`:
# job = Gaia.launch_job(query_text)
# gaia_data = job.get_results()
# gaia_data.write('gaia_data.fits')



In [ ]:

    
gaia_data = QTable.read('gaia_data.fits')

The data object is now an Astropy Table called gaia_data that contains Gaia data for 4096 random stars within 100 pc (or with a parallax > 10 mas) of the Sun, as we requested. Let's look at the first four rows of the table:



In [ ]:

    
gaia_data[:4]

Note that the table columns already contain units! They are indicated in the second row of the header.

Using `astropy.coordinates` to represent and transform stellar positions and velocities

Let's double check that the farthest star is still within 100 pc, as we expect from the parallax selection we did in the query above. To do this, we'll create an Astropy Distance object using the parallax (Note: this inverts the parallax to compute the distance! This is only a good approximation when the parallax signal to noise is large, as we ensured in the query above with parallax_over_error > 10):



In [ ]:

    
dist = coord.Distance(parallax=u.Quantity(gaia_data['parallax']))
dist.min(), dist.max()

It looks like the closest star in our sample is about 9 pc away, and the farthest is almost 100 pc, as we expected.

We next want to convert the coordinate position and velocity data from heliocentric, spherical values to Galactocentric, Cartesian values. We'll do this using the Astropy coordinates transformation machinery. To make use of this functionality, we first have to create a SkyCoord object from the Gaia data we downloaded. The Gaia DR2 data are in the ICRS (equatorial) reference frame, which is also the default frame when creating new SkyCoord objects, so we don't need to specify the frame below:



In [ ]:

    
c = coord.SkyCoord(ra=gaia_data['ra'], 
                   dec=gaia_data['dec'],
                   distance=dist,
                   pm_ra_cosdec=gaia_data['pmra'], 
                   pm_dec=gaia_data['pmdec'],
                   radial_velocity=gaia_data['radial_velocity'])

Note: as described in the Gaia DR2 data model, the Gaia column pmra contains the cos(dec) term. In Astropy coordinates, the name of this component is pm_ra_cosdec.

Let's again look at the first four coordinates in the SkyCoord object:



In [ ]:

    
c[:4]

Now that we have a SkyCoord object with the Gaia data, we can transform to other coordinate systems. For example, we can transform to the Galactic coordinate system (centered on the Sun but with the zero latitude approximately aligned with the Galactic plane) using the .galactic attribute (this works for any of the built-in Astropy coordinate frames, e.g., .fk5 should also work):



In [ ]:

    
c.galactic[:4]

The Galactic frame is still centered on the solar system barycenter, whereas we want to compute the positions and velocities of our sample of stars in a Galactocentric frame, centered on the center of the Milky Way. To do this transformation, Astropy provides the Galactocentric frame class, which allows us to use our own conventions for, e.g., the distance from the sun to the Galactic center (galcen_distance) or the height of the Sun over the Galactic midplane (z_sun). Let's look at the default values for the solar position and velocity:



In [ ]:

    
coord.Galactocentric()

We'll instead use a distance of 8.1 kpc — more consistent with the recent results from the GRAVITY collaboration — and a solar height of 0 pc. We'll use the default solar velocity (see output above). We can transform our data to this frame using the transform_to() method by specifying the Galactocentric frame with our adopted values:



In [ ]:

    
galcen = c.transform_to(coord.Galactocentric(z_sun=0*u.pc, 
                                             galcen_distance=8.1*u.kpc))

The galcen object now contains the data for our sample, but in the Galactocentric frame:



In [ ]:

    
galcen[:4]

We can access the positions of the stars using the .x, .y, and .z attributes, for example:



In [ ]:

    
plt.hist(galcen.z.value, bins=np.linspace(-110, 110, 32))
plt.xlabel('$z$ [{0:latex_inline}]'.format(galcen.z.unit));

Similarly, for the velocity components, we can use .v_x, .v_y, and .v_z. For example, to create a classic "UV" plane velocity plot:



In [ ]:

    
fig, ax = plt.subplots(1, 1, figsize=(6, 6))

ax.plot(galcen.v_x.value, galcen.v_y.value,
        marker='.', linestyle='none', alpha=0.5)

ax.set_xlim(-125, 125)
ax.set_ylim(200-125, 200+125)

ax.set_xlabel('$v_x$ [{0:latex_inline}]'.format(u.km/u.s))
ax.set_ylabel('$v_y$ [{0:latex_inline}]'.format(u.km/u.s))

Along with astrometric and radial velocity data, Gaia also provides photometric data for three photometric bandpasses: the broad-band G, the blue BP, and the red RP magnitudes. Let's make a Gaia color-magnitude diagram using the $G_{\rm BP}-G_{\rm RP}$ color and the absolute $G$-band magnitude $M_G$. We'll compute the absolute magnitude using the distances we computed earlier — Astropy Distance objects have a convenient .distmod attribute that provides the distance modulus:



In [ ]:

    
M_G = gaia_data['phot_g_mean_mag'] - dist.distmod
BP_RP = gaia_data['phot_bp_mean_mag'] - gaia_data['phot_rp_mean_mag']



In [ ]:

    
fig, ax = plt.subplots(1, 1, figsize=(6, 6))

ax.plot(BP_RP, M_G, 
        marker='.', linestyle='none', alpha=0.3)

ax.set_xlim(0, 3)
ax.set_ylim(11, 1)

ax.set_xlabel('$G_{BP}-G_{RP}$')
ax.set_ylabel('$M_G$')

In the above, there is a wide range of main sequence star masses which have a range of lifetimes. The most massive stars were likely born in the thin disk and their orbits therefore likely have smaller vertical amplitudes than the typical old main sequence star. To compare, we'll create two sub-selections of the Gaia CMD to select massive and low-mass main sequence stars from the CMD for comparison. You may see two RuntimeWarning(s) from running the next cell — these are expected and it's safe to ignore them.



In [ ]:

    
np.seterr(invalid="ignore")
hi_mass_mask = ((BP_RP > 0.5*u.mag) & (BP_RP < 0.7*u.mag) & 
                (M_G > 2*u.mag) & (M_G < 3.75*u.mag) & 
                (np.abs(galcen.v_y - 220*u.km/u.s) < 50*u.km/u.s))

lo_mass_mask = ((BP_RP > 2*u.mag) & (BP_RP < 2.4*u.mag) & 
                (M_G > 8.2*u.mag) & (M_G < 9.7*u.mag) &
                (np.abs(galcen.v_y - 220*u.km/u.s) < 50*u.km/u.s))

Let's also define default colors to use when visualizing the high- and low-mass stars:



In [ ]:

    
hi_mass_color = 'tab:red'
lo_mass_color = 'tab:purple'

Let's now visualize these two CMD selections:



In [ ]:

    
fig, ax = plt.subplots(1, 1, figsize=(6, 6))

ax.plot(BP_RP, M_G, 
        marker='.', linestyle='none', alpha=0.1)

for mask, color in zip([lo_mass_mask, hi_mass_mask],
                       [lo_mass_color, hi_mass_color]):
    ax.plot(BP_RP[mask], M_G[mask], 
            marker='.', linestyle='none', 
            alpha=0.5, color=color)

ax.set_xlim(0, 3)
ax.set_ylim(11, 1)

ax.set_xlabel('$G_{BP}-G_{RP}$')
ax.set_ylabel('$M_G$')

Thus far, we've used the color-magnitude diagram (using parallaxes and photometry from Gaia to compute absolute magnitudes) to select samples of high- and low-mass stars based on their colors.

In what follows, we'll compute Galactic orbits for stars in the high- and low-mass star selections above and compare.

Using `gala` to numerically integrate Galactic stellar orbits

gala is an Astropy affiliated package for Galactic dynamics. gala provides functionality for representing analytic mass models that are commonly used in Galactic dynamics contexts for numerically integrating stellar orbits. For examples, see Chapter 3 of Binney and Tremaine (2008). The gravitational potential models are defined by specifying parameters like mass, scale radii, or shape parameters and can be combined. Once defined, they can be used in combination with numerical integrators provided in gala to compute orbits. gala comes with a pre-defined, multi-component, but simple model for the Milky Way that can be used for orbit integrations. Let's create an instance of the MilkyWayPotential model and integrate orbits for the high- and low-mass main sequence stars selected above:



In [ ]:

    
milky_way = gp.MilkyWayPotential()
milky_way

This model has mass components for the Galactic disk, bulge, nucleus, and halo, and the parameters were defined by fitting measurements of the Milky Way enclosed mass at various radii. See this document for more details. The parameters of the MilkyWayPotential can be changed by passing in a dictionary of parameter values to argument names set by the component names. For example, to change the disk mass to make it slightly more massive (the choice 8e10 is arbitrary!):



In [ ]:

    
different_disk_potential = gp.MilkyWayPotential(disk=dict(m=8e10*u.Msun))
different_disk_potential

To integrate orbits, we have to combine the mass model with a reference frame into a Hamiltonian object. If no reference frame is passed in, it's assumed that we are in a static inertial frame moving with the center of the mass model:



In [ ]:

    
H = gp.Hamiltonian(milky_way)

Now that we have the mass model, we can integrate orbits. Let's now define initial conditions for subsets of the high- and low-mass star selections we did above. Initial conditions in gala are specified by creating PhaseSpacePosition objects. We can create these objects directly from a Galactocentric object, like we have defined above from transforming the Gaia data — we first have to extract the data with a Cartesian representation. We can do this by calling galcen.cartesian:



In [ ]:

    
w0_hi = gd.PhaseSpacePosition(galcen[hi_mass_mask].cartesian)
w0_lo = gd.PhaseSpacePosition(galcen[lo_mass_mask].cartesian)
w0_hi.shape, w0_lo.shape

From the above, we can see that we have 185 high-mass star and 577 low-mass stars in our selections. To integrate orbits, we call the .integrate_orbit() method on the Hamiltonian object we defined above, and pass in initial conditions. We also have to specify the timestep for integration, and how long we want to integrate for. We can do this by either specifying the amount of time to integrate for, or by specifying the number of timesteps. Let's specify a timestep of 1 Myr and a time of 500 Myr (approximately two revolutions around the Galaxy for a Sun-like orbit):



In [ ]:

    
orbits_hi = H.integrate_orbit(w0_hi, dt=1*u.Myr, 
                              t1=0*u.Myr, t2=500*u.Myr)

orbits_lo = H.integrate_orbit(w0_lo, dt=1*u.Myr, 
                              t1=0*u.Myr, t2=500*u.Myr)

By default this uses a Leapfrog numerical integration scheme, but the integrator can be customized — see the gala examples for more details.

With the orbit objects in hand, we can continue our comparison of the orbits of high-mass and low-mass main sequence stars in the solar neighborhood. Let's start by plotting a few orbits. The .plot() convenience function provides a quick way to visualize orbits in three Cartesian projections. For example, let's plot the first orbit in each subsample on the same figure:



In [ ]:

    
fig = orbits_hi[:, 0].plot(color=hi_mass_color)
_ = orbits_lo[:, 0].plot(axes=fig.axes, color=lo_mass_color)

Note in the above figure that the orbits are almost constrained to the x-y plane: the excursions are much larger in the x and y directions as compared to the z direction.

The default plots show all Cartesian projections. This can be customized to, for example, only show specified components (including velocity components):



In [ ]:

    
fig = orbits_hi[:, 0].plot(['x', 'v_x'], 
                           auto_aspect=False, 
                           color=hi_mass_color)

The representation can also be changed, for example, to a cylindrical representation:



In [ ]:

    
fig = orbits_hi[:, 0].cylindrical.plot(['rho', 'z'], 
                                       color=hi_mass_color,
                                       label='high mass')
_ = orbits_lo[:, 0].cylindrical.plot(['rho', 'z'], color=lo_mass_color,
                                     axes=fig.axes,
                                     label='low mass')

fig.axes[0].legend(loc='upper left')
fig.axes[0].set_ylim(-0.3, 0.3)

Already in the above plot we can see that the high-mass star has an orbit with smaller eccentricity (smaller radial variations) and smaller vertical oscillations as compared to the low-mass star. Below, we'll quantify this and look at the vertical excursions of all of the high- and low-mass stars, respectively.

Let's now compare the vertical amplitudes of the orbits in each of our sub-selections! We can compute the (approximate) maximum vertical height of each orbit using the convenience method .zmax() (you can see a list of all convenience methods on the Orbit object in the Gala documentation here):



In [ ]:

    
zmax_hi = orbits_hi.zmax(approximate=True)
zmax_lo = orbits_lo.zmax(approximate=True)

Let's make histograms of the maximum $z$ heights for these two samples:



In [ ]:

    
bins = np.linspace(0, 2, 50)

plt.hist(zmax_hi.value, bins=bins, 
         alpha=0.4, density=True, label='high-mass', 
         color=hi_mass_color)
plt.hist(zmax_lo.value, bins=bins, 
         alpha=0.4, density=True, label='low-mass',
         color=lo_mass_color);

plt.legend(loc='best', fontsize=14)

plt.yscale('log')
plt.xlabel(r"$z_{\rm max}$" + " [{0:latex}]".format(zmax_hi.unit))

The distribution of $z$-heights for the low-mass (i.e. typically older) stars is more extended, as we predicted!

In this tutorial, we've used astroquery to query the Gaia science archive to retrieve kinematic and photometric data for a small sample of stars with well-measured parallaxes from Gaia DR2. We used the colors and absolute magnitudes of these stars to select subsamples of high- and low-mass stars, which, on average, will provide us with subsamples of stars that are younger and older, respectively. We then constructed a model for the gravitational field of the Milky Way and numerically integrated the orbits of all stars in each of the two subsamples. Finally, we used the orbits to compute the maximum height that each star reaches above the Galactic midplane and showed that the younger (higher-mass) stars tend to have smaller excursions from the Galactic plane, consistent with the idea that stars are either born in a "thinner" disk and dynamically "heated," or that older stars formed with a larger vertical scale-height.

Exercises

Some of the low-mass star orbits have large vertical excursions from the Galactic disk (up to and above 1.5 kpc) and could therefore be stellar halo stars rather than part of the Galactic disk. Use the zmax values to select a few of these stars and plot their full orbits. Do these stars look like they are part of the disk? Why / why not?
Orbit objects also provide methods for computing apocenter and pericenter distances and eccentricities. Which types of stars (high-mass or low-mass) tend to have high eccentricity orbits within the Galaxy? Similar to the plot above, make a plot showing the two distributions of eccentricity values.



In [ ]: