A short introduction to Gaia Archive: ADQL & TAP by examples

Morgan Fouesneau

Notebook configuration

Below we will do some plotting. The following commands make the figures look a bit nicer (all personal libraries are included with this notebook).



In [1]:

    
# Loading configuration
# Don't forget that mac has this annoying configuration that leads
# to limited number of figures/files
# ulimit -n 4096    <---- osx limits to 256
import warnings
# Silence all warnings for the rest of the notebook.
# (catch_warnings() has no effect unless used as a context manager.)
warnings.simplefilter("ignore")

%pylab inline
%config InlineBackend.figure_format='retina'

import pylab as plt
import numpy as np
import figrc, setup_mpl
setup_mpl.theme()
setup_mpl.solarized_colors()









    



Populating the interactive namespace from numpy and matplotlib

Interrogating the Gaia Archive

Gaia Archive website: https://gea.esac.esa.int/archive/

The entry point is a TAP (Table Access Protocol) server.

TAP provides two operation modes, Synchronous and Asynchronous.

Synchronous: the response to the request is generated as soon as the request is received by the server.
Asynchronous: the server will start a job that will execute the request. The first response to the request is the required information (a link) to obtain the job status. Once the job is finished, the results can be retrieved.
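
As a rough sketch of what a synchronous request looks like on the wire, the snippet below only builds the request URL, without sending anything. The parameter names REQUEST, LANG, FORMAT and QUERY come from the TAP standard; the base URL is assumed from the job locations printed later in this notebook, and build_sync_url is a hypothetical helper used for illustration only.

```python
from urllib.parse import urlencode

# Base URL assumed from the job locations printed later in this notebook.
TAP_BASE = "http://gea.esac.esa.int/tap-server/tap"


def build_sync_url(adql, base=TAP_BASE, fmt="votable"):
    """Return the URL of a synchronous TAP request (hypothetical helper)."""
    params = {
        "REQUEST": "doQuery",   # standard TAP parameter
        "LANG": "ADQL",         # query language
        "FORMAT": fmt,          # output format requested from the server
        "QUERY": adql,          # the ADQL statement itself
    }
    # urlencode takes care of escaping spaces and special characters
    return base + "/sync?" + urlencode(params)


url = build_sync_url("SELECT TOP 5 * FROM gaiadr1.gaia_source")
print(url)
```

An asynchronous request sends the same parameters to the /async endpoint instead, and the server answers with the location of the job rather than with the data.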

Gaia Archive TAP service

The Gaia Archive TAP server provides two access modes: public and authenticated.

Public: this is the standard TAP access. A user can execute ADQL queries and upload tables to be used in a query 'on-the-fly' (these tables are removed once the query is executed). The results are available to any other user and remain on the server for a limited period of time.
Authenticated: some functionalities are restricted to authenticated users only. ADQL queries and results are saved in a private user area and remain on the server indefinitely (until the user removes them).
Cross-match operations: a catalogue cross-match operation can be executed. Its results are saved in a private user area.

What is ADQL?

ADQL = Astronomical Data Query Language

ADQL has been developed based on SQL92 and supports a subset of the SQL grammar with extensions to support generic and astronomy specific operations.

In other words, ADQL is an SQL-like query language extended with geometrical functions.

For more information, see the IVOA documentation: http://www.ivoa.net/documents/latest/ADQL.html

Examples of minimal ADQL queries

SELECT *
FROM gaiadr1.tgas_source

SELECT top 1000 ra, dec, phot_g_mean_mag AS mag  
FROM gaiadr1.gaia_source
ORDER BY mag

Basic Keywords

TOP limits the number of records to display
ORDER BY sorts records in ascending (ASC, default) or descending (DESC)
WHERE filters records according to logical expressions
IN, NOT IN operator that can determine whether a value is (not) within a given set
BETWEEN x AND y operator can determine whether a value is within a given interval
LIKE operator allows for a partial comparison. It uses the wild cards % and _ ('percent' and 'underscore'). The wild card % replaces any string of characters, including the empty string. The underscore replaces exactly one character.
GROUP BY groups records by identical values (or set of values)
= or > or < or >= or <= or <>, different operators of logical comparisons
+, -, *, /, compute columns using mathematical operations
POWER(column_name, n) returns values raised to the power n. n must be an integer, positive or negative.
SQRT(column_name) returns the square root of values.
CEILING(column_name) rounds up to the nearest integer value.
FLOOR(column_name) rounds down to the nearest integer value.
ABS(column_name) returns the absolute value.
AVG(column_name) returns the average value of a column for a group of rows.
COUNT(column_name) returns the number of rows in a group for which the reference column is not NULL.
SUM(column_name) returns the sum of the values of a column for a group of rows.
MAX, MIN return the largest or smallest value of a column for a group of rows.
COS, SIN, TAN compute the trigonometric functions of an angle in radians; ACOS, ASIN, ATAN compute their inverses and return an angle in radians.
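
To make the keywords above more concrete, here is a minimal sketch that assembles a query combining TOP, WHERE, BETWEEN, ORDER BY and a computed column. The helper build_select is hypothetical (it is not part of the tap package), and the column names are assumed to exist in gaiadr1.tgas_source as they do in the gaia_source output shown later.

```python
def build_select(table, columns, where=None, order_by=None, top=None):
    """Assemble a simple ADQL SELECT statement (illustrative helper)."""
    parts = ["SELECT"]
    if top is not None:
        parts.append("TOP {0:d}".format(top))      # limit the number of rows
    parts.append(", ".join(columns))
    parts.append("FROM " + table)
    if where:
        parts.append("WHERE " + " AND ".join(where))  # all conditions must hold
    if order_by:
        parts.append("ORDER BY " + order_by)
    return " ".join(parts)


query = build_select(
    "gaiadr1.tgas_source",
    ["source_id", "phot_g_mean_mag",
     "SQRT(POWER(pmra, 2) + POWER(pmdec, 2)) AS pm"],   # total proper motion
    where=["phot_g_mean_mag BETWEEN 8 AND 10", "parallax > 0"],
    order_by="pm DESC",
    top=100,
)
print(query)
```

The resulting string can be passed to the query interface like any hand-written ADQL statement.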

Geometries and geometrical functions

ADQL provides a set of 2D-functions and geometries or regions

A region is always attached to a coordinate system: FK4, FK5, ICRS, GALACTIC. The coordinates, expressed in degrees, can be constants or the result of a mathematical expression.

POINT('coordinate system', right ascension, declination) expresses a point source on the sky
CIRCLE('coordinate system',right ascension center, declination center, radius in degrees) expresses a circular region on the sky (a cone in space)
BOX('coordinate system', right ascension center, declination center, width, height) defines a centered box
POLYGON('coordinate system', coordinate point 1, coordinate point 2, coordinate point 3...) expresses a region on the sky with sides denoted by great circles passing through specified list of POINT objects.
DISTANCE(point1, point2) computes the angular distance between two points, in degrees.
CONTAINS(region1, region2) returns 1 if region2 contains region1, 0 otherwise.
INTERSECTS(region1, region2) returns 1 if region2 intersects region1, 0 otherwise.
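
As an illustration of how POINT, CIRCLE and CONTAINS fit together, the sketch below formats a cone-search query. The helper cone_search is hypothetical, and it assumes that the target table exposes its coordinates in columns named ra and dec, both in degrees.

```python
def cone_search(table, ra, dec, radius, columns="*", frame="ICRS"):
    """Return an ADQL cone-search query; all angles are in degrees."""
    template = (
        "SELECT {columns} FROM {table} "
        "WHERE CONTAINS(POINT('{frame}', ra, dec), "
        "CIRCLE('{frame}', {ra:.6f}, {dec:.6f}, {radius:.4f})) = 1"
    )
    return template.format(columns=columns, table=table, frame=frame,
                           ra=ra, dec=dec, radius=radius)


# Stars within 2 degrees of (ra, dec) = (10.6847083, 41.26875)
query = cone_search("gaiadr1.gaia_source", 10.6847083, 41.26875, 2.0)
print(query)
```

The comparison with 1 is needed because CONTAINS returns a number rather than a boolean.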

Current python package

Some common code to send ADQL Queries to TAP services and notebook polishing

GaiaArchive is a shortcut from TAP_service to the interface with the Gaia Archive
TAPVizieR is a shortcut from TAP_service to the interface with TAP service of VizieR (CDS).
resolve queries the CDS/Sesame name resolver to get the positions of known objects by name.
QueryStr is a polished string that parses the SQL syntax to make it look nicer (for notebook and console).
timeit is a context manager/decorator that reports execution time (for notebook and console).



In [2]:

    
from tap import (GaiaArchive, TAPVizieR, resolve, QueryStr, timeit)

Quick Start: How to query TAP with this package?

Let's start by checking that we can access the data by requesting the first 5 sources of the Gaia DR1 source table.

Synchronous mode

Get the service and submit the query. The result will be downloaded automatically.



In [3]:

    
gaia = GaiaArchive()

Select a small number of rows in the Gaia DR1 data table.

Note: we use QueryStr only to make the query easier to read. A plain string would work as well.



In [4]:

    
adql = QueryStr("""
select top 5 * from gaiadr1.gaia_source
""")









    




ADQL query
SELECT top 5 * FROM gaiadr1.gaia_source



In [5]:

    
from tap import GaiaArchive
selection = ','.join(['avg({0:s}) as avg_{0:s}'.format(k) 
                      for k in gaia.get_table_info('gaiadr1.tgas_source').keys() 
                      if ('error' in k) or ('corr' in k)])

data = GaiaArchive().query(QueryStr("""
select {0:s} from gaiadr1.tgas_source
""".format(selection)))
r = data.as_array().data
cor = np.array(eval("""
[
    [avg_ra_error,         avg_ra_dec_corr,       avg_ra_parallax_corr,    avg_ra_pmra_corr,       avg_ra_pmdec_corr],
    [avg_ra_dec_corr,      avg_dec_error,         avg_dec_parallax_corr,   avg_dec_pmra_corr,      avg_dec_pmdec_corr],
    [avg_ra_parallax_corr, avg_dec_parallax_corr, avg_parallax_error,      avg_parallax_pmra_corr, avg_parallax_pmdec_corr],
    [avg_ra_pmra_corr,     avg_dec_pmra_corr,     avg_parallax_pmra_corr,  avg_pmra_error,         avg_pmra_pmdec_corr],
    [avg_ra_pmdec_corr,    avg_dec_pmdec_corr,    avg_parallax_pmdec_corr, avg_pmra_pmdec_corr,    avg_pmdec_error]
]
""", {k:float(r[k]) for k in r.dtype.names}))

cov = cor.copy()

pars = r'$\alpha$ $\delta$ $\varpi$ $\mu_{\alpha\star}$ $\mu_\delta$'.split()

for k in range(len(cor)):
    val = cor[k, k]
    cov[k, :] *= val
    cov[:, k] *= val
    cov[k, k] /= val

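# plot correlations (the diagonal is set to NaN so it does not drive the color scale)
for k in range(len(cor)): 
    cor[k, k] = np.nan
# nanmax ignores the NaN diagonal; np.max would return NaN here
lim = np.nanmax(np.abs(cor))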
plt.matshow(cor, vmin=-lim, vmax=lim, cmap=plt.cm.RdBu_r)
for k, l in enumerate(pars):
    plt.text(k, k, l, ha='center', va='center')
plt.colorbar(shrink=0.7).set_label(r'correlation $\rho_{x,y}$')
plt.xticks([])
plt.yticks([])
plt.savefig('tgas_correlation.pdf', bbox_inches='tight')

plt.figure()
# plot covariances (diagonal set to NaN as for the correlations)
for k in range(len(cor)): 
    cov[k, k] = np.nan
lim = np.nanmax(np.abs(cov))
plt.matshow(cov, vmin=-lim, vmax=lim, cmap=plt.cm.RdBu_r)
pars = r'$\alpha$ $\delta$ $\varpi$ $\mu_{\alpha\star}$ $\mu_\delta$'.split()
for k, l in enumerate(pars):
    plt.text(k, k, l, ha='center', va='center')
plt.colorbar(shrink=0.7).set_label(r'covariance $\rho_{x,y}\sigma_x\sigma_y$')
plt.xticks([])
plt.yticks([]);
plt.savefig('tgas_covariance.pdf', bbox_inches='tight')









    




ADQL query
SELECT avg(ra_error) AS avg_ra_error,avg(dec_error) AS avg_dec_error,avg(parallax_error) AS avg_parallax_error,avg(pmra_error) AS avg_pmra_error,avg(pmdec_error) AS avg_pmdec_error,avg(ra_dec_corr) AS avg_ra_dec_corr,avg(ra_parallax_corr) AS avg_ra_parallax_corr,avg(ra_pmra_corr) AS avg_ra_pmra_corr,avg(ra_pmdec_corr) AS avg_ra_pmdec_corr,avg(dec_parallax_corr) AS avg_dec_parallax_corr,avg(dec_pmra_corr) AS avg_dec_pmra_corr,avg(dec_pmdec_corr) AS avg_dec_pmdec_corr,avg(parallax_pmra_corr) AS avg_parallax_pmra_corr,avg(parallax_pmdec_corr) AS avg_parallax_pmdec_corr,avg(pmra_pmdec_corr) AS avg_pmra_pmdec_corr,avg(phot_g_mean_flux_error) AS avg_phot_g_mean_flux_error FROM gaiadr1.tgas_source








    












    





<matplotlib.figure.Figure at 0x113def9e8>

Now we run the query.

Note: we use timeit in this notebook to indicate how long the operation gaia.query took.



In [6]:

    
timeit(gaia.query)(adql)









    




Execution time: 1.22 s







    Out[6]:




<Table masked=True length=5>

solution_id source_id random_index ref_epoch ra ra_error dec dec_error parallax parallax_error pmra pmra_error pmdec pmdec_error ra_dec_corr ra_parallax_corr ra_pmra_corr ra_pmdec_corr dec_parallax_corr dec_pmra_corr dec_pmdec_corr parallax_pmra_corr parallax_pmdec_corr pmra_pmdec_corr astrometric_n_obs_al astrometric_n_obs_ac astrometric_n_good_obs_al astrometric_n_good_obs_ac astrometric_n_bad_obs_al astrometric_n_bad_obs_ac astrometric_delta_q astrometric_excess_noise astrometric_excess_noise_sig astrometric_primary_flag astrometric_relegation_factor astrometric_weight_al astrometric_weight_ac astrometric_priors_used matched_observations duplicated_source scan_direction_strength_k1 scan_direction_strength_k2 scan_direction_strength_k3 scan_direction_strength_k4 scan_direction_mean_k1 scan_direction_mean_k2 scan_direction_mean_k3 scan_direction_mean_k4 phot_g_n_obs phot_g_mean_flux phot_g_mean_flux_error phot_g_mean_mag phot_variable_flag l b ecl_lon ecl_lat
Time[Julian Years] Angle[deg] Angle[mas] Angle[deg] Angle[mas] Angle[mas] Angle[mas] Angular Velocity[mas/year] Angular Velocity[mas/year] Angular Velocity[mas/year] Angular Velocity[mas/year] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Angle[mas] Angle[mas^-2] Angle[mas^-2] Angle[deg] Angle[deg] Angle[deg] Angle[deg] Flux[e-/s] Flux[e-/s] Magnitude[mag] Dimensionless[see description] Angle[deg] Angle[deg] Angle[deg] Angle[deg]
int64 int64 int64 float64 float64 float64 float64 float64 float64 float64 float64 float64 float64 float64 float32 float32 float32 float32 float32 float32 float32 float32 float32 float32 int32 int32 int32 int32 int32 int32 float32 float64 float64 bool float32 float32 float32 int32 int16 bool float32 float32 float32 float32 float32 float32 float32 float32 int32 float64 float64 float64 object float64 float64 float64 float64
1635378410781933568 4486895915443650432 905350894 2015.0 264.18122510033464 2.8003516165869082 7.0516429876037998 12.9212181467864 -- -- -- -- -- -- -0.077 -- -- -- -- -- -- -- -- -- 43 0 41 0 2 0 -- 3.2780269499321597 2.8301321572232419 False 1.6920748 0.036015101 -- 2 12 False 0.86865312 0.62508136 0.66467804 0.8813076 -59.995804 -51.505093 -43.121849 -44.794041 43 218.43179998323626 3.0698243456034935 19.676480402386268 NOT_AVAILABLE 30.759313884222344 19.806472281755649 263.30404339742091 30.355774143855047
1635378410781933568 4486609183424689920 309752073 2015.0 266.16603765486315 0.55382871997862648 7.6271450028431031 0.65430966160671311 -- -- -- -- -- -- 0.92455 -- -- -- -- -- -- -- -- -- 89 0 89 0 0 0 -- 0.0 0.0 False 1.0 0.6764009 -- 2 19 False 0.46887282 0.5968886 0.31126449 0.8961063 -69.038536 -47.735455 -32.330299 -42.062531 89 885.14509075973967 2.612550689916894 18.15723390135684 NOT_AVAILABLE 32.225950841033132 18.294962138635221 265.56529752291794 31.007424683992905
1635378410781933568 4487954916943816960 1109852082 2015.0 267.43146648900176 3.3732871870793346 8.0916004466735227 4.1776617264829969 -- -- -- -- -- -- 0.84584999 -- -- -- -- -- -- -- -- -- 45 0 43 0 2 0 -- 4.9278360069994429 5.5765534150526719 False 1.6911782 0.018240357 -- 2 27 False 0.50051576 0.27280524 0.28483522 0.85747027 -106.40694 -69.268906 -8.6255636 -41.570301 44 130.70553988687644 2.1173294200763491 20.234035075545016 NOT_AVAILABLE 33.242582035644766 17.372733263801848 267.01707637453728 31.504289333914077
1635378410781933568 4488405063871908096 1085557435 2015.0 268.577179212977 0.45686872022632691 8.5283958071764445 0.66047878586423325 -- -- -- -- -- -- 0.52380002 -- -- -- -- -- -- -- -- -- 80 0 80 0 0 0 -- 0.47314460723313795 0.4344997036814196 False 1.0409509 0.27349713 -- 2 14 False 0.36665735 0.10474015 0.33550045 0.73903608 -104.3985 27.750549 -29.422886 -44.953251 77 492.79505895268744 4.7092584766237948 18.793104202112058 NOT_AVAILABLE 34.171511451972741 16.544154401270578 268.34148147147999 31.959485697633603
1635378410781933568 4488168978109094912 724156643 2015.0 266.9350267767021 0.45472258639757962 8.4501367670511502 0.52313455431024292 -- -- -- -- -- -- 0.92054999 -- -- -- -- -- -- -- -- -- 65 0 65 0 0 0 -- 0.54443893282338873 2.7908803781634761 False 1.2134782 1.1420876 -- 2 16 False 0.35940772 0.74710166 0.26141152 0.91352218 -62.16045 -45.337482 -33.693073 -42.355057 67 2029.0269077666758 3.3669145828795304 17.25655054776395 NOT_AVAILABLE 33.355528435673932 17.971330490608551 266.43026591010408 31.851445954397114

Asynchronous mode

From the same service, use the query_async method. The job will be submitted and accessible later.

Why would this mode be preferred? Some services (incl. the Gaia Archive) strongly limit queries in synchronous mode. For example, the Gaia Archive limits synchronous jobs to 1 minute, mostly to avoid communication issues. Read the documentation of the service you want to use to decide.

Below I use the async mode to redo the exact same query as before.



In [7]:

    
q = gaia.query_async(adql, silent=True)
q   # pretty print display









    Out[7]:




ADQL Query
SELECT top 5 * FROM gaiadr1.gaia_source


Status:   303, Reason 303
Location: http://gea.esac.esa.int/tap-server/tap/async/1494306883754O
Job id:   1494306883754O

One can interrogate the service to know whether the task is complete.



In [8]:

    
q.status









    Out[8]:





'COMPLETED'

Finally we can download the result when available.

(The provided Python interface can either wait for completion or not, through the keyword wait; wait=True is the default behavior.)



In [9]:

    
q.get()









    Out[9]:




<Table masked=True length=5>

solution_id source_id random_index ref_epoch ra ra_error dec dec_error parallax parallax_error pmra pmra_error pmdec pmdec_error ra_dec_corr ra_parallax_corr ra_pmra_corr ra_pmdec_corr dec_parallax_corr dec_pmra_corr dec_pmdec_corr parallax_pmra_corr parallax_pmdec_corr pmra_pmdec_corr astrometric_n_obs_al astrometric_n_obs_ac astrometric_n_good_obs_al astrometric_n_good_obs_ac astrometric_n_bad_obs_al astrometric_n_bad_obs_ac astrometric_delta_q astrometric_excess_noise astrometric_excess_noise_sig astrometric_primary_flag astrometric_relegation_factor astrometric_weight_al astrometric_weight_ac astrometric_priors_used matched_observations duplicated_source scan_direction_strength_k1 scan_direction_strength_k2 scan_direction_strength_k3 scan_direction_strength_k4 scan_direction_mean_k1 scan_direction_mean_k2 scan_direction_mean_k3 scan_direction_mean_k4 phot_g_n_obs phot_g_mean_flux phot_g_mean_flux_error phot_g_mean_mag phot_variable_flag l b ecl_lon ecl_lat
Time[Julian Years] Angle[deg] Angle[mas] Angle[deg] Angle[mas] Angle[mas] Angle[mas] Angular Velocity[mas/year] Angular Velocity[mas/year] Angular Velocity[mas/year] Angular Velocity[mas/year] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Angle[mas] Angle[mas^-2] Angle[mas^-2] Angle[deg] Angle[deg] Angle[deg] Angle[deg] Flux[e-/s] Flux[e-/s] Magnitude[mag] Dimensionless[see description] Angle[deg] Angle[deg] Angle[deg] Angle[deg]
int64 int64 int64 float64 float64 float64 float64 float64 float64 float64 float64 float64 float64 float64 float32 float32 float32 float32 float32 float32 float32 float32 float32 float32 int32 int32 int32 int32 int32 int32 float32 float64 float64 bool float32 float32 float32 int32 int16 bool float32 float32 float32 float32 float32 float32 float32 float32 int32 float64 float64 float64 object float64 float64 float64 float64
1635378410781933568 4486895915443650432 905350894 2015.0 264.18122510033464 2.8003516165869082 7.0516429876037998 12.9212181467864 -- -- -- -- -- -- -0.077 -- -- -- -- -- -- -- -- -- 43 0 41 0 2 0 -- 3.2780269499321597 2.8301321572232419 False 1.6920748 0.036015101 -- 2 12 False 0.86865312 0.62508136 0.66467804 0.8813076 -59.995804 -51.505093 -43.121849 -44.794041 43 218.43179998323626 3.0698243456034935 19.676480402386268 NOT_AVAILABLE 30.759313884222344 19.806472281755649 263.30404339742091 30.355774143855047
1635378410781933568 4486609183424689920 309752073 2015.0 266.16603765486315 0.55382871997862648 7.6271450028431031 0.65430966160671311 -- -- -- -- -- -- 0.92455 -- -- -- -- -- -- -- -- -- 89 0 89 0 0 0 -- 0.0 0.0 False 1.0 0.6764009 -- 2 19 False 0.46887282 0.5968886 0.31126449 0.8961063 -69.038536 -47.735455 -32.330299 -42.062531 89 885.14509075973967 2.612550689916894 18.15723390135684 NOT_AVAILABLE 32.225950841033132 18.294962138635221 265.56529752291794 31.007424683992905
1635378410781933568 4487954916943816960 1109852082 2015.0 267.43146648900176 3.3732871870793346 8.0916004466735227 4.1776617264829969 -- -- -- -- -- -- 0.84584999 -- -- -- -- -- -- -- -- -- 45 0 43 0 2 0 -- 4.9278360069994429 5.5765534150526719 False 1.6911782 0.018240357 -- 2 27 False 0.50051576 0.27280524 0.28483522 0.85747027 -106.40694 -69.268906 -8.6255636 -41.570301 44 130.70553988687644 2.1173294200763491 20.234035075545016 NOT_AVAILABLE 33.242582035644766 17.372733263801848 267.01707637453728 31.504289333914077
1635378410781933568 4488405063871908096 1085557435 2015.0 268.577179212977 0.45686872022632691 8.5283958071764445 0.66047878586423325 -- -- -- -- -- -- 0.52380002 -- -- -- -- -- -- -- -- -- 80 0 80 0 0 0 -- 0.47314460723313795 0.4344997036814196 False 1.0409509 0.27349713 -- 2 14 False 0.36665735 0.10474015 0.33550045 0.73903608 -104.3985 27.750549 -29.422886 -44.953251 77 492.79505895268744 4.7092584766237948 18.793104202112058 NOT_AVAILABLE 34.171511451972741 16.544154401270578 268.34148147147999 31.959485697633603
1635378410781933568 4488168978109094912 724156643 2015.0 266.9350267767021 0.45472258639757962 8.4501367670511502 0.52313455431024292 -- -- -- -- -- -- 0.92054999 -- -- -- -- -- -- -- -- -- 65 0 65 0 0 0 -- 0.54443893282338873 2.7908803781634761 False 1.2134782 1.1420876 -- 2 16 False 0.35940772 0.74710166 0.26141152 0.91352218 -62.16045 -45.337482 -33.693073 -42.355057 67 2029.0269077666758 3.3669145828795304 17.25655054776395 NOT_AVAILABLE 33.355528435673932 17.971330490608551 266.43026591010408 31.851445954397114

Luckily, we obtain the same result as before.
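
The submit, check and fetch cycle used above can be sketched as a generic polling loop. This is only an illustration: wait_for_job is a hypothetical helper, not part of the tap package, and the phase names (PENDING, QUEUED, EXECUTING, COMPLETED) are those of the IVOA UWS job model, which we assume q.status reports.

```python
import time


def wait_for_job(get_status, poll_every=1.0, timeout=60.0, sleep=time.sleep):
    """Poll get_status() until the job leaves its running phases.

    get_status is any callable returning the job phase as a string,
    for example ``lambda: q.status``. The final phase is returned.
    """
    running = {"PENDING", "QUEUED", "EXECUTING"}   # assumed UWS phase names
    waited = 0.0
    while True:
        phase = get_status()
        if phase not in running:
            return phase                  # COMPLETED, ERROR, ABORTED, ...
        if waited >= timeout:
            raise TimeoutError(
                "job still {0} after {1:.0f} s".format(phase, waited))
        sleep(poll_every)
        waited += poll_every


# Stand-in for a real job: reports QUEUED, EXECUTING, then COMPLETED.
phases = iter(["QUEUED", "EXECUTING", "COMPLETED"])
final = wait_for_job(lambda: next(phases), poll_every=0.0)
print(final)
```

With a real job one would call wait_for_job(lambda: q.status) and only then download the result.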

Authenticate with your account

The current python package also allows you to authenticate. This is mostly relevant when using async queries, or user tables.

gaia.login("my_user_name")

The above will prompt for a password.

Note that the password can also be provided as an argument if you need to script some queries. It will not be stored; only the relevant cookie is kept until the end of the session.

Recalling previous jobs

In some cases (mostly when authenticated) you might want to download the result of a previous job. This Python package allows you to do so using recall_query, which returns an asynchronous result.



In [ ]:

    
qprime = gaia.recall_query(q.jobid)
qprime.get()

Make the luminosity function of the TGAS stars

In this example, we want to make the luminosity function of TGAS stars. Of course we could download all stars and do it on our computer, but we can also do it on the server side and download only the computed histogram.

Hint: building a luminosity function means that we want to select all stars in TGAS (with a magnitude) and count them after grouping them per magnitude bin. Additionally, we want to sort the bins to plot them in order.



In [9]:

    
adql = QueryStr("""
select 
    count(*) as n, 
    round(phot_g_mean_mag, 1) as val
from 
    gaiadr1.tgas_source 
group by val
order by val
""")









    




ADQL query
SELECT
    count(*) AS n,
    round(phot_g_mean_mag, 1) AS val
FROM
    gaiadr1.tgas_source
GROUP BY val
ORDER BY val
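
To see what the ROUND, GROUP BY and ORDER BY combination computes, here is a local equivalent with NumPy on a handful of made-up magnitudes. It is only a sketch of the logic; the point of the query above is precisely to avoid downloading the individual stars.

```python
import numpy as np

# Made-up magnitudes, for illustration only.
mag = np.array([9.96, 10.04, 10.11, 10.12, 10.31, 11.68])

# ROUND(mag, 1), then GROUP BY / COUNT(*) / ORDER BY in a single call:
# np.unique returns the sorted distinct values and their counts.
val, n = np.unique(np.round(mag, 1), return_counts=True)

for v, c in zip(val, n):
    print("{0:5.1f} {1:d}".format(v, c))
```

As in the server-side result, magnitude bins without any star simply do not appear in the output.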

Run the query



In [10]:

    
data_tgas = timeit(gaia.query)(adql)









    




Execution time: 18.9 s

Note: the Gaia Archive sometimes crashes on this query. This is a bug, and worth reporting as feedback to the service.

You can work around the issue by using an async query, or by re-running the query until it succeeds.



In [12]:

    
data_tgas = gaia.query_async(adql).get()









    



Query Status: 303 Reason: 303
Location: http://gea.esac.esa.int/tap-server/tap/async/1479126879844O
Job id: 1479126879844O

Plot the histogram data.



In [13]:

    
plt.step(data_tgas['val'], data_tgas['n'], 
         lw=2, where='pre', label='TGAS')
plt.yscale('log')
plt.xlabel('G magnitude')
plt.ylabel('counts / 0.1 mag')  # the query bins magnitudes with round(..., 1)
figrc.hide_axis('top right'.split())

Note: creating the luminosity function for the full DR1 requires more than 1 min and therefore an async query, which the synchronous gaia.query call used here cannot handle.

Stellar density map

Let's reproduce the stellar density map of TGAS stars with all computations on the server side.

Based on the histogram technique we used for the luminosity function, one can extend it to more than one dimension.



In [14]:

    
adql = QueryStr("""
select 
    count(*) as n, 
    round(l, 0) as x, 
    round(b, 0) as y
from 
    gaiadr1.tgas_source 
group by x, y
order by x, y
""")









    




ADQL query
SELECT
    count(*) AS n,
    round(l, 0) AS x,
    round(b, 0) AS y
FROM
    gaiadr1.tgas_source
GROUP BY x, y
ORDER BY x, y

Run the query



In [15]:

    
data = timeit(gaia.query)(adql)









    




Execution time: 25.7 s

Below you can see that the table is flat (i.e., not a 2D matrix) and that it contains a number $n$ for each pair $(x,y)$. Note also that empty bins are not included.



In [16]:

    
data









    Out[16]:




<Table masked=True length=62938>

n x y
int64 float64 float64
1 0.0 -88.0
1 0.0 -85.0
1 0.0 -84.0
1 0.0 -83.0
1 0.0 -82.0
1 0.0 -81.0
2 0.0 -80.0
3 0.0 -79.0
4 0.0 -78.0
1 0.0 -77.0
... ... ...
2 360.0 77.0
3 360.0 78.0
1 360.0 79.0
4 360.0 80.0
2 360.0 81.0
4 360.0 82.0
1 360.0 86.0
1 360.0 88.0
1 360.0 89.0
1 360.0 90.0

We plot the map directly, without polishing the projection.



In [17]:

    
from matplotlib.colors import LogNorm
l = np.arange(0, 360, 1)
b = np.arange(-90, 90, 1)
n = np.zeros((len(l), len(b)))
ix = np.digitize(data['x'], l)
iy = np.digitize(data['y'], b)
n[ix - 1, iy - 1] = data['n']
plt.pcolormesh(l, b, n.T, 
               cmap=plt.cm.viridis, 
               norm=LogNorm())
figrc.hide_axis('top right'.split())
plt.xlim(l.min(), l.max())
plt.ylim(b.min(), b.max());
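
A possible alternative for filling the grid, given as a sketch on toy data: round() produces l values from 0 to 360 and b values from -90 to 90, so l = 360 can be folded onto l = 0, and np.add.at accumulates counts when several rows land in the same cell (a plain indexed assignment keeps only the last one). The array names and sizes below are choices made for this example.

```python
import numpy as np

# Toy version of the flat (x, y, n) table returned by the server.
x = np.array([0.0, 0.0, 359.0, 360.0, 10.0])
y = np.array([-90.0, 0.0, 0.0, 0.0, 90.0])
counts = np.array([3, 4, 2, 5, 1])

nl, nb = 360, 181                  # l in 0..359, b in -90..90
ix = x.astype(int) % nl            # fold l = 360 onto l = 0
iy = y.astype(int) + 90            # shift b = -90..90 to 0..180

grid = np.zeros((nl, nb))
np.add.at(grid, (ix, iy), counts)  # accumulates repeated (ix, iy) pairs

print(grid[0, 90], grid.sum())
```

Here the rows (0, 0) and (360, 0) end up in the same cell and their counts are summed, so no star is lost when building the 2D map.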

Query around a position

Using the region objects, one can query around a point. Below is an example of cone-search, i.e., selecting stars within 2 degrees of an object:

select * from gaiadr1.gaia_source
where contains(point('ICRS', ra, dec), circle('ICRS',10.6847083,41.26875,2) ) = 1

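The where condition selects the points from the data that lie inside the circle of given center and radius.

Below we make the density map of stars around the same object by combining the 2D-histogram above with the cone-search technique. Note that the query labels l as latitude and b as longitude, although l is the Galactic longitude and b the Galactic latitude; the plotting code simply reuses these alias names.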

Any idea of what object is the center of this selection?



In [18]:

    
adql = QueryStr("""
select 
    count(*) as n, 
    round(l, 2) as latitude, 
    round(b, 2) as longitude
from 
    gaiadr1.gaia_source 
where 
    contains(point('ICRS',gaiadr1.gaia_source.ra,gaiadr1.gaia_source.dec),
             circle('ICRS',10.6847083,41.26875,2) )=1  
group by latitude, longitude
order by latitude, longitude
""")









    




ADQL query
SELECT
    count(*) AS n,
    round(l, 2) AS latitude,
    round(b, 2) AS longitude
FROM
    gaiadr1.gaia_source
WHERE
    contains(point('ICRS',gaiadr1.gaia_source.ra,gaiadr1.gaia_source.dec),
             circle('ICRS',10.6847083,41.26875,2) )=1
GROUP BY latitude, longitude
ORDER BY latitude, longitude

Run the query



In [19]:

    
data = timeit(gaia.query)(adql)









    




Execution time: 13.7 s

Finally we plot the map



In [20]:

    
plt.figure(figsize=(6,6))
plt.subplot(111, aspect=1)
plt.scatter(data['latitude'], data['longitude'], c=data['n'], 
            edgecolor='None', s=6, rasterized=True, norm=LogNorm(),
            cmap=plt.cm.magma, marker='s'
           )
plt.xlim(data['latitude'].min(), data['latitude'].max())
plt.ylim(data['longitude'].min(), data['longitude'].max())
plt.axis('off');

Let's proceed to query around another object, for example M33.

ADQL does not provide a name resolver function, so you have to find out yourself that for M33 you need

circle('ICRS',23.4621,30.6599417,0.5)

Alternatively, this python package provides a name resolver function based on the CDS/Sesame service.

from tap import resolve
ra, dec = resolve('m33')

Below I show the latter option.



In [21]:

    
ra, dec = resolve('m33')
adql = QueryStr("""
select 
    count(*) as n, 
    round(l, 2) as latitude, 
    round(b, 2) as longitude
from 
    gaiadr1.gaia_source 
where 
    contains(point('ICRS',gaiadr1.gaia_source.ra,gaiadr1.gaia_source.dec),
             circle('ICRS',{ra:f}, {dec:f},0.5) )=1  
group by latitude, longitude
order by latitude, longitude
""".format(ra=ra, dec=dec))
data = timeit(gaia.query)(adql)
plt.figure(figsize=(6,6))
plt.subplot(111, aspect=1)
plt.scatter(data['latitude'], data['longitude'], c=data['n'], 
            edgecolor='None', s=9, rasterized=True, norm=LogNorm(),
            cmap=plt.cm.magma, marker='s'
           )
plt.xlim(data['latitude'].min(), data['latitude'].max())
plt.ylim(data['longitude'].min(), data['longitude'].max())
plt.axis('off');









    




ADQL query
SELECT
    count(*) AS n,
    round(l, 2) AS latitude,
    round(b, 2) AS longitude
FROM
    gaiadr1.gaia_source
WHERE
    contains(point('ICRS',gaiadr1.gaia_source.ra,gaiadr1.gaia_source.dec),
             circle('ICRS',23.462100, 30.659942,0.5) )=1
GROUP BY latitude, longitude
ORDER BY latitude, longitude








    




Execution time: 922 ms

Color-magnitude diagram of TGAS

One of the first results published by the Gaia Consortium was the color-magnitude diagram (CMD) of the TGAS data, showing how good the parallaxes are.

Let's reproduce the figure.

Let's make a binned CMD with bins of $0.01$ mag in color and $0.05$ mag in magnitude. Additionally, we need to use the parallax as a distance measurement. As Bailer-Jones (2015) showed, one must be careful when doing so, thus let's only consider stars with $\varpi/\sigma_\varpi \geq 5$. One can also filter on the photometric signal-to-noise (in flux).
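
The quantities used in this selection can be written out locally as a small sketch. The function names are ours; the formulas are the standard ones: with the parallax in mas, $M_G = G + 5\log_{10}(\varpi) - 10$ (extinction ignored), and a relative flux error translates to first order into a magnitude error of $(2.5/\ln 10)\,\sigma_F/F$.

```python
import math


def absolute_magnitude(g_mag, parallax_mas):
    """M_G = G + 5 log10(parallax / mas) - 10, ignoring extinction."""
    return g_mag + 5.0 * math.log10(parallax_mas) - 10.0


def magnitude_error(flux, flux_error):
    """First-order magnitude uncertainty: (2.5 / ln 10) * sigma_F / F."""
    return (2.5 / math.log(10.0)) * flux_error / flux


def bin_floor(value, width):
    """Lower edge of the bin of given width that contains value."""
    return math.floor(value / width) * width


# A star with G = 10 mag and a parallax of 10 mas (i.e. at 100 pc)
m_abs = absolute_magnitude(10.0, 10.0)
# A 1% flux uncertainty
sigma_mag = magnitude_error(1000.0, 10.0)
print(m_abs, sigma_mag, bin_floor(4.37, 0.05))
```

The floor-divide-multiply pattern of bin_floor is the one applied to both the color and the magnitude when binning the CMD on the server.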



In [22]:

    
adql = QueryStr("""
select 
    count(*) as n,
    floor((hip.bt_mag - hip.vt_mag) / 0.01) * 0.01 as color,
    floor((gaia.phot_g_mean_mag + 5*log10(gaia.parallax)-10) / 0.05) * 0.05 as mag
from 
    gaiadr1.tgas_source as gaia
inner join 
    public.tycho2 as hip
    on gaia.hip = hip.hip
where 
    gaia.parallax / gaia.parallax_error >= 5 
    and (2.5/log(10)) * (gaia.phot_g_mean_flux_error / gaia.phot_g_mean_flux) <= 0.05
group by color, mag
""")
data = timeit(gaia.query)(adql)









    




ADQL query
SELECT
    count(*) AS n,
    floor((hip.bt_mag - hip.vt_mag) / 0.01) * 0.01 AS color,
    floor((gaia.phot_g_mean_mag + 5*log10(gaia.parallax)-10) / 0.05) * 0.05 AS mag
FROM
    gaiadr1.tgas_source AS gaia
INNER JOIN
    public.tycho2 AS hip
    ON gaia.hip = hip.hip
WHERE
    gaia.parallax / gaia.parallax_error >= 5
    AND (2.5/log(10)) * (gaia.phot_g_mean_flux_error / gaia.phot_g_mean_flux) <= 0.05
GROUP BY color, mag








    




Execution time: 6.42 s

Plotting the CMD is the only local operation.



In [23]:

    
plt.scatter(data['color'], data['mag'], c=data['n'], 
            edgecolor='None', s=1, rasterized=True, norm=LogNorm(),
            cmap=plt.cm.magma, marker='o'
           )
plt.xlim(data['color'].min(), data['color'].max())
plt.ylim(data['mag'].max(), data['mag'].min())
plt.xlabel(r'$B_T - V_T$ (Tycho-2)')  # the color is bt_mag - vt_mag from public.tycho2
plt.ylabel(r'G + 5 log($\varpi$) - 10')
figrc.hide_axis('top right'.split())

Using DR1 crossmatched catalogs: joining tables

Many surveys are already crossmatched by Gaia DPAC (Data Processing and Analysis Consortium). However, accessing these crossmatches is not trivial for many users: it requires joining tables on some id values.

Getting TGAS and Tycho2 missing IDs

Note the use of left outer join, which keeps every row of the left table (here gaia), adds the columns of the right table, and fills them in whenever a match exists.



In [24]:

    
adql = QueryStr("""
select top 10
        gaia.hip, gaia.tycho2_id, gaia.source_id,
        tycho2.bt_mag, tycho2.vt_mag, tycho2.e_bt_mag, tycho2.e_vt_mag
from 
        gaiadr1.tgas_source as gaia
left outer join
        public.tycho2 as tycho2 
        on gaia.tycho2_id = tycho2.id
""")
timeit(gaia.query)(adql)









    




ADQL query
SELECT top 10
        gaia.hip, gaia.tycho2_id, gaia.source_id,
        tycho2.bt_mag, tycho2.vt_mag, tycho2.e_bt_mag, tycho2.e_vt_mag
FROM
        gaiadr1.tgas_source AS gaia
LEFT OUTER JOIN
        public.tycho2 AS tycho2
        ON gaia.tycho2_id = tycho2.id








    




Execution time: 135 ms







    Out[24]:




<Table masked=True length=10>

hip   tycho2_id   source_id           bt_mag  vt_mag    e_bt_mag    e_vt_mag
                                      mag     mag       mag         mag
int32 object      int64               float32 float32   float32     float32
--    1000-1009-1 4493714846038108800 12.762  12.157    0.236       0.193
--    1000-1016-1 4492839806583533312 11.131  10.695    0.057999998 0.061999999
--    1000-1018-1 4493575723457455872 12.224  11.849    0.155       0.163
--    1000-1043-1 4494114312356365568 10.274  9.2080002 0.035       0.021
--    1000-1068-1 4493519648365739008 11.95   10.608    0.106       0.048999999
--    1000-108-1  4493522603303232768 12.391  12.389    0.185       0.21600001
--    1000-1087-1 4493716048628949632 11.529  11.021    0.077       0.079999998
--    1000-1092-1 4492866469738055936 12.696  12.115    0.248       0.20900001
--    1000-111-1  4493709520280710528 12.048  11.467    0.121       0.104
--    1000-1117-1 4493890870978560640 11.357  10.841    0.064000003 0.064000003

What happens if you do inner join instead?
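
As a hint, here is a toy illustration in plain Python of the difference between the two kinds of join. The tables and values are made up, and the join function is a deliberately minimal sketch, not how the server implements it.

```python
# Toy tables keyed by a Tycho-2 identifier; all values are made up.
gaia_rows = [
    {"tycho2_id": "A", "source_id": 1},
    {"tycho2_id": "B", "source_id": 2},    # not present in the right table
    {"tycho2_id": None, "source_id": 3},   # no Tycho-2 identifier at all
]
tycho2 = {"A": {"bt_mag": 12.7}, "C": {"bt_mag": 11.1}}


def join(left, right, key, how="left"):
    """Join a list of dicts to a dict of dicts on left[key]."""
    out = []
    for row in left:
        match = right.get(row[key])
        if match is None and how == "inner":
            continue                       # inner join drops unmatched rows
        merged = dict(row)
        # left join keeps the row and fills the missing column with None
        merged["bt_mag"] = match["bt_mag"] if match else None
        out.append(merged)
    return out


left_rows = join(gaia_rows, tycho2, "tycho2_id", how="left")
inner_rows = join(gaia_rows, tycho2, "tycho2_id", how="inner")
print(len(left_rows), len(inner_rows))
```

The left join returns as many rows as the left table, with empty values where no counterpart exists, while the inner join only returns the matched rows.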



In [25]:

    
adql = QueryStr("""
select top 10
    gaia.hip, gaia.tycho2_id, gaia.source_id,
    gaia.ra, gaia.ra_error, gaia.dec, gaia.dec_error, 
    gaia.parallax, gaia.parallax_error, gaia.pmra, gaia.pmra_error,
    gaia.pmdec, gaia.pmdec_error, gaia.ra_dec_corr, gaia.ra_parallax_corr,
    gaia.phot_g_n_obs, gaia.phot_g_mean_flux, gaia.phot_g_mean_flux_error,
    gaia.phot_g_mean_mag, gaia.phot_variable_flag, gaia.l, gaia.b,
    gaia.ecl_lon, gaia.ecl_lat, tycho2.bt_mag, tycho2.vt_mag,
    tycho2.e_bt_mag, tycho2.e_vt_mag, 
    allwise.allwise_oid, allwise.w1mpro,
    allwise.w1mpro_error, allwise.w2mpro, allwise.w2mpro_error,
    allwise.w3mpro, allwise.w3mpro_error, allwise.w4mpro,
    allwise.w4mpro_error, allwise.var_flag, allwise.w1mjd_mean,
    allwise.w2mjd_mean, allwise.w3mjd_mean, allwise.w4mjd_mean,
    allwise.w1gmag, allwise.w1gmag_error, allwise.w2gmag,
    allwise.w2gmag_error, allwise.w3gmag, allwise.w3gmag_error,
    allwise.w4gmag, allwise.w4gmag_error,
    tmass.tmass_oid, tmass.j_m, tmass.j_msigcom, tmass.h_m, tmass.h_msigcom,
    tmass.ks_m, tmass.ks_msigcom
from 
    gaiadr1.tgas_source as gaia
left outer join
    public.tycho2 as tycho2  
    on gaia.tycho2_id = tycho2.id
left outer join
    gaiadr1.allwise_best_neighbour as allwisexmatch  
    on gaia.source_id = allwisexmatch.source_id  
left outer join 
    gaiadr1.allwise_original_valid as allwise 
    on allwisexmatch.allwise_oid = allwise.allwise_oid  
left outer join 
    gaiadr1.tmass_best_neighbour as tmassxmatch
    on gaia.source_id = tmassxmatch.source_id  
left outer join
    gaiadr1.tmass_original_valid as tmass  
    on tmassxmatch.tmass_oid = tmass.tmass_oid
""")
data = timeit(gaia.query)(adql)









    




ADQL query
SELECT top 10
    gaia.hip, gaia.tycho2_id, gaia.source_id,
    gaia.ra, gaia.ra_error, gaia.dec, gaia.dec_error,
    gaia.parallax, gaia.parallax_error, gaia.pmra, gaia.pmra_error,
    gaia.pmdec, gaia.pmdec_error, gaia.ra_dec_corr, gaia.ra_parallax_corr,
    gaia.phot_g_n_obs, gaia.phot_g_mean_flux, gaia.phot_g_mean_flux_error,
    gaia.phot_g_mean_mag, gaia.phot_variable_flag, gaia.l, gaia.b,
    gaia.ecl_lon, gaia.ecl_lat, tycho2.bt_mag, tycho2.vt_mag,
    tycho2.e_bt_mag, tycho2.e_vt_mag,
    allwise.allwise_oid, allwise.w1mpro,
    allwise.w1mpro_error, allwise.w2mpro, allwise.w2mpro_error,
    allwise.w3mpro, allwise.w3mpro_error, allwise.w4mpro,
    allwise.w4mpro_error, allwise.var_flag, allwise.w1mjd_mean,
    allwise.w2mjd_mean, allwise.w3mjd_mean, allwise.w4mjd_mean,
    allwise.w1gmag, allwise.w1gmag_error, allwise.w2gmag,
    allwise.w2gmag_error, allwise.w3gmag, allwise.w3gmag_error,
    allwise.w4gmag, allwise.w4gmag_error,
    tmass.tmass_oid, tmass.j_m, tmass.j_msigcom, tmass.h_m, tmass.h_msigcom,
    tmass.ks_m, tmass.ks_msigcom
FROM
    gaiadr1.tgas_source AS gaia
LEFT OUTER JOIN
    public.tycho2 AS tycho2
    ON gaia.tycho2_id = tycho2.id
LEFT OUTER JOIN
    gaiadr1.allwise_best_neighbour AS allwisexmatch
    ON gaia.source_id = allwisexmatch.source_id
LEFT OUTER JOIN
    gaiadr1.allwise_original_valid AS allwise
    ON allwisexmatch.allwise_oid = allwise.allwise_oid
LEFT OUTER JOIN
    gaiadr1.tmass_best_neighbour AS tmassxmatch
    ON gaia.source_id = tmassxmatch.source_id
LEFT OUTER JOIN
    gaiadr1.tmass_original_valid AS tmass
    ON tmassxmatch.tmass_oid = tmass.tmass_oid








    




Execution time: 513 ms

Example

This example is from C. A. L. Bailer-Jones

TGAS apparently has the unfortunate property that it never reports both the Hipparcos and the Tycho-2 identifier for a source, only one of the two. Getting both therefore takes a little extra work.

Additionally, CBJ wanted to apply selections on the parallax and proper motion to update his stellar encounter study.

Below we select the various id values but fill the Tycho-2 ids for Hipparcos stars when available in the Tycho-2 data. Moreover, CBJ wants to filter stars based on their motion:

\begin{equation} \frac{1000 \cdot 4.74047 \cdot \sqrt{\mu_\alpha^2 + \mu_\delta^2} / \varpi^2}{ \sqrt{ (\mu_\alpha^2 + \mu_\delta^2) \cdot (4.74047/\varpi)^2 + 500^2} } < 10 \end{equation}

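where $4.74047$ is the equivalent of $1$ AU/yr in km/s, $\mu_\alpha$ and $\mu_\delta$ are the proper motions (pmra and pmdec, in mas/yr), $\varpi$ is the parallax (in mas), and $500$ km/s is the assumed radial velocity $R_V$.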
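To make the selection concrete, here is a minimal NumPy sketch of the left-hand side of the inequality. The function name is hypothetical and not part of the notebook's libraries. A possible reading, not stated in the original, is that the expression is the distance ($1000/\varpi$ in pc) multiplied by the ratio of transverse velocity ($4.74047\,\mu/\varpi$ in km/s) to total velocity, so the cut keeps stars that could pass within 10 pc under linear motion.

```python
import numpy as np

K = 4.74047  # km/s equivalent of 1 AU/yr

def encounter_statistic(pmra, pmdec, parallax, rv=500.0):
    """Left-hand side of the selection above.

    pmra, pmdec in mas/yr, parallax in mas, rv in km/s (assumed value).
    """
    pmra, pmdec, parallax = map(np.asarray, (pmra, pmdec, parallax))
    mu2 = pmra ** 2 + pmdec ** 2
    numerator = 1000.0 * K * np.sqrt(mu2) / parallax ** 2
    denominator = np.sqrt(mu2 * (K / parallax) ** 2 + rv ** 2)
    return numerator / denominator

# Rounded astrometry of one TGAS star (parallax of about 40.8 mas)
value = encounter_statistic(178.7486, -420.7996, 40.8000)
keep = value < 10
print(value, keep)
```

The function accepts arrays as well, so it can be used to re-check the rows of a downloaded table against the server-side selection.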



In [30]:

    
adql = QueryStr("""
select 
    tgas.tycho2_id, tycho2.id, tgas.source_id, tgas.phot_g_mean_mag,
    tgas.ra, tgas.dec, tgas.parallax, tgas.pmra, tgas.pmdec, tgas.ra_error,
    tgas.dec_error, tgas.parallax_error, tgas.pmra_error, tgas.pmdec_error,
    tgas.ra_dec_corr, tgas.ra_parallax_corr, tgas.ra_pmra_corr,
    tgas.ra_pmdec_corr, tgas.dec_parallax_corr, tgas.dec_pmra_corr,
    tgas.dec_pmdec_corr, tgas.parallax_pmra_corr, tgas.parallax_pmdec_corr,
    tgas.pmra_pmdec_corr
from 
    gaiadr1.tgas_source as tgas
left outer join 
    public.tycho2 as tycho2
    on tgas.hip = tycho2.hip
where ( (
            1000 * 4.74047 * sqrt(power(tgas.pmra, 2) 
            + power(tgas.pmdec, 2)) / power(tgas.parallax, 2)
         ) /
         ( sqrt(
                (power(tgas.pmra, 2) 
                + power(tgas.pmdec, 2)) * power(4.74047 / tgas.parallax, 2)
                + power(500, 2) ) )
        ) < 10
""")
data = timeit(gaia.query)(adql)









    




ADQL query
SELECT
    tgas.tycho2_id, tycho2.id, tgas.source_id, tgas.phot_g_mean_mag,
    tgas.ra, tgas.dec, tgas.parallax, tgas.pmra, tgas.pmdec, tgas.ra_error,
    tgas.dec_error, tgas.parallax_error, tgas.pmra_error, tgas.pmdec_error,
    tgas.ra_dec_corr, tgas.ra_parallax_corr, tgas.ra_pmra_corr,
    tgas.ra_pmdec_corr, tgas.dec_parallax_corr, tgas.dec_pmra_corr,
    tgas.dec_pmdec_corr, tgas.parallax_pmra_corr, tgas.parallax_pmdec_corr,
    tgas.pmra_pmdec_corr
FROM
    gaiadr1.tgas_source AS tgas
LEFT OUTER JOIN
    public.tycho2 AS tycho2
    ON tgas.hip = tycho2.hip
WHERE ( (
            1000 * 4.74047 * sqrt(power(tgas.pmra, 2)
            + power(tgas.pmdec, 2)) / power(tgas.parallax, 2)
         ) /
         ( sqrt(
                (power(tgas.pmra, 2)
                + power(tgas.pmdec, 2)) * power(4.74047 / tgas.parallax, 2)
                + power(500, 2) ) )
        ) < 10








    




Execution time: 48.5 s



In [31]:

    
data









    Out[31]:




<Table masked=True length=117330>

tycho2_id id source_id phot_g_mean_mag ra dec parallax pmra pmdec ra_error dec_error parallax_error pmra_error pmdec_error ra_dec_corr ra_parallax_corr ra_pmra_corr ra_pmdec_corr dec_parallax_corr dec_pmra_corr dec_pmdec_corr parallax_pmra_corr parallax_pmdec_corr pmra_pmdec_corr
Magnitude[mag] Angle[deg] Angle[deg] Angle[mas] Angular Velocity[mas/year] Angular Velocity[mas/year] Angle[mas] Angle[mas] Angle[mas] Angular Velocity[mas/year] Angular Velocity[mas/year] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description]
object object int64 float64 float64 float64 float64 float64 float64 float64 float64 float64 float64 float64 float32 float32 float32 float32 float32 float32 float32 float32 float32 float32
-- 958-1468-2 4464207424282475392 7.1198602385889878 243.32764068754435 13.52515172664916 40.800006093117474 178.74861089198765 -420.79960382877454 0.13104649484244665 0.16117893533526509 0.31043723997045586 0.064601257490059308 0.057930718633481051 0.29621336 -0.41621789 0.13098365 -0.006937637 -0.72872072 0.059074912 0.035693608 -0.095779195 -0.054888006 -0.16881736
-- 74-2361-1 3255252986058161408 7.3695276910331131 64.006094273713217 0.4538119833656295 7.1071761246510494 24.354612773084156 -20.989618596126128 0.29644318347422099 0.096150567587155025 0.35108547075407198 0.06290689925903338 0.044369717746791014 0.49005944 -0.60830009 0.16846217 0.062295858 -0.1559172 0.085781589 0.063546307 -0.12095547 -0.046934858 0.028684257
-- 96-602-1 3288082753998763904 10.510600620896234 73.024528087103548 6.4752772048123743 80.712650990387615 153.81772481777199 -305.57059728598551 0.26137398028662923 0.18054158432210146 0.33123962532366935 0.24514693508660898 0.14702434796739774 0.16450725 -0.013415399 -0.27141082 0.15436758 -0.13150086 0.18181257 -0.24545169 -0.11510445 0.054411348 -0.11440114
-- 642-261-1 26101272172092160 10.955850032461074 39.161829394123984 12.207783096620895 20.383739875496488 240.67529562993337 -64.485729172308936 0.23582725901875154 0.20845897631933336 0.31685266385133043 0.1909477682115549 0.13636953348692929 -0.3721309 0.43391946 -0.070138283 -0.16176727 -0.71138328 -0.051297918 0.010662213 -0.050761018 -0.16512752 0.16369283
-- 635-26-1 26283928541202816 10.809861178182299 37.22781819174989 12.089496787535225 39.919961935565183 -2.5864328472578642 81.626292496943535 0.60039515603994131 0.46890750268181058 0.67934103200164608 0.18503126835123951 0.1081967345680231 -0.50900584 0.56034762 0.059183244 -0.15097594 -0.70150846 -0.075549848 0.15309452 0.032147638 -0.1839865 0.20139986
-- 705-228-1 3338162725508646528 10.057800898776822 83.060306350812724 9.8199099636248448 77.77121731583712 -180.0535819052561 -216.91689382915277 0.40115072093741344 0.53875097725055132 0.61362773790439706 0.19852981735012187 0.10421063302057142 -0.37349886 0.24006249 0.059844881 -0.047105495 -0.76504147 -0.035682362 0.17846557 0.065704599 -0.13622048 0.052898854
-- 883-319-2 3731076710381126912 8.1492894201797128 196.12815296905151 8.6531822316745419 5.1696155054476867 16.791367641792707 0.15321456418562637 0.30133273705467556 0.19938605707508514 0.34002505864489424 0.079280953996536327 0.049291718311859016 -0.71896017 -0.61844242 0.11187758 -0.071467325 0.47721675 -0.064067535 0.15949973 -0.060346734 0.020644929 -0.4326793
-- 993-77-1 4489070130967038720 11.093453539211243 265.77440288247851 9.0750980412265125 17.64909121187484 -32.322782317896724 187.88341033401773 0.19190892142781865 0.20035539243054135 0.25557908589771844 0.18762798949759626 0.14448000991756382 -0.073481336 0.094857119 -0.26464456 -0.062460415 -0.10070089 -0.12583351 -0.1455414 0.049493775 0.014614867 -0.15881599
-- 1176-118-1 2770944055625414144 11.17866688441425 355.46780594473444 14.106812279658827 23.097433789789051 189.59831620509766 -70.023716286970142 0.35201742480990672 0.17642190754784473 0.39888480045774105 0.28898996110418657 0.15902621433786063 0.48470202 0.01193755 -0.058665954 -0.13205461 -0.15349667 -0.086680703 -0.0935615 -0.16123833 -0.10170623 0.38748664
... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...
5867-270-1 -- 5152647044963242880 10.603436205515926 44.968316611370177 -17.500070298881951 7.4025990943432145 33.729324908746115 -17.733903316094089 0.20030820512607653 0.17715857805919705 0.21931992480341894 1.3612821907133699 0.91192106006231177 -0.27455848 0.35145411 0.15453874 -0.2122146 -0.00012747967 0.24284515 -0.40720624 0.17599767 -0.3501786 -0.86046755
5860-278-1 -- 5133045467059428480 11.164121396948143 38.266186307013285 -17.383303778755952 1.9774922198410643 -0.037879962195114525 -3.3520496302505287 0.19359523593913508 0.18120247704525139 0.2446551981971728 1.4320734219309259 0.89798867352525369 -0.44771698 0.48197046 -0.33075052 0.20729962 -0.38693327 0.42750016 -0.45998341 0.15902944 -0.27701685 -0.90613073
5894-233-1 -- 5092043441508980608 11.987577214845796 66.764372011830886 -19.811178724556346 4.3356944429337352 6.0191248770749803 -1.7024409099814568 0.13697654328442774 0.19147288767566137 0.23564775614872413 0.98110342836211573 0.7809818155346101 -0.27811912 0.18249577 -0.069153123 0.14320052 -0.078221299 0.42834297 -0.60344654 0.45738769 -0.36387622 -0.75557935
5894-1920-1 -- 5092057185404597760 9.7668801564711352 66.754995053429994 -19.570066191533872 4.8251422455439439 13.791671672690189 13.62339289293161 0.12344423467728835 0.20226559239547925 0.25375842100609775 0.7327198605349774 0.64952147017063611 -0.22804828 0.29718578 0.032664396 0.00015251618 -0.20508844 0.32589087 -0.5659228 0.28827974 -0.33773363 -0.64709479
5893-1059-1 -- 5092106938305739008 9.8765842941302768 66.513808706335112 -19.509132519523739 4.6914302182456424 -4.3777128494041664 3.177391725353508 0.15136787302026783 0.18622093437562989 0.2612116654668436 0.82086003714618427 0.75690803603805246 -0.062303752 0.25545049 -0.012737771 -0.071373485 0.19210443 0.32080275 -0.62045294 0.3567434 -0.48555902 -0.60528684
5893-1668-1 -- 5092190295031256320 9.745411171958466 65.380809466275082 -19.757701002476193 40.587814749808629 -201.93506861940239 -291.88917600303193 0.21112064350820658 0.2642609565966999 0.25989078037868213 1.3222064343951598 1.4249203796536405 -0.70768625 -0.38236192 -0.55642474 0.54509825 0.59547734 0.75052881 -0.8312822 0.67618316 -0.71395391 -0.9336803
5299-425-1 -- 5161936440749205376 11.886753070455052 52.71616317941902 -11.898024789546774 9.8755127020586908 -2.8198575902724001 -34.014293489701792 0.24833375628979376 0.22269552566153647 0.28983543837030989 1.4400720315947884 1.0200575979695843 -0.38644001 0.26966658 -0.30507544 0.17098616 -0.12732491 0.42667368 -0.61136782 0.20748778 -0.29711589 -0.8360461
5861-1838-1 -- 5124861320977821184 10.081969807740428 33.692563652566562 -20.770610986489491 7.4410265692600861 51.264674341404856 -8.5491220255946718 0.23977256166288055 0.19582961978752872 0.23588463292782372 0.7279022141830459 0.44240857853411397 -0.66783315 0.34615254 -0.88598007 0.70222366 -0.53942156 0.73258668 -0.71669334 -0.45975381 0.17038192 -0.73626155
5861-1966-1 -- 5125071671296190336 11.182314959213448 33.937992298003024 -19.853078395552302 2.8518608454983254 -0.40834817837594367 4.8134078616850173 1.8476961036828168 0.99688663455041782 0.4474011068980896 4.6247995316371622 2.7479672934106851 -0.97831637 -0.58271301 -0.99506092 0.9883343 0.49158117 0.98296231 -0.98019689 0.54243255 -0.59211057 -0.98712897
6434-853-1 -- 5125144891898722176 10.669049156745469 39.845819460599643 -24.020891343012003 3.3885319449480238 9.6228154141568858 2.3966449718780307 0.25127222426550916 0.24118940423905977 0.2829941747503309 0.78817470566861125 0.7436670838360756 -0.40587583 0.49761012 -0.81417626 0.54049796 -0.43783897 0.51121449 -0.55641109 -0.47965857 0.22281404 -0.60499299



In [32]:

    
adql = QueryStr("""
select 
    tgas.tycho2_id, tycho2.id, tgas.source_id, tgas.phot_g_mean_mag,
    tgas.ra, tgas.dec, tgas.parallax, tgas.pmra, tgas.pmdec, tgas.ra_error,
    tgas.dec_error, tgas.parallax_error, tgas.pmra_error, tgas.pmdec_error,
    tgas.ra_dec_corr, tgas.ra_parallax_corr, tgas.ra_pmra_corr,
    tgas.ra_pmdec_corr, tgas.dec_parallax_corr, tgas.dec_pmra_corr,
    tgas.dec_pmdec_corr, tgas.parallax_pmra_corr, tgas.parallax_pmdec_corr,
    tgas.pmra_pmdec_corr
from 
    gaiadr1.tgas_source as tgas
left outer join 
    public.tycho2 as tycho2
    on tgas.hip = tycho2.hip
where ( (
            1000 * 4.74047 * sqrt(power(tgas.pmra, 2) 
            + power(tgas.pmdec, 2)) / power(tgas.parallax, 2)
         ) /
         ( sqrt(
                (power(tgas.pmra, 2) 
                + power(tgas.pmdec, 2)) * power(4.74047 / tgas.parallax, 2)
                + power(500, 2) ) )
        ) < 10
""")
gaia.query_async(adql).get()









    




ADQL query
SELECT
    tgas.tycho2_id, tycho2.id, tgas.source_id, tgas.phot_g_mean_mag,
    tgas.ra, tgas.dec, tgas.parallax, tgas.pmra, tgas.pmdec, tgas.ra_error,
    tgas.dec_error, tgas.parallax_error, tgas.pmra_error, tgas.pmdec_error,
    tgas.ra_dec_corr, tgas.ra_parallax_corr, tgas.ra_pmra_corr,
    tgas.ra_pmdec_corr, tgas.dec_parallax_corr, tgas.dec_pmra_corr,
    tgas.dec_pmdec_corr, tgas.parallax_pmra_corr, tgas.parallax_pmdec_corr,
    tgas.pmra_pmdec_corr
FROM
    gaiadr1.tgas_source AS tgas
LEFT OUTER JOIN
    public.tycho2 AS tycho2
    ON tgas.hip = tycho2.hip
WHERE ( (
            1000 * 4.74047 * sqrt(power(tgas.pmra, 2)
            + power(tgas.pmdec, 2)) / power(tgas.parallax, 2)
         ) /
         ( sqrt(
                (power(tgas.pmra, 2)
                + power(tgas.pmdec, 2)) * power(4.74047 / tgas.parallax, 2)
                + power(500, 2) ) )
        ) < 10








    



Query Status: 303 Reason: 303
Location: http://gea.esac.esa.int/tap-server/tap/async/1479127182720O
Job id: 1479127182720O






    Out[32]:




<Table masked=True length=333180>

tycho2_id id source_id phot_g_mean_mag ra dec parallax pmra pmdec ra_error dec_error parallax_error pmra_error pmdec_error ra_dec_corr ra_parallax_corr ra_pmra_corr ra_pmdec_corr dec_parallax_corr dec_pmra_corr dec_pmdec_corr parallax_pmra_corr parallax_pmdec_corr pmra_pmdec_corr
Magnitude[mag] Angle[deg] Angle[deg] Angle[mas] Angular Velocity[mas/year] Angular Velocity[mas/year] Angle[mas] Angle[mas] Angle[mas] Angular Velocity[mas/year] Angular Velocity[mas/year] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description] Dimensionless[see description]
object object int64 float64 float64 float64 float64 float64 float64 float64 float64 float64 float64 float64 float32 float32 float32 float32 float32 float32 float32 float32 float32 float32
-- 958-1468-2 4464207424282475392 7.1198602385889878 243.32764068754435 13.52515172664916 40.800006093117474 178.74861089198765 -420.79960382877454 0.13104649484244665 0.16117893533526509 0.31043723997045586 0.064601257490059308 0.057930718633481051 0.29621336 -0.41621789 0.13098365 -0.006937637 -0.72872072 0.059074912 0.035693608 -0.095779195 -0.054888006 -0.16881736
-- 74-2361-1 3255252986058161408 7.3695276910331131 64.006094273713217 0.4538119833656295 7.1071761246510494 24.354612773084156 -20.989618596126128 0.29644318347422099 0.096150567587155025 0.35108547075407198 0.06290689925903338 0.044369717746791014 0.49005944 -0.60830009 0.16846217 0.062295858 -0.1559172 0.085781589 0.063546307 -0.12095547 -0.046934858 0.028684257
-- 96-602-1 3288082753998763904 10.510600620896234 73.024528087103548 6.4752772048123743 80.712650990387615 153.81772481777199 -305.57059728598551 0.26137398028662923 0.18054158432210146 0.33123962532366935 0.24514693508660898 0.14702434796739774 0.16450725 -0.013415399 -0.27141082 0.15436758 -0.13150086 0.18181257 -0.24545169 -0.11510445 0.054411348 -0.11440114
-- 642-261-1 26101272172092160 10.955850032461074 39.161829394123984 12.207783096620895 20.383739875496488 240.67529562993337 -64.485729172308936 0.23582725901875154 0.20845897631933336 0.31685266385133043 0.1909477682115549 0.13636953348692929 -0.3721309 0.43391946 -0.070138283 -0.16176727 -0.71138328 -0.051297918 0.010662213 -0.050761018 -0.16512752 0.16369283
-- 635-26-1 26283928541202816 10.809861178182299 37.22781819174989 12.089496787535225 39.919961935565183 -2.5864328472578642 81.626292496943535 0.60039515603994131 0.46890750268181058 0.67934103200164608 0.18503126835123951 0.1081967345680231 -0.50900584 0.56034762 0.059183244 -0.15097594 -0.70150846 -0.075549848 0.15309452 0.032147638 -0.1839865 0.20139986
-- 705-228-1 3338162725508646528 10.057800898776822 83.060306350812724 9.8199099636248448 77.77121731583712 -180.0535819052561 -216.91689382915277 0.40115072093741344 0.53875097725055132 0.61362773790439706 0.19852981735012187 0.10421063302057142 -0.37349886 0.24006249 0.059844881 -0.047105495 -0.76504147 -0.035682362 0.17846557 0.065704599 -0.13622048 0.052898854
-- 883-319-2 3731076710381126912 8.1492894201797128 196.12815296905151 8.6531822316745419 5.1696155054476867 16.791367641792707 0.15321456418562637 0.30133273705467556 0.19938605707508514 0.34002505864489424 0.079280953996536327 0.049291718311859016 -0.71896017 -0.61844242 0.11187758 -0.071467325 0.47721675 -0.064067535 0.15949973 -0.060346734 0.020644929 -0.4326793
-- 993-77-1 4489070130967038720 11.093453539211243 265.77440288247851 9.0750980412265125 17.64909121187484 -32.322782317896724 187.88341033401773 0.19190892142781865 0.20035539243054135 0.25557908589771844 0.18762798949759626 0.14448000991756382 -0.073481336 0.094857119 -0.26464456 -0.062460415 -0.10070089 -0.12583351 -0.1455414 0.049493775 0.014614867 -0.15881599
-- 1176-118-1 2770944055625414144 11.17866688441425 355.46780594473444 14.106812279658827 23.097433789789051 189.59831620509766 -70.023716286970142 0.35201742480990672 0.17642190754784473 0.39888480045774105 0.28898996110418657 0.15902621433786063 0.48470202 0.01193755 -0.058665954 -0.13205461 -0.15349667 -0.086680703 -0.0935615 -0.16123833 -0.10170623 0.38748664
... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...
-- -- 2022113552630031488 10.809417980891368 295.21169535460751 27.135500157046984 1.1850804393052741 0.094647723443364196 -1.0830978095114829 0.26187595094810834 0.38488387579966066 0.44052739754518433 0.13168395432746879 0.19988399431264031 -0.58875716 -0.14715578 0.0074138343 -0.10639651 0.0010837732 -0.1046929 0.026150052 -0.076331846 -0.081078418 -0.23453073
-- -- 2160231179655633024 9.7948495862371043 279.59682061496994 63.537279571946904 5.651389675567148 -19.205970300866717 17.214189395099307 0.16488992828947119 0.21876538083999636 0.21043742849875863 0.29306829593715022 0.30402815260508692 -0.010994269 -0.28242409 -0.046643503 -0.024930874 0.46829367 -0.16849175 -0.14975509 -0.27645656 -0.06322892 0.14222069
-- -- 5480636002093933312 11.446477304631612 104.15399891384047 -59.181371588182436 23.424843588095328 -150.78273904046455 454.35028679342736 0.43385865705643123 0.44692020226878532 0.46665115913645738 0.22675609712505176 0.27423566509008374 0.14139499 0.37014896 0.047423679 0.022717975 0.23878802 0.073354833 -0.085003823 0.13153161 -0.035147347 -0.075632788
-- -- 6258628096079533568 10.679517992834739 231.18923796930426 -17.128436509086288 33.146080976234572 -319.46156015395218 -201.67971887246895 0.31337904217038193 0.23285269820946283 0.37678652176389832 0.27197716663699284 0.19061893014805048 -0.22484814 0.35710266 -0.12849087 0.030083697 -0.52006841 -0.053378418 -0.13915376 0.15462995 -0.0079136845 -0.14807144
-- -- 3517841204861700480 10.402337441827887 182.79805471760096 -19.961343142015021 78.334858103802333 -212.86048826743612 -184.26565696706587 0.64595292107419156 0.45768057778449234 0.74960207722290939 0.12343635100546652 0.083204513649514519 -0.17557824 -0.27619189 0.18718562 0.00026196352 -0.25005001 -0.011549242 0.21315245 -0.033735294 -0.090193689 -0.31117707
-- -- 5776645010694951424 12.239973169418064 266.32849154808196 -78.531791105480124 1.8040871588291458 2.3325685671783343 1.6036632159191313 0.25439588150715969 0.31472172363723616 0.34454601676573277 0.24897244564818014 0.27345714493246609 0.36303121 -0.11208639 -0.097006612 0.085304782 -0.30294794 0.11997031 -0.13838637 -0.0364061 0.11174904 0.048471533
-- -- 2664795794330096512 11.411825335942742 349.13008863769124 6.9956754022071514 12.148058725446191 17.631118317221926 -58.664007000190523 0.21505496403892521 0.12773788914643644 0.26268578204508691 0.31643860669453289 0.20876037445503348 -0.12801878 0.47826239 -0.24485251 -0.44638935 -0.47547111 -0.13365172 -0.05507322 -0.36148632 -0.48934376 0.57178384
-- -- 6637022789397193856 10.919085255653844 283.21617362876736 -57.130253817383185 39.223923569807397 -246.05303998875536 -771.78617663544856 0.2586351503407488 0.28931453918304162 0.3839977537588376 0.22508644628719557 0.20224064951627124 -0.33466527 0.30280566 -0.14484498 -0.0093929665 -0.42912394 0.11182686 0.035479687 -0.065175578 -0.10270225 0.30287388
-- -- 1078374307005998592 10.386880096586156 158.53395763398763 73.836685988784396 5.9706353380101298 -17.15652431194281 -21.2591211378226 0.18242374887267754 0.21304010301896981 0.24412511727206143 0.51045713831048822 0.57072025476603139 0.18692671 -0.38935751 -0.4243913 0.099707037 -0.47588104 -0.10381275 -0.28260615 0.36046013 -0.014499335 0.42040604
-- -- 3250328273477242624 10.323536076680888 56.848008296367631 -1.973349303793126 59.265918380724678 180.4326672102344 -274.09586510398049 0.31357780161931398 0.18853398069647617 0.33196853455369524 0.17001322138876818 0.12728117236514613 -0.48252013 0.11179933 0.03517729 -0.042679433 -0.41038513 -0.14983699 -0.14641833 0.012961082 -0.050827395 0.29421562

As you can see, the asynchronous query does not return the same number of entries as the synchronous one: 333180 rows instead of 117330.

There can be many reasons for this (including a bug in the service), but one of the synchronous-mode limits may have been reached. The most likely candidate is the 1 min time limit, although the reported execution time (48.5 s) is still under that.
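One way to test the truncation hypothesis is to check whether the synchronous result is a strict subset of the asynchronous one. The helper below is a hypothetical sketch (its name is not part of the notebook's libraries); it only assumes that both results expose a `source_id` column that can be converted to an array.

```python
import numpy as np

def compare_source_ids(sync_ids, async_ids):
    """Summarise how two result sets overlap, based on their source_id columns."""
    sync_ids = np.unique(np.asarray(sync_ids))
    async_ids = np.unique(np.asarray(async_ids))
    common = np.intersect1d(sync_ids, async_ids)
    return {
        "n_sync": sync_ids.size,
        "n_async": async_ids.size,
        "n_common": common.size,
        "sync_is_subset": common.size == sync_ids.size,
    }

# Toy identifiers standing in for the source_id column of each result table
summary = compare_source_ids([11, 12, 13], [11, 12, 13, 14, 15])
print(summary)
```

If every synchronous identifier also appears in the asynchronous result, a truncated synchronous response becomes the more plausible explanation.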

Testing TAPVizieR

The VizieR ADQL service (http://tapvizier.u-strasbg.fr/), hosted by the CDS in Strasbourg, gives access to any VizieR table and catalog. It supports the same operations through ADQL.

Below I reproduce all examples using the TAPVizieR Service.

Important note: column and table names in VizieR are not identical to the Gaia Archive ones.
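As an illustration of what this renaming involves, here is a deliberately naive sketch that rewrites a Gaia Archive query into its VizieR form. The correspondences are inferred from the queries used in this notebook (table names, and `l`, `b` versus `glon`, `glat`); the function name is hypothetical and the mapping is far from complete.

```python
import re

# Correspondences inferred from the queries in this notebook; extend as needed.
TABLES = {
    "gaiadr1.tgas_source": '"I/337/tgas"',
    "gaiadr1.gaia_source": '"I/337/gaia"',
}
COLUMNS = {"l": "glon", "b": "glat"}

def to_vizier(query):
    """Naively rewrite Gaia Archive table/column names into VizieR ones."""
    for archive_name, vizier_name in TABLES.items():
        query = query.replace(archive_name, vizier_name)
    # Whole-word match only; this would also rename an alias called `l` or `b`.
    pattern = r"\b(" + "|".join(COLUMNS) + r")\b"
    return re.sub(pattern, lambda m: COLUMNS[m.group(1)], query)

converted = to_vizier("select gaia.l, gaia.b from gaiadr1.tgas_source as gaia")
print(converted)
```

A plain text substitution like this is fragile, so the converted query should always be read through before it is submitted.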

Testing interface



In [33]:

    
adql = QueryStr("""
select top 100
    gaia.source_id, gaia.hip,
    gaia.phot_g_mean_mag+5*log10(gaia.parallax)-10 as g_mag_abs_gaia,
    gaia.phot_g_mean_mag+5*log10(hip.plx)-10 as g_mag_abs_hip,
    hip."B-V"
from 
    "I/337/tgas" as gaia
inner join 
    "I/311/hip2" as hip
    on gaia.hip = hip.HIP
where
    gaia.parallax/gaia.parallax_error >= 5 
    and hip.Plx/hip.e_Plx >= 5 
    and hip."e_B-V" > 0.0 and hip."e_B-V" <= 0.05 
    and (2.5/log(10))*(gaia.phot_g_mean_flux_error/gaia.phot_g_mean_flux) <= 0.05
""")

vizier = TAPVizieR()
result = timeit(vizier.query)(adql)
result









    




ADQL query
SELECT top 100
    gaia.source_id, gaia.hip,
    gaia.phot_g_mean_mag+5*log10(gaia.parallax)-10 AS g_mag_abs_gaia,
    gaia.phot_g_mean_mag+5*log10(hip.plx)-10 AS g_mag_abs_hip,
    hip."B-V"
FROM
    "I/337/tgas" AS gaia
INNER JOIN
    "I/311/hip2" AS hip
    ON gaia.hip = hip.HIP
WHERE
    gaia.parallax/gaia.parallax_error >= 5
    AND hip.Plx/hip.e_Plx >= 5
    AND hip."e_B-V" > 0.0 AND hip."e_B-V" <= 0.05
    AND (2.5/log(10))*(gaia.phot_g_mean_flux_error/gaia.phot_g_mean_flux) <= 0.05








    




Execution time: 1.11 s







    Out[33]:




<Table masked=True length=100>

source_id [1] hip [1] g_mag_abs_gaia [1] g_mag_abs_hip [1] B-V [1]
mag mag mag
int64 int32 float64 float64 float64
4687774147220506496 6812 1.85137678654 1.90017597299 0.962
4687785073611638144 7142 4.8397158847 5.24915846664 0.739
4688027619006146944 7256 2.64061888525 2.63613237042 0.389
4688217112963284480 7814 3.65349275211 3.56609011335 0.59
4688292532592267264 1113 5.20891724857 5.3288419309 0.743
4688376164190784384 698 -0.361463990472 -0.367487496164 1.326
4688399700615057024 1953 3.91250851253 3.78855540529 0.594
4688654546792333952 879 4.05457755693 4.02124289603 0.551
4688756079821645056 1779 4.97370334258 5.1382801006 0.769
... ... ... ... ...
4698497409243027840 7973 1.81114679622 2.40900511892 0.578
4698554996164937344 8299 4.30182744113 4.48314912626 0.588
4698824204714644608 7954 2.59151919688 2.49679821869 0.956
4698950064436291200 8813 2.24864737395 2.7888000549 0.44
4699094340977420416 9471 4.03309380425 3.8980472268 0.669
4699102140637907456 9308 3.10007326217 3.0612059633 0.423
4699122653403365120 9686 2.99206336364 3.11125507232 0.418
4699595890078393088 10262 1.28232739002 1.56635251501 1.026
4699877674292611712 9953 0.536245541597 0.986647373954 1.252
4700068233401598208 10866 3.94266679476 3.28583057364 0.524
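The last condition of the WHERE clause is a cut on the G magnitude uncertainty. Since $m = -2.5 \log_{10} F + \mathrm{const}$, first-order error propagation gives $\sigma_m = (2.5/\ln 10)\,\sigma_F/F$; in ADQL `log` is the natural logarithm and `log10` the base-10 one. The sketch below mirrors the Gaia-side cuts locally with NumPy; the function names are hypothetical and the input values are toy numbers.

```python
import numpy as np

def g_mag_error(flux, flux_error):
    """First-order magnitude uncertainty from a flux and its error."""
    return 2.5 / np.log(10) * np.asarray(flux_error) / np.asarray(flux)

def passes_quality_cuts(parallax, parallax_error, flux, flux_error,
                        min_snr=5.0, max_mag_error=0.05):
    """Mirror of the Gaia-side cuts of the WHERE clause (same thresholds)."""
    snr_ok = np.asarray(parallax) / np.asarray(parallax_error) >= min_snr
    phot_ok = g_mag_error(flux, flux_error) <= max_mag_error
    return snr_ok & phot_ok

# Toy values, not taken from the catalogue
sigma_g = g_mag_error(100.0, 1.0)
mask = passes_quality_cuts([10.0, 2.0], [1.0, 1.0], [100.0, 100.0], [1.0, 1.0])
print(sigma_g, mask)
```

A 1% flux error thus corresponds to roughly 0.011 mag, so the 0.05 mag threshold allows relative flux errors of a few percent.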

TGAS Luminosity function



In [34]:

    
adql = QueryStr("""
select 
    count(*) as n, 
    round(gaia.phot_g_mean_mag, 1) as val
from 
    "I/337/tgas" as gaia
group by val
order by val
""")
data_tgas = timeit(vizier.query)(adql)

# Apparently some parsing issues in TAPVizieR...
n = [int(k) for k in data_tgas['n']]
vals = [float(k) for k in data_tgas['val']]
#
plt.step(vals, n, lw=2, where='pre', label='TGAS')
plt.yscale('log')
plt.xlabel('G magnitude')
plt.ylabel('counts / 0.1 mag')
figrc.hide_axis('top right'.split())









    




ADQL query
SELECT
    count(*) AS n,
    round(gaia.phot_g_mean_mag, 1) AS val
FROM
    "I/337/tgas" AS gaia
GROUP BY val
ORDER BY val








    




Execution time: 4.48 s
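The query builds the histogram on the server: rounding the magnitude to one decimal defines bins 0.1 mag wide, and `group by` counts the stars in each of them. A local equivalent can be sketched with NumPy as follows (toy magnitudes, hypothetical function name); it is only meant to show what the server computes and may differ from it in how exact ties are rounded.

```python
import numpy as np

def round_and_count(values, decimals=1):
    """Local equivalent of `count(*) ... group by round(x, decimals)`."""
    rounded = np.round(np.asarray(values, dtype=float), decimals)
    bins, counts = np.unique(rounded, return_counts=True)
    return bins, counts

# Toy magnitudes, not catalogue values
bins, counts = round_and_count([10.04, 10.06, 10.12, 11.0])
print(bins, counts)
```

Doing the binning in the query means that only one row per bin is transferred instead of one row per star.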

TGAS density map in l,b coordinates



In [35]:

    
adql = QueryStr("""
select 
    count(*) as n, 
    round(glon, 0) as x, 
    round(glat, 0) as y
from 
    "I/337/tgas"
group by x, y
order by x, y
""")
data = timeit(vizier.query)(adql)

from matplotlib.colors import LogNorm
dx = [float(k) for k in data['x']]
dy = [float(k) for k in data['y']]
dn = [int(k) for k in data['n']]
l = np.arange(0, 360, 1)
b = np.arange(-90, 90, 1)
n = np.zeros((len(l), len(b)))
ix = np.digitize(dx, l)
iy = np.digitize(dy, b)
n[ix - 1, iy - 1] = dn
plt.pcolormesh(l, b, n.T, 
               cmap=plt.cm.viridis, 
               norm=LogNorm())
figrc.hide_axis('top right'.split())
plt.xlim(l.min(), l.max())
plt.ylim(b.min(), b.max());









    




ADQL query
SELECT
    count(*) AS n,
    round(glon, 0) AS x,
    round(glat, 0) AS y
FROM
    "I/337/tgas"
GROUP BY x, y
ORDER BY x, y








    




Execution time: 22.5 s
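The query returns one row per occupied (x, y) cell, which then has to be scattered onto a dense grid before plotting. The sketch below is an alternative to the `np.digitize` approach of the cell above, not a copy of it: it indexes the grid directly with the rounded coordinates, folds a longitude of 360 degrees onto 0 degrees, and keeps a separate row for a latitude of +90 degrees. The function name is hypothetical.

```python
import numpy as np

def fill_lb_grid(lon, lat, counts):
    """Scatter (lon, lat, count) triplets onto a dense 1-degree grid.

    lon and lat are assumed to be already rounded to integer degrees,
    as returned by the query above.
    """
    grid = np.zeros((360, 181))
    ix = np.asarray(lon, dtype=float).astype(int) % 360  # 360 deg wraps onto 0 deg
    iy = np.asarray(lat, dtype=float).astype(int) + 90   # -90..90 -> 0..180
    np.add.at(grid, (ix, iy), counts)                    # accumulate repeated cells
    return grid

# Toy triplets, not catalogue values
grid = fill_lb_grid([0, 360, 10], [0, 0, -90], [1, 2, 5])
print(grid[0, 90], grid[10, 0], grid.sum())
```

`np.add.at` is used instead of plain indexed assignment so that two rows landing on the same cell (here 0 and 360 degrees) are summed rather than overwritten.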

M31 cone search



In [36]:

    
adql = QueryStr("""
select
    count(*) as n, 
    round(ra, 2) as latitude, 
    round(dec, 2) as longitude
from 
    "I/337/gaia"
where 
    contains(point('ICRS', ra, dec),
             circle('ICRS',10.6847083,41.26875,2) )=1  
group by latitude, longitude
order by latitude, longitude
""")
data = timeit(vizier.query)(adql)

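# NB: the query aliases are swapped: 'latitude' holds ra and 'longitude' holds dec,
# so the plot below shows ra on the x-axis and dec on the y-axis
lat = [float(k) for k in data['latitude']]
lon = [float(k) for k in data['longitude']]
n = [int(k) for k in data['n']]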

plt.figure(figsize=(6,6))
plt.subplot(111, aspect=1)
plt.scatter(lat, lon, c=n,
            edgecolor='None', s=6, rasterized=True, norm=LogNorm(),
            cmap=plt.cm.magma, marker='s'
           )
plt.xlim(min(lat), max(lat))
plt.ylim(min(lon), max(lon))
plt.axis('off');









    




ADQL query
SELECT
    count(*) AS n,
    round(ra, 2) AS latitude,
    round(dec, 2) AS longitude
FROM
    "I/337/gaia"
WHERE
    contains(point('ICRS', ra, dec),
             circle('ICRS',10.6847083,41.26875,2) )=1
GROUP BY latitude, longitude
ORDER BY latitude, longitude








    




Execution time: 4.51 s
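The condition `contains(point(...), circle(...)) = 1` selects the sources whose angular distance to the circle centre is at most the radius (here 2 degrees). As a rough local equivalent, the sketch below computes the great-circle separation with the haversine formula; the function names are hypothetical, the test positions are made up, and including the boundary with `<=` is a choice made here.

```python
import numpy as np

def angular_separation(ra1, dec1, ra2, dec2):
    """Great-circle separation in degrees (haversine formula); inputs in degrees."""
    ra1, dec1, ra2, dec2 = map(np.radians, (ra1, dec1, ra2, dec2))
    sin_ddec = np.sin((dec2 - dec1) / 2.0)
    sin_dra = np.sin((ra2 - ra1) / 2.0)
    a = sin_ddec ** 2 + np.cos(dec1) * np.cos(dec2) * sin_dra ** 2
    return np.degrees(2.0 * np.arcsin(np.sqrt(a)))

def in_cone(ra, dec, ra0, dec0, radius):
    """Boolean mask of the positions within `radius` degrees of (ra0, dec0)."""
    return angular_separation(ra, dec, ra0, dec0) <= radius

# Cone centre and radius from the query above; the test positions are made up
ra0, dec0, radius = 10.6847083, 41.26875, 2.0
ra = np.array([10.6847083, 10.6847083, 20.0])
dec = np.array([41.26875, 42.26875, 41.26875])
mask = in_cone(ra, dec, ra0, dec0, radius)
print(mask)
```

Such a mask can be useful to cut a table that has already been downloaded, without sending a new query.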

M33 cone search



In [37]:

    
adql = QueryStr("""
select 
    count(*) as n, 
    round(ra, 2) as latitude, 
    round(dec, 2) as longitude
from 
    gaiadr1.gaia_source 
where 
    contains(point('ICRS', ra, dec),
             circle('ICRS',23.4621,30.6599417,0.5) )=1  
group by latitude, longitude
order by latitude, longitude
""")
data = timeit(gaia.query)(adql)

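# NB: the query aliases are swapped: 'latitude' holds ra and 'longitude' holds dec,
# so the plot below shows ra on the x-axis and dec on the y-axis
lat = [float(k) for k in data['latitude']]
lon = [float(k) for k in data['longitude']]
n = [int(k) for k in data['n']]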

plt.figure(figsize=(6,6))
plt.subplot(111, aspect=1)
plt.scatter(lat, lon, c=n,
            edgecolor='None', s=9, rasterized=True, norm=LogNorm(),
            cmap=plt.cm.magma, marker='s'
           )
plt.xlim(min(lat), max(lat))
plt.ylim(min(lon), max(lon))
plt.axis('off');









    




ADQL query
SELECT
    count(*) AS n,
    round(ra, 2) AS latitude,
    round(dec, 2) AS longitude
FROM
    gaiadr1.gaia_source
WHERE
    contains(point('ICRS', ra, dec),
             circle('ICRS',23.4621,30.6599417,0.5) )=1
GROUP BY latitude, longitude
ORDER BY latitude, longitude








    




Execution time: 920 ms

TGAS CMD



In [38]:

    
adql = QueryStr("""
select
    count(*) as n,
    floor((hip."B-V") / 0.01) * 0.01 as color,
    floor((gaia.phot_g_mean_mag + 5*log10(gaia.parallax)-10) / 0.05) * 0.05 as mag
from 
    "I/337/tgas" as gaia
inner join 
    "I/311/hip2" as hip
    on gaia.hip = hip.HIP
where
    gaia.parallax/gaia.parallax_error >= 5 
    and hip."e_B-V" > 0.0 and hip."e_B-V" <= 0.05 
    and hip.Plx/hip.e_Plx >= 5 
    and (2.5/log(10))*(gaia.phot_g_mean_flux_error/gaia.phot_g_mean_flux) <= 0.05
group by color, mag
""")
data = timeit(vizier.query)(adql)
color = [float(k) for k in data['color']]
mag = [float(k) for k in data['mag']]
n = [int(k) for k in data['n']]
plt.scatter(color, mag, c=n, 
            edgecolor='None', s=2, rasterized=True, norm=LogNorm(),
            cmap=plt.cm.magma, marker='s'
           )
plt.xlim(min(color), max(color))
plt.ylim(max(mag), min(mag))
plt.xlabel('B-V (Hipparcos)')
plt.ylabel(r'G + 5 log($\varpi$) - 10')
figrc.hide_axis('top right'.split())









    




ADQL query
SELECT
    count(*) AS n,
    floor((hip."B-V") / 0.01) * 0.01 AS color,
    floor((gaia.phot_g_mean_mag + 5*log10(gaia.parallax)-10) / 0.05) * 0.05 AS mag
FROM
    "I/337/tgas" AS gaia
INNER JOIN
    "I/311/hip2" AS hip
    ON gaia.hip = hip.HIP
WHERE
    gaia.parallax/gaia.parallax_error >= 5
    AND hip."e_B-V" > 0.0 AND hip."e_B-V" <= 0.05
    AND hip.Plx/hip.e_Plx >= 5
    AND (2.5/log(10))*(gaia.phot_g_mean_flux_error/gaia.phot_g_mean_flux) <= 0.05
GROUP BY color, mag








    




Execution time: 9.46 s
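The CMD query above does three small pieces of arithmetic on the server: an absolute magnitude from G and the parallax in mas, a magnitude uncertainty from the relative flux error, and a floor-based binning followed by a count per cell. The following is a minimal NumPy sketch of the same arithmetic, run on made-up values rather than on the catalogue; the helper names `abs_mag`, `mag_error` and `bin_floor` are hypothetical and not part of the notebook libraries.

```python
import numpy as np

def abs_mag(g_mag, parallax_mas):
    """Absolute magnitude G + 5 log10(parallax) - 10, parallax in mas."""
    return g_mag + 5 * np.log10(parallax_mas) - 10

def mag_error(flux, flux_error):
    """First-order magnitude uncertainty: (2.5 / ln 10) * sigma_F / F."""
    return 2.5 / np.log(10) * flux_error / flux

def bin_floor(x, width):
    """Lower edge of the bin of the given width that contains x."""
    return np.floor(np.asarray(x, dtype=float) / width) * width

# Made-up colours and magnitudes, only to illustrate the GROUP BY step
color = np.array([0.501, 0.503, 0.625])
mag = np.array([4.01, 4.03, 5.22])
pairs = np.column_stack([bin_floor(color, 0.01), bin_floor(mag, 0.05)])
# Unique (color, mag) cells and the number of stars in each one
cells, n = np.unique(pairs, axis=0, return_counts=True)
```

Here `cells` and `n` play the role of the `color`, `mag` and `n` columns returned by the query.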

Another example

Based on Watkins et al. (2016).

Paper abstract: They perform a systematic search for Galactic globular cluster (GC) stars in the Tycho-Gaia Astrometric Solution (TGAS) catalog that formed part of Gaia Data Release 1 (DR1), and identify 5 members of NGC 104 (47 Tucanae), 1 member of NGC 5272 (M 3), 5 members of NGC 6121 (M 4), 7 members of NGC 6397, and 2 members of NGC 6656 (M 22).

By taking a weighted average of the member stars, fully accounting for the correlations between parameter estimates, they estimate the parallax (and, hence, distance) and proper motion (PM) of the GCs. This provides a homogeneous PM study of multiple GCs based on an astrometric catalogue with small and well-controlled systematic errors, and yields random PM errors that are similar to existing measurements. Detailed comparison to the available Hubble Space Telescope (HST) measurements generally shows excellent agreement, validating the astrometric quality of both TGAS and HST.

By contrast, comparison to ground-based measurements shows that some of those must have systematic errors exceeding the random errors. Their parallax estimates have uncertainties an order of magnitude larger than previous studies, but nevertheless imply distances consistent with previous estimates. By combining their PM measurements with literature positions, distances and radial velocities, they measure Galactocentric space motions for the clusters and find that these are also in good agreement with previous orbital analyses.

Their results highlight the future promise of Gaia for determining accurate distances and PMs of Galactic GCs, which will provide crucial constraints on the near end of the cosmic distance ladder, and provide accurate GC orbital histories.

Method: For each cluster, they calculate the likelihood that each nearby star is a cluster member by decomposing this likelihood into a positional part $L_{\alpha\delta,i}$ and a motion part $L_{\varpi\mu,i}$.

On the one hand, stars close to the cluster centre are more likely to be members than stars near the $2\,R_{tidal}$ boundary, so they calculate the likelihood of a star $i$ with coordinates $(\alpha_i, \delta_i)$ being a member of the GC with centre $(\alpha_{GC}, \delta_{GC})$ as

\begin{eqnarray} L_{\alpha\delta,i} &=& p(\alpha_i, \delta_i | \alpha_{GC}, \delta_{GC}, \sigma)\\ &=& \exp\left[-\frac{1}{2\sigma^2} \left((\alpha_i - \alpha_{GC})^2 + (\delta_i - \delta_{GC})^2\right) \right], \end{eqnarray}

where they use $\sigma = \frac{1}{2} R_{tidal}$ to account for the approximate extent of the cluster. The uncertainties on the cluster centre coordinates and on the positions of the stars are negligible compared to the extent of the cluster, so they neglect the measurement errors in this calculation.
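As a minimal sketch, the positional term can be written in a few lines of NumPy, working directly with its logarithm. This is a transcription of the formula above (plain coordinate differences, $\sigma = R_{tidal}/2$); the function name `lnl_position` and the example coordinates are hypothetical.

```python
import numpy as np

def lnl_position(ra, dec, ra_gc, dec_gc, r_tidal):
    """Positional log-likelihood ln L_{alpha delta}; all angles in degrees."""
    sigma = 0.5 * r_tidal  # sigma = R_tidal / 2, as stated in the text
    d2 = (np.asarray(ra, dtype=float) - ra_gc) ** 2 \
        + (np.asarray(dec, dtype=float) - dec_gc) ** 2
    return -0.5 * d2 / sigma ** 2

# Hypothetical star located one sigma away from a hypothetical centre
one_sigma = lnl_position(10.0, 20.5, 10.0, 20.0, r_tidal=1.0)
# A star exactly at the centre has ln L = 0, the maximum
at_centre = lnl_position(10.0, 20.0, 10.0, 20.0, r_tidal=1.0)
```

Working in log space avoids underflow for stars far from the centre and makes the later product of likelihoods a simple sum.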

On the other hand, for a star $i$ with parallax and PM measurements $m_i = (\varpi, \mu_\alpha, \mu_\delta)$, and covariance $C_i$, they ask what is the likelihood $L_{\varpi\mu,i}$ that this star is a member of a GC with measurements $m_{GC}$ and covariance $C_{GC}$,

\begin{eqnarray} L_{\varpi\mu,i} &=& p(m_i | C_i, m_{GC}, C_{GC})\\ &=& \frac{1}{ \left[(2\pi)^3 |\det(C_{i} + C_{GC})|\right]^{1/2}}\,\exp\left[-\frac{1}{2} (m_i - m_{GC})^T \cdot (C_{i} + C_{GC})^{-1} \cdot (m_i - m_{GC}) \right]. \end{eqnarray}

The above is a standard 3-dimensional Gaussian.
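A direct transcription of this Gaussian into NumPy could look like the sketch below. It works with the logarithm and uses `np.linalg.slogdet` and `np.linalg.solve` rather than an explicit inverse, which is a numerical convenience and not something the paper prescribes; the function name `lnl_motion` is hypothetical.

```python
import numpy as np

def lnl_motion(m_star, cov_star, m_gc, cov_gc):
    """ln of the Gaussian above, with total covariance C_i + C_GC."""
    cov = np.asarray(cov_star, dtype=float) + np.asarray(cov_gc, dtype=float)
    delta = np.asarray(m_star, dtype=float) - np.asarray(m_gc, dtype=float)
    # slogdet returns (sign, ln|det|); only ln|det| enters the normalisation
    _, logdet = np.linalg.slogdet(cov)
    # delta^T . cov^{-1} . delta without forming the inverse explicitly
    chi2 = delta @ np.linalg.solve(cov, delta)
    ndim = delta.size
    return -0.5 * (ndim * np.log(2 * np.pi) + logdet + chi2)

# Sanity check on a unit total covariance (made-up numbers)
half = 0.5 * np.eye(3)
lnl_peak = lnl_motion([0.0, 0.0, 0.0], half, [0.0, 0.0, 0.0], half)
lnl_off = lnl_motion([1.0, 0.0, 0.0], half, [0.0, 0.0, 0.0], half)
```

With a unit total covariance the peak value reduces to $-\frac{3}{2}\ln(2\pi)$, and moving one unit away lowers it by exactly 0.5.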

To construct the GC covariance matrix $C_{GC}$, they assume that the errors are uncorrelated, so the diagonal terms are the squared uncertainties on the parallax and PM measurements and the off-diagonal elements are zero. They further add the GC velocity dispersion (H96) in quadrature to the PM terms to account for the expected spread in velocities.

\begin{eqnarray} C_{GC} = \left[\begin{matrix} \sigma^2_\varpi & 0 & 0\\ 0 & \sigma^2_{\mu\alpha} + \sigma^2_v & 0 \\ 0 & 0 & \sigma^2_{\mu\delta} + \sigma^2_v \end{matrix}\right] \end{eqnarray}

Additionally (they do not explain this, but it is worth mentioning), the covariance matrix for the stars $C_i$ is given by

\begin{eqnarray} C_{i} = \left[\begin{matrix} \sigma^2_\varpi & \rho_{\varpi,\mu\alpha}\sigma_\varpi\sigma_{\mu\alpha} & \rho_{\varpi,\mu\delta}\sigma_\varpi\sigma_{\mu\delta}\\ \rho_{\varpi,\mu\alpha}\sigma_\varpi\sigma_{\mu\alpha} & \sigma^2_{\mu\alpha} & \rho_{\mu\alpha,\mu\delta}\sigma_{\mu\alpha}\sigma_{\mu\delta} \\ \rho_{\varpi,\mu\delta}\sigma_\varpi\sigma_{\mu\delta} & \rho_{\mu\alpha,\mu\delta}\sigma_{\mu\alpha}\sigma_{\mu\delta} & \sigma^2_{\mu\delta} \end{matrix}\right] \end{eqnarray}
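Both matrices are easy to assemble from catalogue-style inputs (three uncertainties and three correlation coefficients per star). The sketch below is one way to do it in NumPy; the function names are hypothetical, and it assumes that the velocity dispersion `s_v` has already been converted to the same units as the proper motion uncertainties (mas/yr).

```python
import numpy as np

def star_covariance(s_plx, s_pmra, s_pmdec, r_plx_pmra, r_plx_pmdec, r_pmra_pmdec):
    """C_i from the three uncertainties and the three correlation coefficients."""
    sig = np.array([s_plx, s_pmra, s_pmdec], dtype=float)
    corr = np.array([[1.0, r_plx_pmra, r_plx_pmdec],
                     [r_plx_pmra, 1.0, r_pmra_pmdec],
                     [r_plx_pmdec, r_pmra_pmdec, 1.0]])
    # Element (j, k) is rho_jk * sigma_j * sigma_k
    return corr * np.outer(sig, sig)

def cluster_covariance(s_plx, s_pmra, s_pmdec, s_v):
    """Diagonal C_GC; s_v is assumed to be in mas/yr, like the PM uncertainties."""
    return np.diag([s_plx ** 2, s_pmra ** 2 + s_v ** 2, s_pmdec ** 2 + s_v ** 2])

# Made-up star, and cluster uncertainties similar to those used for NGC 104 below
c_star = star_covariance(0.3, 1.0, 2.0, 0.5, -0.2, 0.1)
c_gc = cluster_covariance(0.0, 0.21, 0.29, 0.0)
```

The sum `c_star + c_gc` is the matrix that enters the motion likelihood.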

Finally, they compute the full likelihood as the product of the two pieces, $L_i = L_{\varpi\mu,i} \cdot L_{\alpha\delta,i}$ and they keep all stars with $\ln L_i > -11$ as possible cluster members.
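In log space the product becomes a sum, and the selection is a simple threshold. The following sketch uses made-up log-likelihood values, purely for illustration:

```python
import numpy as np

# Made-up log-likelihoods for four stars
lnl_pos = np.array([-0.2, -6.6, -1.9, -0.1])
lnl_mot = np.array([-2.8, -4.0, -12.0, -5.6])

lnl = lnl_pos + lnl_mot           # ln(L_pos * L_mot) = ln L_pos + ln L_mot
keep = lnl > -11                  # membership threshold quoted above
order = np.argsort(lnl)[::-1]     # indices sorted from most to least likely
candidates = order[keep[order]]   # candidate members, best first
```

Note that the reversal is applied to the result of `np.argsort`, not to its input: reversing the input would return indices into the reversed array.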

Cluster	$\alpha$	$\delta$	$c$	$R_{\rm core}$	$R_{\rm tidal}$	$D$	$\mu_\alpha$	$\mu_\delta$	$E$(B-V)	[Fe/H]	$v_{\rm r}$	$\sigma_{v}$
	(deg)	(deg)		(arcmin)	(arcmin)	(kpc)	(mas/yr)	(mas/yr)		(dex)	(km/s)	(km/s)
NGC 104	6.02	-72.08	2.07	0.360	42.296	4.5	5.63 $\pm$ 0.21	-2.73 $\pm$ 0.29	0.04	-0.72	-18.0 $\pm$ 0.1	11.0 $\pm$ 0.3
NGC 288	13.19	-26.58	0.99	1.350	13.193	8.9	4.67 $\pm$ 0.22	-5.60 $\pm$ 0.35	0.03	-1.32	-45.4 $\pm$ 0.2	2.9 $\pm$ 0.3
NGC 3201	154.40	-46.41	1.29	1.300	25.348	4.9	5.28 $\pm$ 0.32	-0.98 $\pm$ 0.33	0.24	-1.59	494.0 $\pm$ 0.2	5.0 $\pm$ 0.2
NGC 4372	186.44	-72.66	1.30	1.750	34.917	5.8	-6.49 $\pm$ 0.33	3.71 $\pm$ 0.32	0.39	-2.17	72.3 $\pm$ 1.2	$\dots$
NGC 4590	189.87	-26.74	1.41	0.580	14.908	10.3	-3.76 $\pm$ 0.66	1.79 $\pm$ 0.62	0.05	-2.23	-94.7 $\pm$ 0.2	2.5 $\pm$ 0.4
NGC 4833	194.89	-70.88	1.25	1.000	17.783	6.6	-8.11 $\pm$ 0.35	-0.96 $\pm$ 0.34	0.32	-1.85	200.2 $\pm$ 1.2	$\dots$

A bit of code

First, one can certainly code the position likelihood in ADQL, which also lets us filter out, on the server, stars that are too far away to be member candidates.

Second, one could implement the other likelihood as well, but that requires coding the dot product and the matrix inversion by hand. This is feasible because we only manipulate 3x3 matrices. But let's not complicate our task: we will download some data anyway. We can, however, compute as much as possible on the server side.
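Coding the determinant and the inverse by hand amounts to writing out the cofactor expansion of a symmetric 3x3 matrix. Here is a minimal sketch, using the element names a to f for the matrix [[a, b, c], [b, d, e], [c, e, f]] and checked against NumPy; the function names `sym3_det` and `sym3_inv` are hypothetical.

```python
import numpy as np

def sym3_det(a, b, c, d, e, f):
    """Determinant of [[a, b, c], [b, d, e], [c, e, f]] by cofactor expansion."""
    return a * (d * f - e * e) - b * (b * f - c * e) + c * (b * e - c * d)

def sym3_inv(a, b, c, d, e, f):
    """Inverse of the same matrix as adjugate / determinant."""
    det = sym3_det(a, b, c, d, e, f)
    # The adjugate of a symmetric matrix is symmetric: six distinct cofactors
    adj = np.array([[d * f - e * e, c * e - b * f, b * e - c * d],
                    [c * e - b * f, a * f - c * c, b * c - a * e],
                    [b * e - c * d, b * c - a * e, a * d - b * b]])
    return adj / det

# Made-up, well-conditioned example to compare with the numerical routines
vals = (2.0, 0.3, -0.1, 1.5, 0.2, 1.0)
a, b, c, d, e, f = vals
matrix = np.array([[a, b, c], [b, d, e], [c, e, f]])
det_ok = np.isclose(sym3_det(*vals), np.linalg.det(matrix))
inv_ok = np.allclose(sym3_inv(*vals), np.linalg.inv(matrix))
```

Only arithmetic operators are involved, so the same expressions can be written in ADQL on columns named `a` to `f`.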

The query below shows, for instance, how to compute the elements of the covariance matrix and its determinant on top of the rest.



In [27]:

    
from tap import (GaiaArchive, QueryStr, timeit)

def get_tgas_stars(center_ra, center_dec, Rtidal, parallax, parallaxerr, 
                   mualpha, mualphaerr, mudelta, mudeltaerr, s_v):
    """ (sync)Query the database for a particular position and cluster properties. 
    
    Parameters
    ----------
    center_ra: float
        RA of the cluster center
    center_dec: float
        Dec of the cluster center
    Rtidal: float
        tidal radius of the cluster (in degrees)
    parallax: float
        mean parallax of the cluster in mas (1 mas <-> 1kpc)
    parallaxerr: float
        uncertainty on the cluster parallax
    mualpha: float
        mean proper motion of the cluster along RA (in mas/yr)
    mualphaerr: float
        mean proper motion uncertainty of the cluster along RA (in mas/yr)
    mudelta: float
        mean proper motion of the cluster along Dec (in mas/yr)
    mudeltaerr: float
        mean proper motion uncertainty of the cluster along Dec (in mas/yr)
    s_v: float
        internal velocity dispersion
    
    Returns
    -------
    data: Table
        entries from the query
    """
    adql = QueryStr("""
    select *, 
        q.a * (q.d * q.f - q.e * q.e) - q.b * (q.b * q.f - q.c * q.e) + q.c * (q.b * q.e - q.c * q.d) as det_pm_cov
    from (
        select 
            gaia.source_id, gaia.ra, gaia.dec, gaia.parallax, gaia.pmra, gaia.pmdec,
            gaia.phot_g_mean_mag as G_mag, tycho2.bt_mag, tycho2.vt_mag, parallax_error,
            pmra_error, pmdec_error, pmra_pmdec_corr, parallax_pmra_corr, parallax_pmdec_corr,
            (-0.5 / power({Rtidal:f},2) * (power({center_ra:+f} - gaia.ra , 2) + power({center_dec:+f}-gaia.dec, 2))) as lnl_alpha_delta,
            (-0.5 / power({Rtidal:f},2) * distance(point('icrs', gaia.ra, gaia.dec), point('icrs', {center_ra:+f}, {center_dec:+f}))) as lnl_alpha_delta2,
            power({s_gcparallax:f}, 2) + power(gaia.parallax_error, 2) as a,
            gaia.parallax_pmra_corr * gaia.parallax_error * gaia.pmra_error as b,
            gaia.parallax_pmdec_corr * gaia.parallax_error * gaia.pmdec_error as c,
            power({s_gcmualpha:f},2) + power({s_gcv:f}, 2) + power(gaia.pmra_error, 2) as d,
            gaia.pmra_pmdec_corr * gaia.pmra_error * gaia.pmdec_error as e,
            power({s_gcmudelta:f},2) + power({s_gcv:f}, 2) + power(gaia.pmdec_error, 2) as f,
            (gaia.parallax - {gc_parallax:f}) as delta_parallax,
            (gaia.pmra + (-1) * {gc_pmra:f}) as delta_pmalpha,
            (gaia.pmdec + (-1) * {gc_pmdec:f}) as delta_pmdec
        from 
            gaiadr1.tgas_source as gaia
        inner join 
            public.tycho2 as tycho2
            on gaia.tycho2_id = tycho2.id
        where 
            contains(point('ICRS', gaia.ra, gaia.dec),
                     circle('ICRS',{center_ra:f}, {center_dec:f}, {size:f}) )=1 
        ) as q
""".format(center_ra=center_ra, center_dec=center_dec, Rtidal=Rtidal, size=3 * Rtidal,
           gc_parallax=parallax, gc_pmra=mualpha, gc_pmdec=mudelta, 
           s_gcparallax=parallaxerr, s_gcmualpha=mualphaerr, s_gcmudelta=mudeltaerr, s_gcv=s_v
           ))
    gaia = GaiaArchive()
    return timeit(gaia.query)(adql)

def get_gaia_density(center_ra, center_dec, size):
    """ Query the database and produce a density map from the full DR1 catalog """
    adql = QueryStr("""
select 
    count(*) as n, 
    round(ra, 2) as latitude, 
    round(dec, 2) as longitude
from 
    gaiadr1.gaia_source as gaia
where 
    contains(point('ICRS', gaia.ra, gaia.dec),
             circle('ICRS',{0:f}, {1:f}, {2:f}) )=1  
group by latitude, longitude
order by latitude, longitude
""".format(center_ra, center_dec, size))
    gaia = GaiaArchive()
    return timeit(gaia.query)(adql)

Below I code the second likelihood in Python for convenience.



In [28]:

    
def add_lnl_mu(recs):
    """ Adding the motion likelihood and final likelihood values """
    lnl_mu = np.zeros(len(recs), dtype=float)
    for k, data in enumerate(recs):
        a = data['a'] 
        b = data['b']
        c = data['c']
        d = data['d']
        e = data['e']
        f = data['f']
      
        cov = np.array(((a, b, c),
                        (b, d, e),
                        (c, e, f)))
        invcov = np.linalg.inv(cov)
        det_cov = np.linalg.det(2 * np.pi * invcov)
        m = np.array((data['delta_parallax'], data['delta_pmalpha'], data['delta_pmdec']))
        lnl_mu[k] = - 0.5 * np.log(abs(det_cov)) - 0.5 * m.T @ (invcov @ m)
    from astropy.table import Column
    # Drop columns left over from a previous call so the function can be re-run
    for name in ('lnl_mu', 'lnl', 'lnl2'):
        if name in recs.colnames:
            recs.remove_column(name)
    recs.add_column(Column(lnl_mu, name='lnl_mu'))
    recs.add_column(Column(recs['lnl_alpha_delta'] + lnl_mu, name='lnl'))
    recs.add_column(Column(recs['lnl_alpha_delta2'] + lnl_mu, name='lnl2'))
    # Sort by decreasing total log-likelihood
    return recs[np.argsort(recs['lnl'])[::-1]]

We also make some figures.



In [29]:

    
def fig_plot(data, ngc104, **kwargs):
    """ Plot some figures """
    from matplotlib.colors import LogNorm
    lat = [float(k) for k in data['latitude']]
    lon = [float(k) for k in data['longitude']]
    n = [int(k) for k in data['n']]
    
    members = kwargs.pop("members", None)

    plt.figure(figsize=(10,4))
    plt.subplot(121)
    plt.scatter(lat, lon, c=n,
                edgecolor='None', s=12, rasterized=True, norm=LogNorm(),
                cmap=plt.cm.magma, marker='s', alpha=0.4
               )
    plt.plot(ngc104['ra'], ngc104['dec'], 'o', mfc='b', mec='0.8', mew=2)
    if members is not None:
        plt.plot(members['ra'], members['dec'], 'o', mfc='r', mec='0.8', mew=2)
    plt.xlim(min(lat), max(lat))
    plt.ylim(min(lon), max(lon))
    plt.axis('off');

    plt.subplot(122)
    plt.plot(ngc104['bt_mag']-ngc104['vt_mag'], ngc104['vt_mag'], 'o')
    if members is not None:
        plt.plot(members['bt_mag']-members['vt_mag'], members['vt_mag'], 'ro')
    plt.ylim(plt.ylim()[::-1])
    plt.xlabel('B$_T$ - V$_T$')
    plt.ylabel('V$_T$')
    figrc.hide_axis('top right'.split())
    plt.tight_layout()

    plt.figure(figsize=(10, 5))
    ax = plt.subplot(221)
    plt.plot(ngc104['pmra'], ngc104['parallax'], 'o', **kwargs)
    if members is not None:
        plt.plot(members['pmra'], members['parallax'], 'ro', **kwargs)
    plt.ylabel(r'$\varpi$ [mas]')
    figrc.hide_axis('top right'.split())

    ax = plt.subplot(223, sharex=ax)
    plt.plot(ngc104['pmra'], ngc104['pmdec'], 'o', **kwargs)
    if members is not None:
        plt.plot(members['pmra'], members['pmdec'], 'ro', **kwargs)
    plt.xlabel(r'$\mu_\alpha$ [mas/yr]')
    plt.ylabel(r'$\mu_\delta$ [mas/yr]')
    figrc.hide_axis('top right'.split())

    ax = plt.subplot(224, sharey=ax)
    plt.plot(ngc104['parallax'], ngc104['pmdec'], 'o', **kwargs)
    if members is not None:
        plt.plot(members['parallax'], members['pmdec'], 'ro', **kwargs)
    plt.xlabel(r'$\varpi$ [mas]')
    figrc.hide_axis('top right'.split())

    plt.tight_layout(h_pad=0, w_pad=0)

Let's look at NGC 104, a.k.a. 47 Tuc

We can use the values from the referenced paper to define our cluster properties and run our query.



In [30]:

    
# 1 milliarcsec = 1kpc
ra, dec, Rtidal, parallax, parallaxerr, mualpha, mualphaerr, mudelta, mudeltaerr, s_v = (
    6.02, -72.08, 0.360, 1. / 4.5, 0, 5.63,  0.21, -2.73, 0.29, 0.0
)
data = get_gaia_density(ra, dec, 3 * Rtidal)
ngc104 = get_tgas_stars(ra, dec, Rtidal, parallax, parallaxerr, mualpha, mualphaerr, mudelta, mudeltaerr, s_v)









    




ADQL query
SELECT
    count(*) AS n,
    round(ra, 2) AS latitude,
    round(dec, 2) AS longitude
FROM
    gaiadr1.gaia_source AS gaia
WHERE
    contains(point('ICRS', gaia.ra, gaia.dec),
             circle('ICRS',6.020000, -72.080000, 1.080000) )=1
GROUP BY latitude, longitude
ORDER BY latitude, longitude








    




Execution time: 4.44 s







    




ADQL query
SELECT *,
        q.a * (q.d * q.f - q.e * q.e) - q.b * (q.b * q.f - q.c * q.e) + q.c * (q.b * q.e - q.c * q.d) AS det_pm_cov
    FROM (
        SELECT
            gaia.source_id, gaia.ra, gaia.dec, gaia.parallax, gaia.pmra, gaia.pmdec,
            gaia.phot_g_mean_mag AS G_mag, tycho2.bt_mag, tycho2.vt_mag, parallax_error,
            pmra_error, pmdec_error, pmra_pmdec_corr, parallax_pmra_corr, parallax_pmdec_corr,
            (-0.5 / power(0.360000,2) * (power(+6.020000 - gaia.ra , 2) + power(-72.080000-gaia.dec, 2))) AS lnl_alpha_delta,
            (-0.5 / power(0.360000,2) * distance(point('icrs', gaia.ra, gaia.dec), point('icrs', +6.020000, -72.080000))) AS lnl_alpha_delta2,
            power(0.000000, 2) + power(gaia.parallax_error, 2) AS a,
            gaia.parallax_pmra_corr * gaia.parallax_error * gaia.pmra_error AS b,
            gaia.parallax_pmdec_corr * gaia.parallax_error * gaia.pmdec_error AS c,
            power(0.210000,2) + power(0.000000, 2) + power(gaia.pmra_error, 2) AS d,
            gaia.pmra_pmdec_corr * gaia.pmra_error * gaia.pmdec_error AS e,
            power(0.290000,2) + power(0.000000, 2) + power(gaia.pmdec_error, 2) AS f,
            (gaia.parallax - 0.222222) AS delta_parallax,
            (gaia.pmra + (-1) * 5.630000) AS delta_pmalpha,
            (gaia.pmdec + (-1) * -2.730000) AS delta_pmdec
        FROM
            gaiadr1.tgas_source AS gaia
        INNER JOIN
            public.tycho2 AS tycho2
            ON gaia.tycho2_id = tycho2.id
        WHERE
            contains(point('ICRS', gaia.ra, gaia.dec),
                     circle('ICRS',6.020000, -72.080000, 1.080000) )=1
        ) AS q








    




Execution time: 301 ms

We also add the second likelihood and the final one. Finally, we filter the stars to keep those with lnl > -11.



In [31]:

    
result = add_lnl_mu(ngc104)
fields = ['source_id', 'ra', 'dec', 'parallax', 'lnl_alpha_delta', 'lnl_alpha_delta2', 'lnl_mu', 'lnl', 'lnl2']
members = result[result['lnl'] > -11]
members.sort('lnl')
members[fields][::-1]









    Out[31]:




<Table masked=True length=8>

source_id ra dec parallax lnl_alpha_delta lnl_alpha_delta2 lnl_mu lnl lnl2
Angle[deg] Angle[deg] Angle[mas] 
int64 float64 float64 float64 float64 float64 float64 float64 float64
4689644416501132800 6.2170101137593949 -71.936456942381554 0.37507341579041814 -0.22923454596428225 -0.60150036729073375 -2.7988054087 -3.02803995467 -3.40030577599
4689638437899435136 5.5745638831857649 -72.103433200803295 0.58021017218719462 -0.76760204113592645 -0.53610627647358955 -4.52626678339 -5.29386882452 -5.06237305986
4689645000616682240 6.0141150444826863 -71.929822389578547 0.7569972004848331 -0.087144858693433566 -0.57943138406236783 -5.61636849493 -5.70351335362 -6.19579987899
4689620330317403136 5.9132796741373417 -72.277756673829472 0.27418891704465148 -0.19481840276361445 -0.77328553777998654 -5.85211566785 -6.04693407061 -6.62540120563
4689832845301844352 6.0936958914206301 -71.891298383615776 1.0583841942748684 -0.15833095848109333 -0.73330595273316934 -6.13607645112 -6.2944074096 -6.86938240385
4689623594492482176 6.2653565131119864 -72.15881251500781 0.18046400711992883 -0.25621616916793633 -0.42062056968602829 -6.19642464456 -6.45264081373 -6.61704521425
4689595831823970304 5.4133921110160106 -72.414062612237686 0.67526452113793323 -1.8501966044470644 -1.4731690619749811 -4.82298628575 -6.6731828902 -6.29615534773
4690024022888359424 7.304409361221766 -71.815410192959902 0.1092560087723143 -6.6347036002454525 -1.8438481503016266 -3.9692286786 -10.6039322788 -5.8130768289

Interestingly, Watkins et al. claim only 5 candidates for this cluster, whereas we obtain 8. It would be interesting to understand the difference.

They also find the stars with source_id:

4689620330317403136
4690024022888359424
4689638437899435136
4689645000616682240
4689595831823970304

but do not find

4689644416501132800
4689832845301844352
4689623594492482176

The following figure shows the DR1 density, with the TGAS stars overplotted in blue and the member candidates in red.



In [32]:

    
fig_plot(data, ngc104, members=members, ms=4, alpha=0.5)

Correcting the distance-based likelihood



In [33]:

    
members = result[result['lnl2'] > -11]
members.sort('lnl2')
members[fields][::-1]









    Out[33]:




<Table masked=True length=8>

source_id ra dec parallax lnl_alpha_delta lnl_alpha_delta2 lnl_mu lnl lnl2
Angle[deg] Angle[deg] Angle[mas] 
int64 float64 float64 float64 float64 float64 float64 float64 float64
4689644416501132800 6.2170101137593949 -71.936456942381554 0.37507341579041814 -0.22923454596428225 -0.60150036729073375 -2.7988054087 -3.02803995467 -3.40030577599
4689638437899435136 5.5745638831857649 -72.103433200803295 0.58021017218719462 -0.76760204113592645 -0.53610627647358955 -4.52626678339 -5.29386882452 -5.06237305986
4690024022888359424 7.304409361221766 -71.815410192959902 0.1092560087723143 -6.6347036002454525 -1.8438481503016266 -3.9692286786 -10.6039322788 -5.8130768289
4689645000616682240 6.0141150444826863 -71.929822389578547 0.7569972004848331 -0.087144858693433566 -0.57943138406236783 -5.61636849493 -5.70351335362 -6.19579987899
4689595831823970304 5.4133921110160106 -72.414062612237686 0.67526452113793323 -1.8501966044470644 -1.4731690619749811 -4.82298628575 -6.6731828902 -6.29615534773
4689623594492482176 6.2653565131119864 -72.15881251500781 0.18046400711992883 -0.25621616916793633 -0.42062056968602829 -6.19642464456 -6.45264081373 -6.61704521425
4689620330317403136 5.9132796741373417 -72.277756673829472 0.27418891704465148 -0.19481840276361445 -0.77328553777998654 -5.85211566785 -6.04693407061 -6.62540120563
4689832845301844352 6.0936958914206301 -71.891298383615776 1.0583841942748684 -0.15833095848109333 -0.73330595273316934 -6.13607645112 -6.2944074096 -6.86938240385



In [34]:

    
fig_plot(data, ngc104, members=members, ms=4, alpha=0.5)

The likelihood values based on the spherical distance differ significantly from those obtained with their definition. However, for this particular cluster, correcting the positional likelihood does not change the list of candidate members.
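At a declination of about -72 degrees, a difference in right ascension corresponds to a much smaller angle on the sky than the same difference in declination, which is why the two separations disagree. The sketch below compares the plain coordinate separation used in the paper's formula with a great-circle separation from the haversine formula, for the first candidate of the table above (coordinates rounded); the function names are hypothetical.

```python
import numpy as np

def flat_separation(ra1, dec1, ra2, dec2):
    """Separation from plain coordinate differences, in degrees."""
    return np.hypot(ra1 - ra2, dec1 - dec2)

def great_circle_separation(ra1, dec1, ra2, dec2):
    """Great-circle separation in degrees (haversine formula)."""
    ra1, dec1, ra2, dec2 = np.radians([ra1, dec1, ra2, dec2])
    h = (np.sin((dec2 - dec1) / 2) ** 2
         + np.cos(dec1) * np.cos(dec2) * np.sin((ra2 - ra1) / 2) ** 2)
    return np.degrees(2 * np.arcsin(np.sqrt(h)))

# Cluster centre used above and the first candidate of the table
ra_gc, dec_gc = 6.02, -72.08
ra_s, dec_s = 6.2170101, -71.9364569

sep_flat = flat_separation(ra_s, dec_s, ra_gc, dec_gc)
sep_sky = great_circle_separation(ra_s, dec_s, ra_gc, dec_gc)
```

For this star the flat separation is roughly 0.24 degrees while the great-circle separation is roughly 0.16 degrees; on the equator the two agree.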

A little bit of crazy: doing it all in a single ADQL query

Below I add the calculation of the second likelihood directly in ADQL. Why? Honestly, just for the fun of it...

As you can see, this is (i) tedious, even for a simple 3x3 matrix, and (ii) not very readable. Of course, it gives the same results, with lnl computed directly, so one can filter on it right away.

But let's be honest, it's faster to download a bit more and finish the calculations locally.

This example mostly shows that one can compute complex derived quantities and, if necessary, embed one query into another for more involved calculations.
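The nesting pattern itself is easier to see on a much smaller case: an inner query defines a derived column, and the outer query filters on it. The sketch below only builds the query string; the function name `nested_snr_query` and the signal-to-noise cut are hypothetical, while the table and column names are the ones already used in this notebook.

```python
def nested_snr_query(center_ra, center_dec, radius, min_snr):
    """Build a two-level ADQL query: derive a column inside, filter on it outside."""
    inner = """
        select
            gaia.source_id,
            gaia.parallax / gaia.parallax_error as plx_snr
        from
            gaiadr1.tgas_source as gaia
        where
            contains(point('ICRS', gaia.ra, gaia.dec),
                     circle('ICRS', {ra:f}, {dec:f}, {radius:f})) = 1
    """.format(ra=center_ra, dec=center_dec, radius=radius)
    # The alias q makes the derived column visible to the outer query
    return """
    select q.source_id, q.plx_snr
    from ({inner}) as q
    where q.plx_snr >= {min_snr:f}
    """.format(inner=inner, min_snr=min_snr)

adql_text = nested_snr_query(6.02, -72.08, 1.08, 5.0)
```

The resulting string could then be wrapped in `QueryStr` and sent with `gaia.query`, as in the other cells.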



In [45]:

    
def get_tgas_stars_full(center_ra, center_dec, Rtidal, parallax, parallaxerr, 
                        mualpha, mualphaerr, mudelta, mudeltaerr, s_v):
    """ (sync)Query the database for a particular position and cluster properties. 
    
    Parameters
    ----------
    center_ra: float
        RA of the cluster center
    center_dec: float
        Dec of the cluster center
    Rtidal: float
        tidal radius of the cluster (in degrees)
    parallax: float
        mean parallax of the cluster in mas (1 mas <-> 1kpc)
    parallaxerr: float
        uncertainty on the cluster parallax
    mualpha: float
        mean proper motion of the cluster along RA (in mas/yr)
    mualphaerr: float
        mean proper motion uncertainty of the cluster along RA (in mas/yr)
    mudelta: float
        mean proper motion of the cluster along Dec (in mas/yr)
    mudeltaerr: float
        mean proper motion uncertainty of the cluster along Dec (in mas/yr)
    s_v: float
        internal velocity dispersion
    
    Returns
    -------
    data: Table
        entries from the query
    """
    adql = QueryStr("""
    select 
        r.source_id, r.ra, r.dec, r.parallax, r.pmra, r.pmdec,
        r.G_mag, r.bt_mag, r.vt_mag, r.parallax_error,
        r.pmra_error, r.pmdec_error, r.pmra_pmdec_corr, 
        r.parallax_pmra_corr, r.parallax_pmdec_corr,
        r.lnl_alpha_delta, r.lnl_alpha_delta2, r.lnl_mu,
        (- 0.5 * log(abs(r.det_pm_cov)) + r.lnl_mu + r.lnl_alpha_delta) as lnl,
        (- 0.5 * log(abs(r.det_pm_cov)) + r.lnl_mu + r.lnl_alpha_delta2) as lnl2
    from (
        select *, 
            q.a * (q.d * q.f - q.e * q.e) - q.b * (q.b * q.f - q.c * q.e) + q.c * (q.b * q.e - q.c * q.d) as det_pm_cov,
            -0.5 * (delta_parallax*(delta_parallax*(-(q.a*q.d - power(q.b, 2))*(-q.b*(q.e - q.b*q.c/q.a)/(q.a*(q.d - power(q.b, 2)/q.a)) + q.c/q.a)*(q.b*(q.e - q.b*q.c/q.a)/(q.a*(q.d - power(q.b, 2)/q.a)) - q.c/q.a)/(q.a*q.d*q.f - q.a*power(q.e, 2) - power(q.b, 2)*q.f + 2*q.b*q.c*q.e - power(q.c, 2)*q.d) + 1.0/q.a + power(q.b, 2)/(power(q.a, 2)*(q.d - power(q.b, 2)/q.a))) + delta_pmalpha*((q.e - q.b*q.c/q.a)*(q.a*q.d - power(q.b, 2))*(-q.b*(q.e - q.b*q.c/q.a)/(q.a*(q.d - power(q.b, 2)/q.a)) + q.c/q.a)/((q.d - power(q.b, 2)/q.a)*(q.a*q.d*q.f - q.a*power(q.e, 2) - power(q.b, 2)*q.f + 2*q.b*q.c*q.e - power(q.c, 2)*q.d)) - q.b/(q.a*(q.d - power(q.b, 2)/q.a))) - delta_pmdec*(q.a*q.d - power(q.b, 2))*(-q.b*(q.e - q.b*q.c/q.a)/(q.a*(q.d - power(q.b, 2)/q.a)) + q.c/q.a)/(q.a*q.d*q.f - q.a*power(q.e, 2) - power(q.b, 2)*q.f + 2*q.b*q.c*q.e - power(q.c, 2)*q.d)) + delta_pmalpha*(delta_parallax*(-(q.e - q.b*q.c/q.a)*(q.a*q.d - power(q.b, 2))*(q.b*(q.e - q.b*q.c/q.a)/(q.a*(q.d - power(q.b, 2)/q.a)) - q.c/q.a)/((q.d - power(q.b, 2)/q.a)*(q.a*q.d*q.f - q.a*power(q.e, 2) - power(q.b, 2)*q.f + 2*q.b*q.c*q.e - power(q.c, 2)*q.d)) - q.b/(q.a*(q.d - power(q.b, 2)/q.a))) + delta_pmalpha*(1.0/(q.d - power(q.b, 2)/q.a) + power(q.e - q.b*q.c/q.a, 2)*(q.a*q.d - power(q.b, 2))/(power(q.d - power(q.b, 2)/q.a, 2)*(q.a*q.d*q.f - q.a*power(q.e, 2) - power(q.b, 2)*q.f + 2*q.b*q.c*q.e - power(q.c, 2)*q.d))) - delta_pmdec*(q.e - q.b*q.c/q.a)*(q.a*q.d - power(q.b, 2))/((q.d - power(q.b, 2)/q.a)*(q.a*q.d*q.f - q.a*power(q.e, 2) - power(q.b, 2)*q.f + 2*q.b*q.c*q.e - power(q.c, 2)*q.d))) + delta_pmdec*(delta_parallax*(q.a*q.d - power(q.b, 2))*(q.b*(q.e - q.b*q.c/q.a)/(q.a*(q.d - power(q.b, 2)/q.a)) - q.c/q.a)/(q.a*q.d*q.f - q.a*power(q.e, 2) - power(q.b, 2)*q.f + 2*q.b*q.c*q.e - power(q.c, 2)*q.d) - delta_pmalpha*(q.e - q.b*q.c/q.a)*(q.a*q.d - power(q.b, 2))/((q.d - power(q.b, 2)/q.a)*(q.a*q.d*q.f - q.a*power(q.e, 2) - power(q.b, 2)*q.f + 2*q.b*q.c*q.e - power(q.c, 2)*q.d)) + delta_pmdec*(q.a*q.d - 
power(q.b, 2))/(q.a*q.d*q.f - q.a*power(q.e, 2) - power(q.b, 2)*q.f + 2*q.b*q.c*q.e - power(q.c, 2)*q.d))) as lnl_mu
        from (
            select 
                gaia.source_id, gaia.ra, gaia.dec, gaia.parallax, gaia.pmra, gaia.pmdec,
                gaia.phot_g_mean_mag as G_mag, tycho2.bt_mag, tycho2.vt_mag, parallax_error,
                pmra_error, pmdec_error, pmra_pmdec_corr, parallax_pmra_corr, parallax_pmdec_corr,
                (-0.5 / power({Rtidal:f},2) * (power({center_ra:+f} - gaia.ra , 2) + power({center_dec:+f}-gaia.dec, 2))) as lnl_alpha_delta,
                (-0.5 / power({Rtidal:f},2) * distance(point('icrs', gaia.ra, gaia.dec), point('icrs', {center_ra:+f}, {center_dec:+f}))) as lnl_alpha_delta2,
                power({s_gcparallax:f}, 2) + power(gaia.parallax_error, 2) as a,
                gaia.parallax_pmra_corr * gaia.parallax_error * gaia.pmra_error as b,
                gaia.parallax_pmdec_corr * gaia.parallax_error * gaia.pmdec_error as c,
                power({s_gcmualpha:f},2) + power({s_gcv:f}, 2) + power(gaia.pmra_error, 2) as d,
                gaia.pmra_pmdec_corr * gaia.pmra_error * gaia.pmdec_error as e,
                power({s_gcmudelta:f},2) + power({s_gcv:f}, 2) + power(gaia.pmdec_error, 2) as f,
                (gaia.parallax - {gc_parallax:f}) as delta_parallax,
                (gaia.pmra + (-1) * {gc_pmra:f}) as delta_pmalpha,
                (gaia.pmdec + (-1) * {gc_pmdec:f}) as delta_pmdec
            from 
                gaiadr1.tgas_source as gaia
            left outer join
                public.tycho2 as tycho2
                on gaia.tycho2_id = tycho2.id
            where 
                contains(point('ICRS', gaia.ra, gaia.dec),
                         circle('ICRS',{center_ra:f}, {center_dec:f}, {size:f}) )=1 
            ) as q
        ) as r
""".format(center_ra=center_ra, center_dec=center_dec, Rtidal=Rtidal, size=3 * Rtidal,
           gc_parallax=parallax, gc_pmra=mualpha, gc_pmdec=mudelta, 
           s_gcparallax=parallaxerr, s_gcmualpha=mualphaerr, s_gcmudelta=mudeltaerr, s_gcv=s_v
           ))
    gaia = GaiaArchive()
    return timeit(gaia.query)(adql)



In [46]:

    
# 1 milliarcsec = 1kpc
ra, dec, Rtidal, parallax, parallaxerr, mualpha, mualphaerr, mudelta, mudeltaerr, s_v = (
    6.02, -72.08, 0.360, 1. / 4.5, 0, 5.63,  0.21, -2.73, 0.29, 0.0
)
ngc104 = get_tgas_stars_full(ra, dec, Rtidal, parallax, parallaxerr, 
                             mualpha, mualphaerr, mudelta, mudeltaerr, 
                             s_v)









    




ADQL query
SELECT
        r.source_id, r.ra, r.dec, r.parallax, r.pmra, r.pmdec,
        r.G_mag, r.bt_mag, r.vt_mag, r.parallax_error,
        r.pmra_error, r.pmdec_error, r.pmra_pmdec_corr,
        r.parallax_pmra_corr, r.parallax_pmdec_corr,
        r.lnl_alpha_delta, r.lnl_alpha_delta2, r.lnl_mu,
        (- 0.5 * log(abs(r.det_pm_cov)) + r.lnl_mu + r.lnl_alpha_delta) AS lnl,
        (- 0.5 * log(abs(r.det_pm_cov)) + r.lnl_mu + r.lnl_alpha_delta2) AS lnl2
    FROM (
        SELECT *,
            q.a * (q.d * q.f - q.e * q.e) - q.b * (q.b * q.f - q.c * q.e) + q.c * (q.b * q.e - q.c * q.d) AS det_pm_cov,
            -0.5 * (delta_parallax*(delta_parallax*(-(q.a*q.d - power(q.b, 2))*(-q.b*(q.e - q.b*q.c/q.a)/(q.a*(q.d - power(q.b, 2)/q.a)) + q.c/q.a)*(q.b*(q.e - q.b*q.c/q.a)/(q.a*(q.d - power(q.b, 2)/q.a)) - q.c/q.a)/(q.a*q.d*q.f - q.a*power(q.e, 2) - power(q.b, 2)*q.f + 2*q.b*q.c*q.e - power(q.c, 2)*q.d) + 1.0/q.a + power(q.b, 2)/(power(q.a, 2)*(q.d - power(q.b, 2)/q.a))) + delta_pmalpha*((q.e - q.b*q.c/q.a)*(q.a*q.d - power(q.b, 2))*(-q.b*(q.e - q.b*q.c/q.a)/(q.a*(q.d - power(q.b, 2)/q.a)) + q.c/q.a)/((q.d - power(q.b, 2)/q.a)*(q.a*q.d*q.f - q.a*power(q.e, 2) - power(q.b, 2)*q.f + 2*q.b*q.c*q.e - power(q.c, 2)*q.d)) - q.b/(q.a*(q.d - power(q.b, 2)/q.a))) - delta_pmdec*(q.a*q.d - power(q.b, 2))*(-q.b*(q.e - q.b*q.c/q.a)/(q.a*(q.d - power(q.b, 2)/q.a)) + q.c/q.a)/(q.a*q.d*q.f - q.a*power(q.e, 2) - power(q.b, 2)*q.f + 2*q.b*q.c*q.e - power(q.c, 2)*q.d)) + delta_pmalpha*(delta_parallax*(-(q.e - q.b*q.c/q.a)*(q.a*q.d - power(q.b, 2))*(q.b*(q.e - q.b*q.c/q.a)/(q.a*(q.d - power(q.b, 2)/q.a)) - q.c/q.a)/((q.d - power(q.b, 2)/q.a)*(q.a*q.d*q.f - q.a*power(q.e, 2) - power(q.b, 2)*q.f + 2*q.b*q.c*q.e - power(q.c, 2)*q.d)) - q.b/(q.a*(q.d - power(q.b, 2)/q.a))) + delta_pmalpha*(1.0/(q.d - power(q.b, 2)/q.a) + power(q.e - q.b*q.c/q.a, 2)*(q.a*q.d - power(q.b, 2))/(power(q.d - power(q.b, 2)/q.a, 2)*(q.a*q.d*q.f - q.a*power(q.e, 2) - power(q.b, 2)*q.f + 2*q.b*q.c*q.e - power(q.c, 2)*q.d))) - delta_pmdec*(q.e - q.b*q.c/q.a)*(q.a*q.d - power(q.b, 2))/((q.d - power(q.b, 2)/q.a)*(q.a*q.d*q.f - q.a*power(q.e, 2) - power(q.b, 2)*q.f + 2*q.b*q.c*q.e - power(q.c, 2)*q.d))) + delta_pmdec*(delta_parallax*(q.a*q.d - power(q.b, 2))*(q.b*(q.e - q.b*q.c/q.a)/(q.a*(q.d - power(q.b, 2)/q.a)) - q.c/q.a)/(q.a*q.d*q.f - q.a*power(q.e, 2) - power(q.b, 2)*q.f + 2*q.b*q.c*q.e - power(q.c, 2)*q.d) - delta_pmalpha*(q.e - q.b*q.c/q.a)*(q.a*q.d - power(q.b, 2))/((q.d - power(q.b, 2)/q.a)*(q.a*q.d*q.f - q.a*power(q.e, 2) - power(q.b, 2)*q.f + 2*q.b*q.c*q.e - power(q.c, 2)*q.d)) + delta_pmdec*(q.a*q.d - 
power(q.b, 2))/(q.a*q.d*q.f - q.a*power(q.e, 2) - power(q.b, 2)*q.f + 2*q.b*q.c*q.e - power(q.c, 2)*q.d))) AS lnl_mu
        FROM (
            SELECT
                gaia.source_id, gaia.ra, gaia.dec, gaia.parallax, gaia.pmra, gaia.pmdec,
                gaia.phot_g_mean_mag AS G_mag, tycho2.bt_mag, tycho2.vt_mag, parallax_error,
                pmra_error, pmdec_error, pmra_pmdec_corr, parallax_pmra_corr, parallax_pmdec_corr,
                (-0.5 / power(0.360000,2) * (power(+6.020000 - gaia.ra , 2) + power(-72.080000-gaia.dec, 2))) AS lnl_alpha_delta,
                (-0.5 / power(0.360000,2) * distance(point('icrs', gaia.ra, gaia.dec), point('icrs', +6.020000, -72.080000))) AS lnl_alpha_delta2,
                power(0.000000, 2) + power(gaia.parallax_error, 2) AS a,
                gaia.parallax_pmra_corr * gaia.parallax_error * gaia.pmra_error AS b,
                gaia.parallax_pmdec_corr * gaia.parallax_error * gaia.pmdec_error AS c,
                power(0.210000,2) + power(0.000000, 2) + power(gaia.pmra_error, 2) AS d,
                gaia.pmra_pmdec_corr * gaia.pmra_error * gaia.pmdec_error AS e,
                power(0.290000,2) + power(0.000000, 2) + power(gaia.pmdec_error, 2) AS f,
                (gaia.parallax - 0.222222) AS delta_parallax,
                (gaia.pmra + (-1) * 5.630000) AS delta_pmalpha,
                (gaia.pmdec + (-1) * -2.730000) AS delta_pmdec
            FROM
                gaiadr1.tgas_source AS gaia
            LEFT OUTER JOIN
                public.tycho2 AS tycho2
                ON gaia.tycho2_id = tycho2.id
            WHERE
                contains(point('ICRS', gaia.ra, gaia.dec),
                         circle('ICRS',6.020000, -72.080000, 1.080000) )=1
            ) AS q
        ) AS r








    




Execution time: 450 ms
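
The long `lnl_mu` expression in the query appears to be the expanded form of a Gaussian quadratic form, $-\frac{1}{2}\,\delta^T \Sigma^{-1} \delta$, where $\Sigma$ is the symmetric 3x3 matrix built from the subquery columns `a` to `f` and $\delta$ holds `delta_parallax`, `delta_pmalpha` and `delta_pmdec`. Assuming that reading is correct, the same quantity can be sketched on the Python side with NumPy. The function name `lnl_mu` and the star used in the example are made up for illustration; the cluster values and dispersions are the constants that appear in the query.

```python
import numpy as np

def lnl_mu(parallax, pmra, pmdec,
           parallax_error, pmra_error, pmdec_error,
           parallax_pmra_corr, parallax_pmdec_corr, pmra_pmdec_corr,
           cluster=(0.222222, 5.63, -2.73),   # parallax, pmra, pmdec from the query
           dispersion=(0.0, 0.21, 0.29)):     # added dispersions from the query
    """Return -0.5 * delta^T Sigma^-1 delta for one star (no normalisation term)."""
    # Covariance entries, named as in the subquery (a..f)
    a = dispersion[0] ** 2 + parallax_error ** 2
    b = parallax_pmra_corr * parallax_error * pmra_error
    c = parallax_pmdec_corr * parallax_error * pmdec_error
    d = dispersion[1] ** 2 + pmra_error ** 2
    e = pmra_pmdec_corr * pmra_error * pmdec_error
    f = dispersion[2] ** 2 + pmdec_error ** 2
    cov = np.array([[a, b, c],
                    [b, d, e],
                    [c, e, f]])
    delta = np.array([parallax, pmra, pmdec]) - np.array(cluster)
    # Solve Sigma x = delta rather than forming the inverse explicitly
    return -0.5 * delta @ np.linalg.solve(cov, delta)

# Hypothetical star, not taken from the archive
value = lnl_mu(0.4, 5.2, -2.5, 0.3, 0.8, 0.7, 0.1, -0.2, 0.3)
print(value)
```

Doing the linear algebra numerically avoids the hand-expanded inverse, at the cost of downloading the errors and correlations for every star instead of letting the server compute the term.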



In [47]:

    
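fields = ['source_id', 'ra', 'dec', 'parallax', 'lnl_alpha_delta', 'lnl_alpha_delta2', 'lnl_mu', 'lnl', 'lnl2']
# keep candidate members: total log-likelihood above -11
members = result[result['lnl'] > -11]
# sort in place (ascending), then display from most to least likely
members.sort('lnl')
members[fields][::-1]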









    Out[47]:




<Table masked=True length=8>

source_id	ra	dec	parallax	lnl_alpha_delta	lnl_alpha_delta2	lnl_mu	lnl	lnl2
	Angle[deg]	Angle[deg]	Angle[mas]
int64	float64	float64	float64	float64	float64	float64	float64	float64
4689644416501132800	6.2170101137593949	-71.936456942381554	0.37507341579041814	-0.22923454596428225	-0.60150036729073375	-2.7988054087	-3.02803995467	-3.40030577599
4689638437899435136	5.5745638831857649	-72.103433200803295	0.58021017218719462	-0.76760204113592645	-0.53610627647358955	-4.52626678339	-5.29386882452	-5.06237305986
4689645000616682240	6.0141150444826863	-71.929822389578547	0.7569972004848331	-0.087144858693433566	-0.57943138406236783	-5.61636849493	-5.70351335362	-6.19579987899
4689620330317403136	5.9132796741373417	-72.277756673829472	0.27418891704465148	-0.19481840276361445	-0.77328553777998654	-5.85211566785	-6.04693407061	-6.62540120563
4689832845301844352	6.0936958914206301	-71.891298383615776	1.0583841942748684	-0.15833095848109333	-0.73330595273316934	-6.13607645112	-6.2944074096	-6.86938240385
4689623594492482176	6.2653565131119864	-72.15881251500781	0.18046400711992883	-0.25621616916793633	-0.42062056968602829	-6.19642464456	-6.45264081373	-6.61704521425
4689595831823970304	5.4133921110160106	-72.414062612237686	0.67526452113793323	-1.8501966044470644	-1.4731690619749811	-4.82298628575	-6.6731828902	-6.29615534773
4690024022888359424	7.304409361221766	-71.815410192959902	0.1092560087723143	-6.6347036002454525	-1.8438481503016266	-3.9692286786	-10.6039322788	-5.8130768289
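
The columns in this table are consistent with `lnl = lnl_alpha_delta + lnl_mu` and `lnl2 = lnl_alpha_delta2 + lnl_mu`, so the two totals differ only in the position term. A short check with the values copied from Out[47] (NumPy only; the array names are chosen here for illustration) shows how the ranking changes when the position term is swapped:

```python
import numpy as np

# Values copied from Out[47], in the order shown (decreasing lnl)
lnl_alpha_delta = np.array([-0.22923454596428225, -0.76760204113592645,
                            -0.087144858693433566, -0.19481840276361445,
                            -0.15833095848109333, -0.25621616916793633,
                            -1.8501966044470644, -6.6347036002454525])
lnl_alpha_delta2 = np.array([-0.60150036729073375, -0.53610627647358955,
                             -0.57943138406236783, -0.77328553777998654,
                             -0.73330595273316934, -0.42062056968602829,
                             -1.4731690619749811, -1.8438481503016266])
lnl_mu = np.array([-2.7988054087, -4.52626678339, -5.61636849493,
                   -5.85211566785, -6.13607645112, -6.19642464456,
                   -4.82298628575, -3.9692286786])

lnl = lnl_alpha_delta + lnl_mu      # total with the Cartesian position term
lnl2 = lnl_alpha_delta2 + lnl_mu    # total with the distance-based position term

# Row indices ordered from most to least likely under lnl2
order2 = np.argsort(lnl2)[::-1]
print(order2)
```

The last star of the table, which only just passes the `lnl > -11` cut, moves up to third place under `lnl2`.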



In [48]:

    
fig_plot(data, ngc104, members=members, ms=4, alpha=0.5)

Some comments for the authors of Watkins et al.

The position likelihood is incorrect: it should use a spherical (great-circle) distance, not a Cartesian one. The Cartesian form gives more weight to variations in RA than in Dec.
Differences in the number of members come from a bug in their paper's table generation (private communication with L. Watkins). That bug aside, they apply more filtering later on, which does not change their conclusions.
They should provide the exact conversion they use to turn $\sigma_v$, $\sigma_r$ into their covariance. I am puzzled by the statistical treatment (here I ignore these terms, apparently without affecting the results).
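
To make the first point concrete, here is a small sketch comparing the Cartesian position term used for `lnl_alpha_delta` with one based on the great-circle separation. It assumes the cluster centre (6.02, -72.08) deg and the 0.36 deg scale from the query; the function names are illustrative, and the haversine formula is a choice made here, not necessarily what the archive's `distance` function uses internally.

```python
import numpy as np

RA0, DEC0, SIGMA = 6.02, -72.08, 0.36   # cluster centre and scale (deg), from the query

def lnl_cartesian(ra, dec):
    """Position term as in lnl_alpha_delta: plain differences in RA and Dec."""
    return -0.5 / SIGMA**2 * ((RA0 - ra)**2 + (DEC0 - dec)**2)

def separation_deg(ra, dec):
    """Great-circle separation from the cluster centre (haversine formula)."""
    ra1, dec1 = np.radians(RA0), np.radians(DEC0)
    ra2, dec2 = np.radians(ra), np.radians(dec)
    h = (np.sin((dec2 - dec1) / 2)**2
         + np.cos(dec1) * np.cos(dec2) * np.sin((ra2 - ra1) / 2)**2)
    return np.degrees(2 * np.arcsin(np.sqrt(h)))

def lnl_spherical(ra, dec):
    """Gaussian position term using the squared great-circle separation."""
    return -0.5 / SIGMA**2 * separation_deg(ra, dec)**2

# The same 0.5 deg coordinate offset, once in RA and once in Dec
print(lnl_cartesian(RA0 + 0.5, DEC0), lnl_spherical(RA0 + 0.5, DEC0))
print(lnl_cartesian(RA0, DEC0 + 0.5), lnl_spherical(RA0, DEC0 + 0.5))
```

At this declination cos(dec) is about 0.31, so a 0.5 deg offset in RA is only about 0.15 deg on the sky, yet the Cartesian term penalises it as if it were the full 0.5 deg. Note that `lnl_spherical` squares the separation, whereas the `lnl_alpha_delta2` column in the query is linear in `distance(...)`, so the two are not directly comparable.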



In [ ]:

    
-- table1: gaia (source catalogue)
-- table2: list of objects (ra, dec, size, name, ...)

SELECT gaia.ra, gaia.dec, gaia.source_id
FROM gaia, table2
WHERE contains(point('ICRS', gaia.ra, gaia.dec),
               circle('ICRS', table2.ra, table2.dec, 1)) = 1
  AND table2.name = 'bla'
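
One way to run the idea sketched in this cell without a server-side list table is to build one cone-search query per object on the client. The following is a minimal sketch under that assumption: `cone_query` and the `targets` list are hypothetical names, `gaiadr1.tgas_source` is simply the table used earlier in this notebook, and the NGC 104 entry reuses the centre and radius of the membership query. Submitting the query strings is left to whatever TAP client is at hand.

```python
def cone_query(ra, dec, radius, table="gaiadr1.tgas_source"):
    """Return an ADQL cone search around (ra, dec), all angles in degrees."""
    return (
        "SELECT ra, dec, source_id\n"
        f"FROM {table}\n"
        "WHERE contains(point('ICRS', ra, dec),\n"
        f"               circle('ICRS', {ra:.6f}, {dec:.6f}, {radius:.6f})) = 1"
    )

# Hypothetical target list: (name, ra, dec, search radius in deg)
targets = [("NGC 104", 6.02, -72.08, 1.08)]

# One query string per named object
queries = {name: cone_query(ra, dec, radius) for name, ra, dec, radius in targets}
print(queries["NGC 104"])
```

Generating the queries on the client keeps each request small and works in public mode; a single join against an uploaded table would be the natural alternative when the list is long.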

solution_id	source_id	random_index	ref_epoch	ra	ra_error	dec	dec_error	parallax	parallax_error	pmra	pmra_error	pmdec	pmdec_error	ra_dec_corr	ra_parallax_corr	ra_pmra_corr	ra_pmdec_corr	dec_parallax_corr	dec_pmra_corr	dec_pmdec_corr	parallax_pmra_corr	parallax_pmdec_corr	pmra_pmdec_corr	astrometric_n_obs_al	astrometric_n_obs_ac	astrometric_n_good_obs_al	astrometric_n_good_obs_ac	astrometric_n_bad_obs_al	astrometric_n_bad_obs_ac	astrometric_delta_q	astrometric_excess_noise	astrometric_excess_noise_sig	astrometric_primary_flag	astrometric_relegation_factor	astrometric_weight_al	astrometric_weight_ac	astrometric_priors_used	matched_observations	duplicated_source	scan_direction_strength_k1	scan_direction_strength_k2	scan_direction_strength_k3	scan_direction_strength_k4	scan_direction_mean_k1	scan_direction_mean_k2	scan_direction_mean_k3	scan_direction_mean_k4	phot_g_n_obs	phot_g_mean_flux	phot_g_mean_flux_error	phot_g_mean_mag	phot_variable_flag	l	b	ecl_lon	ecl_lat
			Time[Julian Years]	Angle[deg]	Angle[mas]	Angle[deg]	Angle[mas]	Angle[mas]	Angle[mas]	Angular Velocity[mas/year]	Angular Velocity[mas/year]	Angular Velocity[mas/year]	Angular Velocity[mas/year]	Dimensionless[see description]	Dimensionless[see description]	Dimensionless[see description]	Dimensionless[see description]	Dimensionless[see description]	Dimensionless[see description]	Dimensionless[see description]	Dimensionless[see description]	Dimensionless[see description]	Dimensionless[see description]								Angle[mas]				Angle[mas^-2]	Angle[mas^-2]								Angle[deg]	Angle[deg]	Angle[deg]	Angle[deg]		Flux[e-/s]	Flux[e-/s]	Magnitude[mag]	Dimensionless[see description]	Angle[deg]	Angle[deg]	Angle[deg]	Angle[deg]
int64	int64	int64	float64	float64	float64	float64	float64	float64	float64	float64	float64	float64	float64	float32	float32	float32	float32	float32	float32	float32	float32	float32	float32	int32	int32	int32	int32	int32	int32	float32	float64	float64	bool	float32	float32	float32	int32	int16	bool	float32	float32	float32	float32	float32	float32	float32	float32	int32	float64	float64	float64	object	float64	float64	float64	float64
1635378410781933568	4486895915443650432	905350894	2015.0	264.18122510033464	2.8003516165869082	7.0516429876037998	12.9212181467864	--	--	--	--	--	--	-0.077	--	--	--	--	--	--	--	--	--	43	0	41	0	2	0	--	3.2780269499321597	2.8301321572232419	False	1.6920748	0.036015101	--	2	12	False	0.86865312	0.62508136	0.66467804	0.8813076	-59.995804	-51.505093	-43.121849	-44.794041	43	218.43179998323626	3.0698243456034935	19.676480402386268	NOT_AVAILABLE	30.759313884222344	19.806472281755649	263.30404339742091	30.355774143855047
1635378410781933568	4486609183424689920	309752073	2015.0	266.16603765486315	0.55382871997862648	7.6271450028431031	0.65430966160671311	--	--	--	--	--	--	0.92455	--	--	--	--	--	--	--	--	--	89	0	89	0	0	0	--	0.0	0.0	False	1.0	0.6764009	--	2	19	False	0.46887282	0.5968886	0.31126449	0.8961063	-69.038536	-47.735455	-32.330299	-42.062531	89	885.14509075973967	2.612550689916894	18.15723390135684	NOT_AVAILABLE	32.225950841033132	18.294962138635221	265.56529752291794	31.007424683992905
1635378410781933568	4487954916943816960	1109852082	2015.0	267.43146648900176	3.3732871870793346	8.0916004466735227	4.1776617264829969	--	--	--	--	--	--	0.84584999	--	--	--	--	--	--	--	--	--	45	0	43	0	2	0	--	4.9278360069994429	5.5765534150526719	False	1.6911782	0.018240357	--	2	27	False	0.50051576	0.27280524	0.28483522	0.85747027	-106.40694	-69.268906	-8.6255636	-41.570301	44	130.70553988687644	2.1173294200763491	20.234035075545016	NOT_AVAILABLE	33.242582035644766	17.372733263801848	267.01707637453728	31.504289333914077
1635378410781933568	4488405063871908096	1085557435	2015.0	268.577179212977	0.45686872022632691	8.5283958071764445	0.66047878586423325	--	--	--	--	--	--	0.52380002	--	--	--	--	--	--	--	--	--	80	0	80	0	0	0	--	0.47314460723313795	0.4344997036814196	False	1.0409509	0.27349713	--	2	14	False	0.36665735	0.10474015	0.33550045	0.73903608	-104.3985	27.750549	-29.422886	-44.953251	77	492.79505895268744	4.7092584766237948	18.793104202112058	NOT_AVAILABLE	34.171511451972741	16.544154401270578	268.34148147147999	31.959485697633603
1635378410781933568	4488168978109094912	724156643	2015.0	266.9350267767021	0.45472258639757962	8.4501367670511502	0.52313455431024292	--	--	--	--	--	--	0.92054999	--	--	--	--	--	--	--	--	--	65	0	65	0	0	0	--	0.54443893282338873	2.7908803781634761	False	1.2134782	1.1420876	--	2	16	False	0.35940772	0.74710166	0.26141152	0.91352218	-62.16045	-45.337482	-33.693073	-42.355057	67	2029.0269077666758	3.3669145828795304	17.25655054776395	NOT_AVAILABLE	33.355528435673932	17.971330490608551	266.43026591010408	31.851445954397114

hip	tycho2_id	source_id	bt_mag	vt_mag	e_bt_mag	e_vt_mag
			'mag'	'mag'	'mag'	'mag'
int32	object	int64	float32	float32	float32	float32
--	1000-1009-1	4493714846038108800	12.762	12.157	0.236	0.193
--	1000-1016-1	4492839806583533312	11.131	10.695	0.057999998	0.061999999
--	1000-1018-1	4493575723457455872	12.224	11.849	0.155	0.163
--	1000-1043-1	4494114312356365568	10.274	9.2080002	0.035	0.021
--	1000-1068-1	4493519648365739008	11.95	10.608	0.106	0.048999999
--	1000-108-1	4493522603303232768	12.391	12.389	0.185	0.21600001
--	1000-1087-1	4493716048628949632	11.529	11.021	0.077	0.079999998
--	1000-1092-1	4492866469738055936	12.696	12.115	0.248	0.20900001
--	1000-111-1	4493709520280710528	12.048	11.467	0.121	0.104
--	1000-1117-1	4493890870978560640	11.357	10.841	0.064000003	0.064000003