Advanced: Datasets

Datasets tell PHOEBE how and at what times to compute the model. In some cases these will include the actual observational data, and in other cases may only include the times at which you want to compute a synthetic model.

If you're not already familiar with the basic functionality of adding datasets, make sure to read the datasets tutorial first.

Setup

Let's first make sure we have the latest version of PHOEBE 2.3 installed. (You can comment out this line if you don't use pip for your installation or don't want to update to the latest release).


In [ ]:
!pip install -I "phoebe>=2.3,<2.4"

As always, let's do imports and initialize a logger and a new Bundle. See Building a System for more details.


In [1]:
import phoebe
from phoebe import u # units
import numpy as np
import matplotlib.pyplot as plt

logger = phoebe.logger()

b = phoebe.default_binary()

Passband Options

Passband options follow the exact same rules as dataset columns.

Sending a single value to the argument will apply it to each component in which the time array is attached (either based on the list of components sent or the defaults from the dataset method).

Note that for light curves, in particular, this rule gets slightly bent. The dataset arrays for light curves are attached at the system level, always. The passband-dependent options, however, exist for each star in the system. So, that value will get passed to each star if the component is not explicitly provided.


In [16]:
b.add_dataset('lc', 
              times=[0,1], 
              ld_func='logarithmic', 
              dataset='lc01', 
              overwrite=True)


Wed, 11 Dec 2019 16:15 BUNDLE       WARNING rvs@primary@rv02@rv@dataset must be of same length as times@primary@rv02@rv@dataset  If not addressed, this warning will continue to be raised and will throw an error at run_compute.
Wed, 11 Dec 2019 16:15 BUNDLE       WARNING rvs@secondary@rv02@rv@dataset must be of same length as times@secondary@rv02@rv@dataset  If not addressed, this warning will continue to be raised and will throw an error at run_compute.
Out[16]:
<ParameterSet: 43 parameters | contexts: compute, dataset, constraint, figure>

In [17]:
print(b['times@lc01'])


Parameter: times@lc01@dataset
                       Qualifier: times
                     Description: Observed times
                           Value: [0. 1.] d
                  Constrained by: 
                      Constrains: None
                      Related to: None


In [18]:
print(b['ld_func@lc01'])


ParameterSet: 0 parameters
NO PARAMETERS

As you might expect, if you want to pass different values to different components, simply provide them in a dictionary.


In [19]:
b.add_dataset('lc', 
              times=[0,1], 
              ld_mode='manual',
              ld_func={'primary': 'logarithmic', 'secondary': 'quadratic'}, 
              dataset='lc01',
             overwrite=True)


Wed, 11 Dec 2019 16:15 BUNDLE       WARNING rvs@primary@rv02@rv@dataset must be of same length as times@primary@rv02@rv@dataset  If not addressed, this warning will continue to be raised and will throw an error at run_compute.
Wed, 11 Dec 2019 16:15 BUNDLE       WARNING rvs@secondary@rv02@rv@dataset must be of same length as times@secondary@rv02@rv@dataset  If not addressed, this warning will continue to be raised and will throw an error at run_compute.
Out[19]:
<ParameterSet: 43 parameters | contexts: compute, dataset, constraint, figure>

In [20]:
print(b['ld_func@lc01'])


ParameterSet: 2 parameters
    ld_func@primary@lc01@dataset: logarithmic
  ld_func@secondary@lc01@dataset: quadratic

Note here that we didn't explicitly override the defaults for '_default', so they used the phoebe-wide defaults. If you wanted to set a value for the ld_coeffs of any star added in the future, you would have to provide a value for '_default' in the dictionary as well.


In [21]:
print(b.filter('ld_func@lc01', check_default=False))


ParameterSet: 3 parameters
   ld_func@_default@lc01@dataset: logarithmic
    ld_func@primary@lc01@dataset: logarithmic
  ld_func@secondary@lc01@dataset: quadratic

This syntax may seem a bit bulky - but alternatively you can add the dataset without providing values and then change the values individually using dictionary access or set_value.

Adding a Dataset from a File

Manually from Arrays

For now, the only way to load data from a file is to do the parsing externally and pass the arrays on (as in the previous section).

Here we'll load times, fluxes, and errors of a light curve from an external file and then pass them on to a newly created dataset. Since this is a light curve, it will automatically know that you want the summed light from all copmonents in the hierarchy.


In [22]:
times, fluxes, sigmas = np.loadtxt('test.lc.in', unpack=True)
b.add_dataset(phoebe.dataset.lc, 
              times=times, 
              fluxes=fluxes, 
              sigmas=sigmas, 
              dataset='lc01',
              overwrite=True)


Wed, 11 Dec 2019 16:15 BUNDLE       WARNING rvs@primary@rv02@rv@dataset must be of same length as times@primary@rv02@rv@dataset  If not addressed, this warning will continue to be raised and will throw an error at run_compute.
Wed, 11 Dec 2019 16:15 BUNDLE       WARNING rvs@secondary@rv02@rv@dataset must be of same length as times@secondary@rv02@rv@dataset  If not addressed, this warning will continue to be raised and will throw an error at run_compute.
Out[22]:
<ParameterSet: 43 parameters | contexts: compute, dataset, constraint, figure>

Enabling and Disabling Datasets

See the Compute Tutorial

Dealing with Phases

Datasets will no longer accept phases. It is the user's responsibility to convert phased data into times given an ephemeris. But it's still useful to be able to convert times to phases (and vice versa) and be able to plot in phase.

Those conversions can be handled via b.get_ephemeris, b.to_phase, and b.to_time.


In [23]:
print(b.get_ephemeris())


{'period': 1.0, 't0': 0.0, 'dpdt': 0.0}

In [24]:
print(b.to_phase(0.0))


0.0

In [25]:
print(b.to_time(-0.25))


-0.25

All of these by default use the period in the top-level of the current hierarchy, but accept a component keyword argument if you'd like the ephemeris of an inner-orbit or the rotational ephemeris of a star in the system.

We'll see how plotting works later, but if you manually wanted to plot the dataset with phases, all you'd need to do is:


In [26]:
print(b.to_phase(b['times@primary@rv01']))


[0. 0.]

or


In [27]:
print(b.to_phase('times@primary@rv01'))


[0. 0.]

Although it isn't possible to attach data in phase-space, it is possible (new in PHOEBE 2.2) to tell PHOEBE at which phases to compute the model by setting compute_phases. Note that this overrides the value of times when the model is computed.


In [28]:
b.add_dataset('lc',
              compute_phases=np.linspace(0,1,11),
              dataset='lc01',
              overwrite=True)


Wed, 11 Dec 2019 16:15 BUNDLE       WARNING rvs@primary@rv02@rv@dataset must be of same length as times@primary@rv02@rv@dataset  If not addressed, this warning will continue to be raised and will throw an error at run_compute.
Wed, 11 Dec 2019 16:15 BUNDLE       WARNING rvs@secondary@rv02@rv@dataset must be of same length as times@secondary@rv02@rv@dataset  If not addressed, this warning will continue to be raised and will throw an error at run_compute.
Out[28]:
<ParameterSet: 43 parameters | contexts: compute, dataset, constraint, figure>

The usage of compute_phases (as well as compute_times) will be discussed in further detail in the compute tutorial and the advanced: compute times & phases tutorial.

Note also that although you can pass compute_phases directly to add_dataset, if you do not, it will be constrained by compute_times by default. In this case, you would need to flip the constraint before setting compute_phases. See the constraints tutorial and the flip_constraint API docs for more details on flipping constraints.


In [29]:
b.add_dataset('lc',
              times=[0],
              dataset='lc01', 
              overwrite=True)


Wed, 11 Dec 2019 16:15 BUNDLE       WARNING rvs@primary@rv02@rv@dataset must be of same length as times@primary@rv02@rv@dataset  If not addressed, this warning will continue to be raised and will throw an error at run_compute.
Wed, 11 Dec 2019 16:15 BUNDLE       WARNING rvs@secondary@rv02@rv@dataset must be of same length as times@secondary@rv02@rv@dataset  If not addressed, this warning will continue to be raised and will throw an error at run_compute.
Out[29]:
<ParameterSet: 43 parameters | contexts: compute, dataset, constraint, figure>

In [30]:
print(b['compute_phases@lc01'])


ParameterSet: 2 parameters
*    compute_phases@lc01@dataset: []
  compute_phases@lc01@constraint: times_to_phases({compute_times@lc01@dataset}, {period@binary@component}, {dpdt@binary@component}, {compute_phases_t0@lc01@dataset}, {t0_supconj@binary@component}, {t0_perpass@binary@component}, {t0_ref@binary@component})

In [31]:
b.flip_constraint('compute_phases', dataset='lc01', solve_for='compute_times')


Wed, 11 Dec 2019 16:15 BUNDLE       WARNING rvs@primary@rv02@rv@dataset must be of same length as times@primary@rv02@rv@dataset  If not addressed, this warning will continue to be raised and will throw an error at run_compute.
Wed, 11 Dec 2019 16:15 BUNDLE       WARNING rvs@secondary@rv02@rv@dataset must be of same length as times@secondary@rv02@rv@dataset  If not addressed, this warning will continue to be raised and will throw an error at run_compute.
Out[31]:
<ConstraintParameter: {compute_times@lc01@dataset} = phases_to_times({compute_phases@lc01@dataset}, {period@binary@component}, {dpdt@binary@component}, {compute_phases_t0@lc01@dataset}, {t0_supconj@binary@component}, {t0_perpass@binary@component}, {t0_ref@binary@component}) (solar units) => [] d>

In [32]:
b.set_value('compute_phases', dataset='lc01', value=np.linspace(0,1,101))


Wed, 11 Dec 2019 16:15 BUNDLE       WARNING rvs@primary@rv02@rv@dataset must be of same length as times@primary@rv02@rv@dataset  If not addressed, this warning will continue to be raised and will throw an error at run_compute.
Wed, 11 Dec 2019 16:15 BUNDLE       WARNING rvs@secondary@rv02@rv@dataset must be of same length as times@secondary@rv02@rv@dataset  If not addressed, this warning will continue to be raised and will throw an error at run_compute.
Wed, 11 Dec 2019 16:15 BUNDLE       WARNING rvs@primary@rv02@rv@dataset must be of same length as times@primary@rv02@rv@dataset  If not addressed, this warning will continue to be raised and will throw an error at run_compute.
Wed, 11 Dec 2019 16:15 BUNDLE       WARNING rvs@secondary@rv02@rv@dataset must be of same length as times@secondary@rv02@rv@dataset  If not addressed, this warning will continue to be raised and will throw an error at run_compute.

Removing Datasets

Removing a dataset will remove matching parameters in either the dataset, model, or constraint contexts. This action is permanent and not undo-able via Undo/Redo.


In [33]:
print(b.datasets)


['orb01', 'rv01', 'rv02', 'lc01']

The simplest way to remove a dataset is by its dataset tag:


In [34]:
b.remove_dataset('lc01')


Out[34]:
<ParameterSet: 43 parameters | contexts: compute, dataset, constraint, figure>

In [35]:
print(b.datasets)


['orb01', 'rv01', 'rv02']

But remove_dataset also takes any other tag(s) that could be sent to filter.


In [36]:
b.remove_dataset(kind='rv')


Out[36]:
<ParameterSet: 78 parameters | contexts: compute, dataset, constraint, figure>

In [37]:
print(b.datasets)


['orb01']