Dataset collectionsΒΆ

A DatasetCollection instance describes a collection of different datasets that can be combined in one analysis.

As usual, we need to instantiate a Config first

[12]:
from skyllh.core.config import Config

cfg = Config()

The available data samples are accessible through the skyllh.data_samples dictionary

[13]:
from skyllh.datasets import data_samples

data_samples
[13]:
{'IceTracks-DR1': <module 'skyllh.datasets.i3.PublicData_10y_ps' from '/Users/chiarabellenghi/Work/skyllh_pre_release/skyllh/skyllh/datasets/i3/PublicData_10y_ps.py'>,
 'IceTracks-DR2': <module 'skyllh.datasets.i3.PublicData_14y_ps' from '/Users/chiarabellenghi/Work/skyllh_pre_release/skyllh/skyllh/datasets/i3/PublicData_14y_ps.py'>,
 'IceTracks-DR1_wMC': <module 'skyllh.datasets.i3.PublicData_10y_ps_wMC' from '/Users/chiarabellenghi/Work/skyllh_pre_release/skyllh/skyllh/datasets/i3/PublicData_10y_ps_wMC.py'>,
 'TestData': <module 'skyllh.datasets.i3.TestData' from '/Users/chiarabellenghi/Work/skyllh_pre_release/skyllh/skyllh/datasets/i3/TestData.py'>}

To access the IceTracks-DR2 dataset collection:

[14]:
dsc = data_samples['IceTracks-DR2'].create_dataset_collection(cfg=cfg)

The dataset_names property provides a list of all the data sets defined in the dataset collection.

[15]:
dsc.dataset_names
[15]:
['IC40',
 'IC59',
 'IC79',
 'IC86_I',
 'IC86_I-XI',
 'IC86_II',
 'IC86_III',
 'IC86_IV',
 'IC86_IX',
 'IC86_V',
 'IC86_VI',
 'IC86_VII',
 'IC86_VIII',
 'IC86_X',
 'IC86_XI']

When importing data to create an analysis object, one can decides which specific data sets to use:

[20]:
import skyllh

datasets = skyllh.create_datasets('IceTracks-DR2', cfg=cfg, names=['IC86_XI'])
datasets
[20]:
[<skyllh.i3.dataset.I3Dataset at 0x10de8ad50>]

Information about the data being selected can be printed to console for each of the selected data sets:

[21]:
print(datasets[0])
Dataset "IC86_XI": v001p00
    { livetime = UNDEFINED }
    Experimental data:
        [NOT FOUND] /Users/chiarabellenghi/.cache/skyllh/icecube_pstracks_v004p02/events/IC86_XI_exp.csv
    MC data:

    Auxiliary data:
        eff_area_datafile:
            [NOT FOUND] /Users/chiarabellenghi/.cache/skyllh/icecube_pstracks_v004p02/irfs/IC86_effectiveArea.csv
        smearing_datafile:
            [NOT FOUND] /Users/chiarabellenghi/.cache/skyllh/icecube_pstracks_v004p02/irfs/IC86_smearing.csv
    GRL data:
        [NOT FOUND] /Users/chiarabellenghi/.cache/skyllh/icecube_pstracks_v004p02/uptime/IC86_XI_exp.csv

If no names are passed to the skyllh.create_datasets method, the combination of all available data sets in the dataset collection will be imported.