PISA Example Using Public IceCube Data¶

This analysis is a close variant of what is published in [1], and referred to as Sample B in [2] and [3]. The set of systematic uncertainties is slightly reduced and results will be close to, but not identical to the published numbers.

References:

[1] “Measurement of Atmospheric Neutrino Oscillations at 6-56 GeV with IceCube DeepCore,” IceCube Collaboration: M. G. Aartsen et al., Physical Review Letters 120, 071801 (2018). DOI: 10.1103/PhysRevLett.120.071801
[2] “Measurement of Atmospheric Tau Neutrino Appearance with IceCube DeepCore,” IceCube Collaboration: M. G. Aartsen et al., Phys. Rev. D99.3(2019), p. 032007. DOI: 10.1103/PhysRevD.99.032007
[3] “Probing the Neutrino Mass Ordering with Atmospheric Neutrinos from Three Years of IceCube DeepCore Data,” IceCube Collaboration: M.G. Aartsen et al., Feb 20, 2019, preprint: arXiv:1902.07771

Dataset must be obtained from https://icecube.wisc.edu/science/data/highstats_nuosc_3y

Setup¶

Usually you would need to export the location of the data and the config files to PISA_RESOURCES. For this example however, the data is inside our package.

import copy
from uncertainties import unumpy as unp
import matplotlib.pyplot as plt
import pisa
from pisa import ureg
from pisa.analysis.analysis import Analysis
from pisa.core.distribution_maker import DistributionMaker
from pisa.core.pipeline import Pipeline

<< PISA is running in double precision (FP64) mode; numba is running on CPU (single core) >>

Model¶

We can now instantiate the model (given our configs) that we later fit to data. This contains two Pipelines in a DistributionMaker, one for our neutrinos, and one for the background muons. We turn on profiling so that we will be able to analyse the time it took to generate the model predictions over the course of the analysis.

model = DistributionMaker(
    pipelines=["settings/pipeline/IceCube_3y_neutrinos.cfg", "settings/pipeline/IceCube_3y_muons.cfg"],
    profile=True
)
model

pipeline number	name	detector name	output_binning	output_key	profile
0	neutrinos		"dragon_datarelease": 8 (reco_energy) x 8 (reco_coszen) x 2 (pid)	('weights', 'errors')	True
1	muons		"dragon_datarelease": 8 (reco_energy) x 8 (reco_coszen) x 2 (pid)	('weights', 'errors')	True

Our model has a number of free parameters, that will be used in our fit to the data

model.params.free

name	value	nominal_value	range	prior	units	is_fixed
nue_numu_ratio	1	1	[np.float64(0.5), np.float64(1.5)]	+/- 0.05	dimensionless	False
Barr_uphor_ratio	0	0	[np.float64(-3.0), np.float64(3.0)]	+/- 1.0	dimensionless	False
Barr_nu_nubar_ratio	0	0	[np.float64(-3.0), np.float64(3.0)]	+/- 1.0	dimensionless	False
delta_index	0	0	[np.float64(-0.5), np.float64(0.5)]	+/- 0.1	dimensionless	False
theta13	8.5	8.5	[np.float64(7.85), np.float64(9.1)]	+/- 0.205	degree	False
theta23	42.3	42.3	[np.int64(31), np.int64(59)]	uniform	degree	False
deltam31	0.002457	0.002457	[np.float64(0.001), np.float64(0.007)]	uniform	electron_volt ** 2	False
aeff_scale	1	1	[np.float64(0.0), np.float64(3.0)]	uniform	dimensionless	False
nutau_norm	1	1	[np.float64(-1.0), np.float64(8.5)]	uniform	dimensionless	False
nu_nc_norm	1	1	[np.float64(0.5), np.float64(1.5)]	+/- 0.2	dimensionless	False
opt_eff_overall	1	1	[np.float64(0.8), np.float64(1.2)]	+/- 0.1	dimensionless	False
opt_eff_lateral	25	25	[np.int64(5), np.int64(50)]	+/- 10.0	dimensionless	False
opt_eff_headon	0	0	[np.float64(-5.0), np.float64(2.0)]	uniform	dimensionless	False
ice_scattering	0	0	[np.int64(-15), np.int64(15)]	+/- 10.0	dimensionless	False
ice_absorption	0	0	[np.int64(-15), np.int64(15)]	+/- 10.0	dimensionless	False
atm_muon_scale	1	1	[np.float64(0.0), np.float64(5.0)]	uniform	dimensionless	False

The two pipelines are quite different, with most complexity in the neutrino pipeline, that has several Stages and free parameters:

model.pipelines[0]

stage number	name	calc_mode	apply_mode	has setup	has compute	has apply	# fixed params	# free params
0	csv_loader	events	events	True	False	True	0	0
1	honda_ip	"true_allsky_fine": 200 (true_energy) x 200 (true_coszen)	events	True	True	False	1	0
2	barr_simple	"true_allsky_fine": 200 (true_energy) x 200 (true_coszen)	events	True	True	False	1	4
3	prob3	"true_allsky_fine": 200 (true_energy) x 200 (true_coszen)	events	True	True	True	9	3
4	aeff	events	events	False	False	True	2	3
5	hist	events	"dragon_datarelease": 8 (reco_energy) x 8 (reco_coszen) x 2 (pid)	True	False	True	0	0
6	hypersurfaces	"dragon_datarelease": 8 (reco_energy) x 8 (reco_coszen) x 2 (pid)	"dragon_datarelease": 8 (reco_energy) x 8 (reco_coszen) x 2 (pid)	True	True	True	0	5

model.pipelines[0].stages[2].params

name	value	nominal_value	range	prior	units	is_fixed
nu_nubar_ratio	1	1	[np.float64(0.7), np.float64(1.3)]	+/- 0.1	1	True
nue_numu_ratio	1	1	[np.float64(0.5), np.float64(1.5)]	+/- 0.05	1	False
Barr_uphor_ratio	0	0	[np.float64(-3.0), np.float64(3.0)]	+/- 1.0	1	False
Barr_nu_nubar_ratio	0	0	[np.float64(-3.0), np.float64(3.0)]	+/- 1.0	1	False
delta_index	0	0	[np.float64(-0.5), np.float64(0.5)]	+/- 0.1	1	False

In contrast, the muon pipeline is rather simple:

model.pipelines[1]

stage number	name	calc_mode	apply_mode	has setup	has compute	has apply	# fixed params	# free params
0	csv_icc_hist	"dragon_datarelease": 8 (reco_energy) x 8 (reco_coszen) x 2 (pid)	"dragon_datarelease": 8 (reco_energy) x 8 (reco_coszen) x 2 (pid)	True	False	True	0	1

Retrieve Outputs¶

We can get individual outputs from a pipeline like so. This fetches outputs from the neutrino pipeline, here 12 maps.

maps = model.pipelines[0].get_outputs()

maps.names

['nue_cc',
 'numu_cc',
 'nutau_cc',
 'nue_nc',
 'numu_nc',
 'nutau_nc',
 'nuebar_cc',
 'numubar_cc',
 'nutaubar_cc',
 'nuebar_nc',
 'numubar_nc',
 'nutaubar_nc']

fig, axes = plt.subplots(3,4, figsize=(24,10))
plt.subplots_adjust(hspace=0.5)
axes = axes.flatten()

for m, ax in zip(maps, axes):
    m.plot(ax=ax)

../_images/6f10848b44761cbc23aa254be67e4138fb01ad309d95bb1d8b31a13361673115.png

If we are interested in just the total expectation from the full model (all neutrinos + muons), we can do the following:

model.get_outputs(return_sum=True).plot()

../_images/53bf109f77d76f65586aa374ae026455da6aef9fc4032edbe4f72d0c6dbbce72.png

Diff plots¶

Let’s explore how a change in one of our nuisance parameters affects the expected counts per bin. Here we choose a hole ice parameter and move it a smidge.

# reset all free parameters to put them back to nominal values
model.reset_free()
nominal = model.get_outputs(return_sum=True)

# shift one parameter
model.params.opt_eff_lateral.value = 20
sys = model.get_outputs(return_sum=True)

((nominal[0] - sys[0])/nominal[0]).plot(symm=True, clabel="rel. difference")

(<Figure size 640x480 with 3 Axes>,
 <Axes: title={'center': '$Total,{\\;}{\\rm PID}{\\;}bin{\\;}0$'}, xlabel='$E_{\\rm reco} \\; \\left( \\mathrm{GeV} \\right)$', ylabel='$\\cos{\\theta}_{\\rm reco}$'>,
 <matplotlib.collections.QuadMesh at 0x7f66a1f89250>,
 None)

../_images/412f8c02c675119226f7d128e5c7c8b92d1341a01afa5a53e7f96e8631f03fe7.png

Get Data¶

We can load the real observed data too. This is a Pipeline with no free parameters, as the data is of course fixed. NB: When developing a new analysis you will not be allowed to look at the data as we do here before the box opening (c.f. blindness).

# real data
data_maker = Pipeline("settings/pipeline/IceCube_3y_data.cfg")
data = data_maker.get_outputs()

data_maker

stage number	name	calc_mode	apply_mode	has setup	has compute	has apply	# fixed params	# free params
0	csv_data_hist	events	"dragon_datarelease": 8 (reco_energy) x 8 (reco_coszen) x 2 (pid)	True	False	False	0	0

fig, ax = plt.subplots(1, 3, figsize=(20, 3))

model.reset_free()
nominal = model.get_outputs(return_sum=True)

data.plot(ax=ax[0], title="Data")
nominal.plot(ax=ax[1], title="Model")
(data - nominal).plot(ax=ax[2], symm=True, title="Diff")

../_images/29d7b6ee42f55fde215a90b82b14f5f6956b0291ab44457963bc77c28f0d0544.png

Fitting¶

For fitting we need to configure a minimizer: several standard cfgs are available, but you can also define your own. Here we make use of SciPy’s SLSQP solver. In addition, we need to choose a “metric” (objective function), here PISA’s “modified chi-square”. Also, we fit theta23 octants, which are quasi degenerate, separately. PISA automatically minimizes over the separate outcomes to arrive at a global best fit.

minimizer_cfg = pisa.utils.fileio.from_file('settings/minimizer/slsqp_ftol1e-6_eps1e-4_maxiter1000.json')
# instantiate the analysis object 
ana = Analysis()

method = "octants"
method_kwargs = {
    "angle": "theta23",
    "inflection_point": 45 * ureg.deg
}
local_fit_kwargs = {
    "method": "scipy",
    "method_kwargs": minimizer_cfg,
    "local_fit_kwargs": None
}

%%time
result = ana.fit_recursively(
    data_dist=data,
    hypo_maker=model,
    metric='mod_chi2',
    external_priors_penalty=None,
    method=method,
    method_kwargs=method_kwargs,
    local_fit_kwargs=local_fit_kwargs
)

Now we can investigate the fit result, such as the parameter values that have been found to minimize the metric, or the value of the latter itself at this point.

min_metric = result.metric_val
print(f"minimal metric value: {min_metric:.5g}")

minimal metric value: 115.81

bestfit_params = result.params.free
bestfit_params

name	value	nominal_value	range	prior	units	is_fixed
nue_numu_ratio	1.03301	1	[np.float64(0.5), np.float64(1.5)]	+/- 0.05	dimensionless	False
Barr_uphor_ratio	-0.315637	0	[np.float64(-3.0), np.float64(3.0)]	+/- 1.0	dimensionless	False
Barr_nu_nubar_ratio	-0.0208958	0	[np.float64(-3.0), np.float64(3.0)]	+/- 1.0	dimensionless	False
delta_index	-0.0106002	0	[np.float64(-0.5), np.float64(0.5)]	+/- 0.1	dimensionless	False
theta13	8.51035	8.5	[np.float64(7.85), np.float64(9.1)]	+/- 0.205	degree	False
theta23	46.251	47.7	[np.int64(31), np.int64(59)]	uniform	degree	False
deltam31	0.00242612	0.002457	[np.float64(0.001), np.float64(0.007)]	uniform	electron_volt ** 2	False
aeff_scale	0.934906	1	[np.float64(0.0), np.float64(3.0)]	uniform	dimensionless	False
nutau_norm	0.610797	1	[np.float64(-1.0), np.float64(8.5)]	uniform	dimensionless	False
nu_nc_norm	1.26362	1	[np.float64(0.5), np.float64(1.5)]	+/- 0.2	dimensionless	False
opt_eff_overall	1.05343	1	[np.float64(0.8), np.float64(1.2)]	+/- 0.1	dimensionless	False
opt_eff_lateral	22.0036	25	[np.int64(5), np.int64(50)]	+/- 10.0	dimensionless	False
opt_eff_headon	-1.32362	0	[np.float64(-5.0), np.float64(2.0)]	uniform	dimensionless	False
ice_scattering	-2.87944	0	[np.int64(-15), np.int64(15)]	+/- 10.0	dimensionless	False
ice_absorption	1.75368	0	[np.int64(-15), np.int64(15)]	+/- 10.0	dimensionless	False
atm_muon_scale	0.931613	1	[np.float64(0.0), np.float64(5.0)]	uniform	dimensionless	False

# update the model with the best fit
model.update_params(copy.deepcopy(bestfit_params))

Let’s see how good that fit looks. We here construct signed mod_chi2 maps by hand. You can see that after the fit, it improved considerably, and the distribution of chi2 values is now more uniform - only few features remain visible.

fig, ax = plt.subplots(2, 3, figsize=(20, 7))
plt.subplots_adjust(hspace=0.5)

bestfit = model.get_outputs(return_sum=True)

data.plot(ax=ax[0,0], title="Data")
nominal.plot(ax=ax[0,1], title="Nominal")
diff = data - nominal
(abs(diff)*diff/(nominal + unp.std_devs(nominal.hist['total']))).plot(ax=ax[0,2], symm=True, title=r"signed $\chi^2$", vmin=-12, vmax=12)

data.plot(ax=ax[1,0], title="Data")
bestfit.plot(ax=ax[1,1], title="Bestfit")
diff = data - bestfit
(abs(diff)*diff/(bestfit + unp.std_devs(bestfit.hist['total']))).plot(ax=ax[1,2], symm=True, title=r"signed $\chi^2$", vmin=-12, vmax=12)

../_images/773015b5fea3b41b5f5c21b0d968211d18d2535df23d3e734ed37d71a71b9e48.png

When checking the chi2 value from the fitted model, you maybe see that it is around 113, while in the minimizer loop we saw it converged to 116. It is important to keep in mind that in the fit we had extended the metric with prior penalty terms. When we add those back we get the identical number as reported in the fit.

print(data.metric_total(nominal, 'mod_chi2'))
print(data.metric_total(bestfit, 'mod_chi2'))

175.08610766919114
113.03289312869633

Evaluating other metrics just for fun:

for metric in pisa.utils.stats.ALL_METRICS:
    try:
        print(f'{metric} = {data.metric_total(bestfit, metric):.3f}')
    except:
        print(f'{metric} failed')

llh = -64.971
conv_llh = -57.324
barlow_llh failed
mcllh_mean = -541.331
mcllh_eff = -541.484
generalized_poisson_llh failed
chi2 = 128.473
mod_chi2 = 113.033
correct_chi2 = 847.217
weighted_chi2 failed
signed_sqrt_mod_chi2 = -9.176

Adding prior penalty terms

model.update_params(copy.deepcopy(bestfit_params))
print(data.metric_total(bestfit, 'mod_chi2') + model.params.priors_penalty('mod_chi2'))

115.80889047543769

print(result.metric_val)

115.80889047543769

Storing the results to a file¶

Since the fit took a while, it might be useful to store the results to a file. (NB: in this example we use a temporary file, but in practice you would just use a real path instead.)

import tempfile
temp = tempfile.NamedTemporaryFile(suffix='.json')
pisa.utils.fileio.to_file(result, temp.name, warn=False)

To reload, we can read the file and re-instantiate the set of fit parameters.

result_reload = pisa.utils.fileio.from_file(temp.name)
temp.close() # remove the temporary file again
bestfit_params = pisa.core.ParamSet(result_reload['params']).free
bestfit_params

name	value	nominal_value	range	prior	units	is_fixed
nue_numu_ratio	1.03301	1	[0.5, 1.5]	+/- 0.05	dimensionless	False
Barr_uphor_ratio	-0.315637	0	[-3.0, 3.0]	+/- 1.0	dimensionless	False
Barr_nu_nubar_ratio	-0.0208958	0	[-3.0, 3.0]	+/- 1.0	dimensionless	False
delta_index	-0.0106002	0	[-0.5, 0.5]	+/- 0.1	dimensionless	False
theta13	8.51035	8.5	[7.85, 9.1]	+/- 0.205	degree	False
theta23	46.251	47.7	[31, 59]	uniform	degree	False
deltam31	0.00242612	0.002457	[0.001, 0.007]	uniform	electron_volt ** 2	False
aeff_scale	0.934906	1	[0.0, 3.0]	uniform	dimensionless	False
nutau_norm	0.610797	1	[-1.0, 8.5]	uniform	dimensionless	False
nu_nc_norm	1.26362	1	[0.5, 1.5]	+/- 0.2	dimensionless	False
opt_eff_overall	1.05343	1	[0.8, 1.2]	+/- 0.1	dimensionless	False
opt_eff_lateral	22.0036	25	[5, 50]	+/- 10.0	dimensionless	False
opt_eff_headon	-1.32362	0	[-5.0, 2.0]	uniform	dimensionless	False
ice_scattering	-2.87944	0	[-15, 15]	+/- 10.0	dimensionless	False
ice_absorption	1.75368	0	[-15, 15]	+/- 10.0	dimensionless	False
atm_muon_scale	0.931613	1	[0.0, 5.0]	uniform	dimensionless	False

Profiling¶

To understand what parts of the model were executed how many times, and how long it took, the profiling can help:

model.report_profile()

Pipeline: neutrinos
- setup:        Total time (s): 4.662, n calls: 1
- run:          Total time (s): 410.261, n calls: 782, time/call (s): mean 0.525, max. 24.954, min. 0.040
- get_outputs:  Total time (s): 413.123, n calls: 782, time/call (s): mean 0.528, max. 24.962, min. 0.043
Individual services:
data csv_loader
- setup:    Total time (s): 4.531, n calls: 1
- compute:  Total time (s): +0.000, n calls: 1
- apply:    Total time (s): 0.905, n calls: 782, time/call (s): mean 0.001, max. 0.007, min. +0.000
flux honda_ip
- setup:    Total time (s): 0.025, n calls: 1
- compute:  Total time (s): 23.341, n calls: 1
- apply:    Total time (s): +0.000, n calls: 782, time/call (s): mean +0.000, max. +0.000, min. +0.000
flux barr_simple
- setup:    Total time (s): +0.000, n calls: 1
- compute:  Total time (s): 82.179, n calls: 331, time/call (s): mean 0.248, max. 0.516, min. 0.187
- apply:    Total time (s): +0.000, n calls: 782, time/call (s): mean +0.000, max. +0.000, min. +0.000
osc prob3
- setup:    Total time (s): 0.076, n calls: 1
- compute:  Total time (s): 258.130, n calls: 291, time/call (s): mean 0.887, max. 1.811, min. 0.755
- apply:    Total time (s): 13.937, n calls: 782, time/call (s): mean 0.018, max. 0.461, min. 0.005
aeff aeff
- setup:    Total time (s): +0.000, n calls: 1
- compute:  Total time (s): +0.000, n calls: 291, time/call (s): mean +0.000, max. +0.000, min. +0.000
- apply:    Total time (s): 1.699, n calls: 782, time/call (s): mean 0.002, max. 0.014, min. 0.002
utils hist
- setup:    Total time (s): 0.024, n calls: 1
- compute:  Total time (s): +0.000, n calls: 1
- apply:    Total time (s): 26.674, n calls: 782, time/call (s): mean 0.034, max. 0.087, min. 0.030
discr_sys hypersurfaces
- setup:    Total time (s): +0.000, n calls: 1
- compute:  Total time (s): 0.234, n calls: 373, time/call (s): mean +0.000, max. 0.003, min. +0.000
- apply:    Total time (s): 0.393, n calls: 782, time/call (s): mean +0.000, max. 0.001, min. +0.000
Pipeline: muons
- setup:        Total time (s): 0.002, n calls: 1
- run:          Total time (s): 0.280, n calls: 781, time/call (s): mean +0.000, max. 0.002, min. +0.000
- get_outputs:  Total time (s): 0.564, n calls: 781, time/call (s): mean +0.000, max. 0.005, min. +0.000
Individual services:
data csv_icc_hist
- setup:    Total time (s): 0.002, n calls: 1
- compute:  Total time (s): +0.000, n calls: 171, time/call (s): mean +0.000, max. +0.000, min. +0.000
- apply:    Total time (s): 0.066, n calls: 781, time/call (s): mean +0.000, max. +0.000, min. +0.000

Here we can see that computing the outputs of the neutrino pipeline on average took approximately half a second.

Apart from the onetime calculation of the atmospheric neutrino fluxes by the “honda_ip” service and the loading of the MC events by the “csv_loader” service, the most time-consuming, recurring, operations occur in the “prob3” neutrino oscillation service and the “barr_simple” neutrino flux service.

In contrast, computing the outputs of the muon pipeline took at most $5\times 10^{-3}$ seconds.