https://doi.org/10.1140/epjc/s10052-024-13353-w
Regular Article - Experimental Physics
Classifier surrogates: sharing AI-based searches with the world
1
Institut für Experimentalphysik, Universität Hamburg, Luruper Chaussee 149, 22761, Hamburg, Germany
2
Institut für Experimentelle Teilchenphysik, Karlsruher Institut für Technologie, Wolfgang-Gaede-Str. 1, 76131, Karlsruhe, Germany
3
Institut für Stochastik, Karlsruher Institut für Technologie, Englerstr. 2, 76131, Karlsruhe, Germany
a
sebastian.guido.bieringer@uni-hamburg.de
Received:
27
February
2024
Accepted:
9
September
2024
Published online:
27
September
2024
In recent years, neural network-based classification has been used to improve data analysis at collider experiments. While this strategy proves to be hugely successful, the underlying models are not commonly shared with the public and rely on experiment-internal data as well as full detector simulations. We show a concrete implementation of a newly proposed strategy, so-called Classifier Surrogates, to be trained inside the experiments, that only utilise publicly accessible features and truth information. These surrogates approximate the original classifier distribution, and can be shared with the public. Subsequently, such a model can be evaluated by sampling the classification output from high-level information without requiring a sophisticated detector simulation. Technically, we show that continuous normalizing flows are a suitable generative architecture that can be efficiently trained to sample classification results using conditional flow matching. We further demonstrate that these models can be easily extended by Bayesian uncertainties to indicate their degree of validity when confronted with unknown inputs by the user. For a concrete example of tagging jets from hadronically decaying top quarks, we demonstrate the application of flows in combination with uncertainty estimation through either inference of a mean-field Gaussian weight posterior, or Monte Carlo sampling network weights.
© The Author(s) 2024
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Funded by SCOAP3.