https://doi.org/10.1140/epjc/s10052-025-14156-3
Regular Article - Experimental Physics
Trackformers: in search of transformer-based particle tracking for the high-luminosity LHC era
1
High-Energy Physics, Radboud University, Nijmegen, The Netherlands
2
Institute for Computing and Information Sciences, Radboud University, Nijmegen, The Netherlands
3
Faculty of Engineering Technology, University of Twente, Enschede, The Netherlands
4
Institute of Physics, University of Amsterdam, Amsterdam, The Netherlands
5
National Institute for Subatomic Physics (Nikhef), Amsterdam, The Netherlands
6
SURF, Amsterdam, The Netherlands
7
Intelligent Data Analysis Laboratory (IDAL), Department of Electronic Engineering, ETSE-UV, University of Valencia, Valencia, Spain
8
Instituto de Física Corpuscular, University of Valencia, Valencia, Spain
9
Valencian Graduate School and Research Network of Artificial Intelligence (ValgrAI), Valencia, Spain
Received:
24
July
2024
Accepted:
5
April
2025
Published online:
25
April
2025
High-Energy Physics experiments are facing a multi-fold data increase with every new iteration. This is certainly the case for the upcoming High-Luminosity LHC upgrade. Such increased data processing requirements forces revisions to almost every step of the data processing pipeline. One such step in need of an overhaul is the task of particle track reconstruction, a.k.a., tracking. A Machine Learning-assisted solution is expected to provide significant improvements, since the most time-consuming step in tracking is the assignment of hits to particles or track candidates. This is the topic of this paper. We take inspiration from large language models. As such, we consider two approaches: the prediction of the next word in a sentence (next hit point in a track), as well as the one-shot prediction of all hits within an event. In an extensive design effort, we have experimented with three models based on the Transformer architecture and one model based on the U-Net architecture, performing track association predictions for collision event hit points. In our evaluation, we consider a spectrum of simple to complex representations of the problem, eliminating designs with lower metrics early on. We report extensive results, covering both prediction accuracy (score) and computational performance. We have made use of the REDVID simulation framework, as well as reductions applied to the TrackML data set, to compose five data sets from simple to complex, for our experiments. The results highlight distinct advantages among different designs in terms of prediction accuracy and computational performance, demonstrating the efficiency of our methodology. Most importantly, the results show the viability of a one-shot encoder-classifier based Transformer solution as a practical approach for the task of tracking.
© The Author(s) 2025
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Funded by SCOAP3.