Leveraging universality of jet taggers through transfer learning

Abstract A significant challenge in the tagging of boosted objects via machine-learning technology is the prohibitive computational cost associated with training sophisticated models. Nevertheless, the universality of QCD suggests that a large amount of the information learnt in the training is comm...

Full description

Bibliographic Details
Main Authors: Frédéric A. Dreyer, Radosław Grabarczyk, Pier Francesco Monni
Format: Article
Language:English
Published: SpringerOpen 2022-06-01
Series:European Physical Journal C: Particles and Fields
Online Access:https://doi.org/10.1140/epjc/s10052-022-10469-9
_version_ 1811334267369684992
author Frédéric A. Dreyer
Radosław Grabarczyk
Pier Francesco Monni
author_facet Frédéric A. Dreyer
Radosław Grabarczyk
Pier Francesco Monni
author_sort Frédéric A. Dreyer
collection DOAJ
description Abstract A significant challenge in the tagging of boosted objects via machine-learning technology is the prohibitive computational cost associated with training sophisticated models. Nevertheless, the universality of QCD suggests that a large amount of the information learnt in the training is common to different physical signals and experimental setups. In this article, we explore the use of transfer learning techniques to develop fast and data-efficient jet taggers that leverage such universality. We consider the graph neural networks LundNet and ParticleNet, and introduce two prescriptions to transfer an existing tagger into a new signal based either on fine-tuning all the weights of a model or alternatively on freezing a fraction of them. In the case of W-boson and top-quark tagging, we find that one can obtain reliable taggers using an order of magnitude less data with a corresponding speed-up of the training process. Moreover, while keeping the size of the training data set fixed, we observe a speed-up of the training by up to a factor of three. This offers a promising avenue to facilitate the use of such tools in collider physics experiments.
first_indexed 2024-04-13T17:05:14Z
format Article
id doaj.art-8635fff087374e51822b701ddedaa682
institution Directory Open Access Journal
issn 1434-6052
language English
last_indexed 2024-04-13T17:05:14Z
publishDate 2022-06-01
publisher SpringerOpen
record_format Article
series European Physical Journal C: Particles and Fields
spelling doaj.art-8635fff087374e51822b701ddedaa6822022-12-22T02:38:30ZengSpringerOpenEuropean Physical Journal C: Particles and Fields1434-60522022-06-0182611110.1140/epjc/s10052-022-10469-9Leveraging universality of jet taggers through transfer learningFrédéric A. Dreyer0Radosław Grabarczyk1Pier Francesco Monni2Clarendon Laboratory, Rudolf Peierls Centre for Theoretical Physics, University of OxfordClarendon Laboratory, Rudolf Peierls Centre for Theoretical Physics, University of OxfordTheoretical Physics Department, CERNAbstract A significant challenge in the tagging of boosted objects via machine-learning technology is the prohibitive computational cost associated with training sophisticated models. Nevertheless, the universality of QCD suggests that a large amount of the information learnt in the training is common to different physical signals and experimental setups. In this article, we explore the use of transfer learning techniques to develop fast and data-efficient jet taggers that leverage such universality. We consider the graph neural networks LundNet and ParticleNet, and introduce two prescriptions to transfer an existing tagger into a new signal based either on fine-tuning all the weights of a model or alternatively on freezing a fraction of them. In the case of W-boson and top-quark tagging, we find that one can obtain reliable taggers using an order of magnitude less data with a corresponding speed-up of the training process. Moreover, while keeping the size of the training data set fixed, we observe a speed-up of the training by up to a factor of three. This offers a promising avenue to facilitate the use of such tools in collider physics experiments.https://doi.org/10.1140/epjc/s10052-022-10469-9
spellingShingle Frédéric A. Dreyer
Radosław Grabarczyk
Pier Francesco Monni
Leveraging universality of jet taggers through transfer learning
European Physical Journal C: Particles and Fields
title Leveraging universality of jet taggers through transfer learning
title_full Leveraging universality of jet taggers through transfer learning
title_fullStr Leveraging universality of jet taggers through transfer learning
title_full_unstemmed Leveraging universality of jet taggers through transfer learning
title_short Leveraging universality of jet taggers through transfer learning
title_sort leveraging universality of jet taggers through transfer learning
url https://doi.org/10.1140/epjc/s10052-022-10469-9
work_keys_str_mv AT fredericadreyer leveraginguniversalityofjettaggersthroughtransferlearning
AT radosławgrabarczyk leveraginguniversalityofjettaggersthroughtransferlearning
AT pierfrancescomonni leveraginguniversalityofjettaggersthroughtransferlearning