Leveraging universality of jet taggers through transfer learning

Abstract A significant challenge in the tagging of boosted objects via machine-learning technology is the prohibitive computational cost associated with training sophisticated models. Nevertheless, the universality of QCD suggests that a large amount of the information learnt in the training is comm...

Full description

Bibliographic Details
Main Authors:	Frédéric A. Dreyer, Radosław Grabarczyk, Pier Francesco Monni
Format:	Article
Language:	English
Published:	SpringerOpen 2022-06-01
Series:	European Physical Journal C: Particles and Fields
Online Access:	https://doi.org/10.1140/epjc/s10052-022-10469-9

_version_	1811334267369684992
author	Frédéric A. Dreyer Radosław Grabarczyk Pier Francesco Monni
author_facet	Frédéric A. Dreyer Radosław Grabarczyk Pier Francesco Monni
author_sort	Frédéric A. Dreyer
collection	DOAJ
description	Abstract A significant challenge in the tagging of boosted objects via machine-learning technology is the prohibitive computational cost associated with training sophisticated models. Nevertheless, the universality of QCD suggests that a large amount of the information learnt in the training is common to different physical signals and experimental setups. In this article, we explore the use of transfer learning techniques to develop fast and data-efficient jet taggers that leverage such universality. We consider the graph neural networks LundNet and ParticleNet, and introduce two prescriptions to transfer an existing tagger into a new signal based either on fine-tuning all the weights of a model or alternatively on freezing a fraction of them. In the case of W-boson and top-quark tagging, we find that one can obtain reliable taggers using an order of magnitude less data with a corresponding speed-up of the training process. Moreover, while keeping the size of the training data set fixed, we observe a speed-up of the training by up to a factor of three. This offers a promising avenue to facilitate the use of such tools in collider physics experiments.
first_indexed	2024-04-13T17:05:14Z
format	Article
id	doaj.art-8635fff087374e51822b701ddedaa682
institution	Directory Open Access Journal
issn	1434-6052
language	English
last_indexed	2024-04-13T17:05:14Z
publishDate	2022-06-01
publisher	SpringerOpen
record_format	Article
series	European Physical Journal C: Particles and Fields
spelling	doaj.art-8635fff087374e51822b701ddedaa6822022-12-22T02:38:30ZengSpringerOpenEuropean Physical Journal C: Particles and Fields1434-60522022-06-0182611110.1140/epjc/s10052-022-10469-9Leveraging universality of jet taggers through transfer learningFrédéric A. Dreyer0Radosław Grabarczyk1Pier Francesco Monni2Clarendon Laboratory, Rudolf Peierls Centre for Theoretical Physics, University of OxfordClarendon Laboratory, Rudolf Peierls Centre for Theoretical Physics, University of OxfordTheoretical Physics Department, CERNAbstract A significant challenge in the tagging of boosted objects via machine-learning technology is the prohibitive computational cost associated with training sophisticated models. Nevertheless, the universality of QCD suggests that a large amount of the information learnt in the training is common to different physical signals and experimental setups. In this article, we explore the use of transfer learning techniques to develop fast and data-efficient jet taggers that leverage such universality. We consider the graph neural networks LundNet and ParticleNet, and introduce two prescriptions to transfer an existing tagger into a new signal based either on fine-tuning all the weights of a model or alternatively on freezing a fraction of them. In the case of W-boson and top-quark tagging, we find that one can obtain reliable taggers using an order of magnitude less data with a corresponding speed-up of the training process. Moreover, while keeping the size of the training data set fixed, we observe a speed-up of the training by up to a factor of three. This offers a promising avenue to facilitate the use of such tools in collider physics experiments.https://doi.org/10.1140/epjc/s10052-022-10469-9
spellingShingle	Frédéric A. Dreyer Radosław Grabarczyk Pier Francesco Monni Leveraging universality of jet taggers through transfer learning European Physical Journal C: Particles and Fields
title	Leveraging universality of jet taggers through transfer learning
title_full	Leveraging universality of jet taggers through transfer learning
title_fullStr	Leveraging universality of jet taggers through transfer learning
title_full_unstemmed	Leveraging universality of jet taggers through transfer learning
title_short	Leveraging universality of jet taggers through transfer learning
title_sort	leveraging universality of jet taggers through transfer learning
url	https://doi.org/10.1140/epjc/s10052-022-10469-9
work_keys_str_mv	AT fredericadreyer leveraginguniversalityofjettaggersthroughtransferlearning AT radosławgrabarczyk leveraginguniversalityofjettaggersthroughtransferlearning AT pierfrancescomonni leveraginguniversalityofjettaggersthroughtransferlearning

Leveraging universality of jet taggers through transfer learning

Similar Items