Exploring the space of jets with CMS open data

We explore the metric space of jets using public collider data from the CMS experiment. Starting from 2.3  fb^{-1} of proton-proton collisions at sqrt[s]=7  TeV collected at the Large Hadron Collider in 2011, we isolate a sample of 1,690,984 central jets with transverse momentum above 375 GeV. To va...

Full description

Bibliographic Details
Main Authors: Komiske, Patrick T., Mastandrea, Radha R., Metodiev, Eric Mario, Naik, Preksha, Thaler, Jesse
Other Authors: Massachusetts Institute of Technology. Center for Theoretical Physics
Format: Article
Language:English
Published: American Physical Society (APS) 2020
Online Access:https://hdl.handle.net/1721.1/125462
_version_ 1826211275880792064
author Komiske, Patrick T.
Mastandrea, Radha R.
Metodiev, Eric Mario
Naik, Preksha
Thaler, Jesse
author2 Massachusetts Institute of Technology. Center for Theoretical Physics
author_facet Massachusetts Institute of Technology. Center for Theoretical Physics
Komiske, Patrick T.
Mastandrea, Radha R.
Metodiev, Eric Mario
Naik, Preksha
Thaler, Jesse
author_sort Komiske, Patrick T.
collection MIT
description We explore the metric space of jets using public collider data from the CMS experiment. Starting from 2.3  fb^{-1} of proton-proton collisions at sqrt[s]=7  TeV collected at the Large Hadron Collider in 2011, we isolate a sample of 1,690,984 central jets with transverse momentum above 375 GeV. To validate the performance of the CMS detector in reconstructing the energy flow of jets, we compare the CMS Open Data to corresponding simulated data samples for a variety of jet kinematic and substructure observables. Even without detector unfolding, we find very good agreement for track-based observables after using charged hadron subtraction to mitigate the impact of pileup. We perform a range of novel analyses, using the “energy mover’s distance” (EMD) to measure the pairwise difference between jet energy flows. The EMD allows us to quantify the impact of detector effects, visualize the metric space of jets, extract correlation dimensions, and identify the most and least typical jet configurations. To facilitate future jet studies with CMS Open Data, we make our datasets and analysis code available, amounting to around two gigabytes of distilled data and one hundred gigabytes of simulation files.
first_indexed 2024-09-23T15:03:40Z
format Article
id mit-1721.1/125462
institution Massachusetts Institute of Technology
language English
last_indexed 2024-09-23T15:03:40Z
publishDate 2020
publisher American Physical Society (APS)
record_format dspace
spelling mit-1721.1/1254622022-09-29T12:22:45Z Exploring the space of jets with CMS open data Komiske, Patrick T. Mastandrea, Radha R. Metodiev, Eric Mario Naik, Preksha Thaler, Jesse Massachusetts Institute of Technology. Center for Theoretical Physics We explore the metric space of jets using public collider data from the CMS experiment. Starting from 2.3  fb^{-1} of proton-proton collisions at sqrt[s]=7  TeV collected at the Large Hadron Collider in 2011, we isolate a sample of 1,690,984 central jets with transverse momentum above 375 GeV. To validate the performance of the CMS detector in reconstructing the energy flow of jets, we compare the CMS Open Data to corresponding simulated data samples for a variety of jet kinematic and substructure observables. Even without detector unfolding, we find very good agreement for track-based observables after using charged hadron subtraction to mitigate the impact of pileup. We perform a range of novel analyses, using the “energy mover’s distance” (EMD) to measure the pairwise difference between jet energy flows. The EMD allows us to quantify the impact of detector effects, visualize the metric space of jets, extract correlation dimensions, and identify the most and least typical jet configurations. To facilitate future jet studies with CMS Open Data, we make our datasets and analysis code available, amounting to around two gigabytes of distilled data and one hundred gigabytes of simulation files. Office of Nuclear Physics of the U.S. Department of Energy (DOE) under Grant No. DE-SC0011090 DOE Office of High Energy Physics under Grants No. DE-SC0012567 and No. DE-SC0019128 2020-05-26T20:37:22Z 2020-05-26T20:37:22Z 2020-02 2019-09 2020-02-11T15:24:26Z Article http://purl.org/eprint/type/JournalArticle 2470-0010 2470-0029 https://hdl.handle.net/1721.1/125462 Komiske, Patrick T., et al. "Exploring the space of jets with CMS open data." Physical Review D, 101, 3 (February 2020): 034009. en http://dx.doi.org/10.1103/PhysRevD.101.034009 Physical Review D Creative Commons Attribution 3.0 unported license http://creativecommons.org/licenses/by/3.0 application/pdf American Physical Society (APS) American Physical Society
spellingShingle Komiske, Patrick T.
Mastandrea, Radha R.
Metodiev, Eric Mario
Naik, Preksha
Thaler, Jesse
Exploring the space of jets with CMS open data
title Exploring the space of jets with CMS open data
title_full Exploring the space of jets with CMS open data
title_fullStr Exploring the space of jets with CMS open data
title_full_unstemmed Exploring the space of jets with CMS open data
title_short Exploring the space of jets with CMS open data
title_sort exploring the space of jets with cms open data
url https://hdl.handle.net/1721.1/125462
work_keys_str_mv AT komiskepatrickt exploringthespaceofjetswithcmsopendata
AT mastandrearadhar exploringthespaceofjetswithcmsopendata
AT metodievericmario exploringthespaceofjetswithcmsopendata
AT naikpreksha exploringthespaceofjetswithcmsopendata
AT thalerjesse exploringthespaceofjetswithcmsopendata