Exploring the space of jets with CMS open data
We explore the metric space of jets using public collider data from the CMS experiment. Starting from 2.3 fb^{-1} of proton-proton collisions at sqrt[s]=7 TeV collected at the Large Hadron Collider in 2011, we isolate a sample of 1,690,984 central jets with transverse momentum above 375 GeV. To va...
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
American Physical Society (APS)
2020
|
Online Access: | https://hdl.handle.net/1721.1/125462 |
_version_ | 1826211275880792064 |
---|---|
author | Komiske, Patrick T. Mastandrea, Radha R. Metodiev, Eric Mario Naik, Preksha Thaler, Jesse |
author2 | Massachusetts Institute of Technology. Center for Theoretical Physics |
author_facet | Massachusetts Institute of Technology. Center for Theoretical Physics Komiske, Patrick T. Mastandrea, Radha R. Metodiev, Eric Mario Naik, Preksha Thaler, Jesse |
author_sort | Komiske, Patrick T. |
collection | MIT |
description | We explore the metric space of jets using public collider data from the CMS experiment. Starting from 2.3 fb^{-1} of proton-proton collisions at sqrt[s]=7 TeV collected at the Large Hadron Collider in 2011, we isolate a sample of 1,690,984 central jets with transverse momentum above 375 GeV. To validate the performance of the CMS detector in reconstructing the energy flow of jets, we compare the CMS Open Data to corresponding simulated data samples for a variety of jet kinematic and substructure observables. Even without detector unfolding, we find very good agreement for track-based observables after using charged hadron subtraction to mitigate the impact of pileup. We perform a range of novel analyses, using the “energy mover’s distance” (EMD) to measure the pairwise difference between jet energy flows. The EMD allows us to quantify the impact of detector effects, visualize the metric space of jets, extract correlation dimensions, and identify the most and least typical jet configurations. To facilitate future jet studies with CMS Open Data, we make our datasets and analysis code available, amounting to around two gigabytes of distilled data and one hundred gigabytes of simulation files. |
first_indexed | 2024-09-23T15:03:40Z |
format | Article |
id | mit-1721.1/125462 |
institution | Massachusetts Institute of Technology |
language | English |
last_indexed | 2024-09-23T15:03:40Z |
publishDate | 2020 |
publisher | American Physical Society (APS) |
record_format | dspace |
spelling | mit-1721.1/1254622022-09-29T12:22:45Z Exploring the space of jets with CMS open data Komiske, Patrick T. Mastandrea, Radha R. Metodiev, Eric Mario Naik, Preksha Thaler, Jesse Massachusetts Institute of Technology. Center for Theoretical Physics We explore the metric space of jets using public collider data from the CMS experiment. Starting from 2.3 fb^{-1} of proton-proton collisions at sqrt[s]=7 TeV collected at the Large Hadron Collider in 2011, we isolate a sample of 1,690,984 central jets with transverse momentum above 375 GeV. To validate the performance of the CMS detector in reconstructing the energy flow of jets, we compare the CMS Open Data to corresponding simulated data samples for a variety of jet kinematic and substructure observables. Even without detector unfolding, we find very good agreement for track-based observables after using charged hadron subtraction to mitigate the impact of pileup. We perform a range of novel analyses, using the “energy mover’s distance” (EMD) to measure the pairwise difference between jet energy flows. The EMD allows us to quantify the impact of detector effects, visualize the metric space of jets, extract correlation dimensions, and identify the most and least typical jet configurations. To facilitate future jet studies with CMS Open Data, we make our datasets and analysis code available, amounting to around two gigabytes of distilled data and one hundred gigabytes of simulation files. Office of Nuclear Physics of the U.S. Department of Energy (DOE) under Grant No. DE-SC0011090 DOE Office of High Energy Physics under Grants No. DE-SC0012567 and No. DE-SC0019128 2020-05-26T20:37:22Z 2020-05-26T20:37:22Z 2020-02 2019-09 2020-02-11T15:24:26Z Article http://purl.org/eprint/type/JournalArticle 2470-0010 2470-0029 https://hdl.handle.net/1721.1/125462 Komiske, Patrick T., et al. "Exploring the space of jets with CMS open data." Physical Review D, 101, 3 (February 2020): 034009. en http://dx.doi.org/10.1103/PhysRevD.101.034009 Physical Review D Creative Commons Attribution 3.0 unported license http://creativecommons.org/licenses/by/3.0 application/pdf American Physical Society (APS) American Physical Society |
spellingShingle | Komiske, Patrick T. Mastandrea, Radha R. Metodiev, Eric Mario Naik, Preksha Thaler, Jesse Exploring the space of jets with CMS open data |
title | Exploring the space of jets with CMS open data |
title_full | Exploring the space of jets with CMS open data |
title_fullStr | Exploring the space of jets with CMS open data |
title_full_unstemmed | Exploring the space of jets with CMS open data |
title_short | Exploring the space of jets with CMS open data |
title_sort | exploring the space of jets with cms open data |
url | https://hdl.handle.net/1721.1/125462 |
work_keys_str_mv | AT komiskepatrickt exploringthespaceofjetswithcmsopendata AT mastandrearadhar exploringthespaceofjetswithcmsopendata AT metodievericmario exploringthespaceofjetswithcmsopendata AT naikpreksha exploringthespaceofjetswithcmsopendata AT thalerjesse exploringthespaceofjetswithcmsopendata |