Abstract

While “quark” and “gluon” jets are often treated as separate, well-defined objects in both theoretical and experimental contexts, no precise, practical, and hadron-level definition of jet flavor presently exists. To remedy this issue, we develop and advocate for a data-driven, operational definition of quark and gluon jets that is readily applicable at colliders. Rather than specifying a per-jet flavor label, we aggregately define quark and gluon jets at the distribution level in terms of measured hadronic cross sections. Intuitively, quark and gluon jets emerge as the two maximally separable categories within two jet samples in data. Benefiting from recent work on data-driven classifiers and topic modeling for jets, we show that the practical tools needed to implement our definition already exist for experimental applications. As an informative example, we demonstrate the power of our operational definition using Z+jet and dijet samples, illustrating that pure quark and gluon distributions and fractions can be successfully extracted in a fully well-defined manner.

Highlights

  • Even setting aside the issue of jet flavor, ambiguity is already present whenever one wants to identify jets in an event [43]

  • We develop an operational definition of quark and gluon jets that is formulated solely in terms of experimentally-accessible quantities, does not rely on specific theoretical constructs such as factorization theorems, and can be readily implemented in a realistic context

  • Ref. [66] was the first to apply weak supervision methods in a particle physics context, showing that given mixed samples with known signal fractions, a quark/gluon classifier on a few highlevel inputs could be trained without access to per-jet truth labels, a paradigm termed learning from label proportions (LLP)

Read more

Summary

Defining quark and gluon jets

Due to the complicated radiative showering and fundamentally non-perturbative hadronization that occurs in the course of jets emerging from partons, there is no unambiguous definition of “quark” or “gluon” jets at the hadron-level. A phase space region (as defined by an unambiguous hadronic fiducial cross section measurement) that yields an enriched sample of quarks (as interpreted by some suitable, though fundamentally ambiguous criterion) This definition is attractive for numerous reasons. The definition only tags specific regions of phase space as “quark” or “gluon”, such as low or high values of some substructure observable, and provides no framework for discussing jet flavor outside of these regions To remedy this issue, we seek to upgrade the conceptual definition to an operational one by giving a concrete, data-driven method for optimally identifying quark- or gluon-enriched regions of phase space and obtaining full quark and gluon jet distributions

Motivating the operational definition
An operational definition of quark and gluon jets
Data-driven jet taggers and topics
Classification without labels: training classifiers on collider data
Jet topics: extracting categories from collider data
Optimal taggers for optimal topics
Event generation
Extracting reducibility factors and fractions
Ndijets
Self-calibrating classifiers
Obtaining observable distributions from extracted fractions
Conclusions
A Theoretical exploration of Casimir- and Poisson-scaling observables
B Details of observables and machine learning models
Findings
C Sample dependence in parton shower events

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.