Abstract

Understanding jets initiated by quarks and gluons is of fundamental importance in collider physics. Efficient and robust techniques for quark versus gluon jet discrimination have consequences for new physics searches, precision αs studies, parton distribution function extractions, and many other applications. Numerous machine learning analyses have attacked the problem, demonstrating that good performance can be obtained but generally not providing an understanding for what properties of the jets are responsible for that separation power. In this paper, we provide an extensive and detailed analysis of quark versus gluon discrimination from first-principles theoretical calculations. Working in the strongly-ordered soft and collinear limits, we calculate probability distributions for fixed N -body kinematics within jets with up through three resolved emissions left(mathcal{O}left({alpha}_s^3right)right) . This enables explicit calculation of quantities central to machine learning such as the likelihood ratio, the area under the receiver operating characteristic curve, and reducibility factors within a well-defined approximation scheme. Further, we relate the existence of a consistent power counting procedure for discrimination to ideas for operational flavor definitions, and we use this relationship to construct a power counting for quark versus gluon discrimination as an expansion in {e}^{C_F-{C}_A}ll 1 , the exponential of the fundamental and adjoint Casimirs. Our calculations provide insight into the discrimination performance of particle multiplicity and show how observables sensitive to all emissions in a jet are optimal. We compare our predictions to the performance of individual observables and neural networks with parton shower event generators, validating that our predictions describe the features identified by machine learning.

Highlights

  • High energy quarks and gluons fragment and hadronize into jets of particles through quantum chromodynamics (QCD)

  • We compare our predictions to the performance of individual observables and neural networks with parton shower event generators, validating that our predictions describe the features identified by machine learning

  • Power counting for quark and gluon jets is intrinsically more difficult because, as we demonstrate on any phase space with finitelymany resolved emissions, quark and gluon jets are not strictly mutually irreducible

Read more

Summary

Introduction

High energy quarks and gluons fragment and hadronize into jets of particles through quantum chromodynamics (QCD). Working in the strongly-ordered soft and collinear limits, we explicitly calculate the resummed probability distribution of multiple infrared and collinear safe observables on a jet These multiple observables enable a characterization of the emission phase space and evaluation of the optimal observable for discrimination. Our first step in developing a theory of quark versus gluon discrimination is to establish a robust power counting scheme that can be used to construct individual observables, strictly from general statements about the singular limits of QCD. An observable parametrically separates jet categories if power counting identifies arbitrarily pure samples at the phase space boundaries, enabling an unambiguous definition in a singular limit. An appendix applies reducibility ideas to the problem of up vs. down quark jet discrimination

Approximations and observables
Power counting and mutual irreducibility
Theoretically bounding classification performance
Quark and gluon power counting rules
Resolving one emission
Resolving the one-emission phase space
Higher order effects
Resolving two emissions
Fixed-order analysis
CF2 4 CA2
Including resummation
IRC safety of the likelihood
AUC evaluation
Resolving three emissions
CF3 log τ1 log τ2 log τ3 τ1τ2τ3
CF CA2
Calculation of gluon reducibility for any number of emissions
IRC safe multiplicity
Relationship of multiplicity to individual N -subjettiness observables
Comparison to Monte Carlo simulation
Probing machine learning strategies
Findings
10 Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call