Abstract
Binary discrimination between well-defined signal and background datasets is a problem of fundamental importance in particle physics. With detailed event simulation and the advent of extensive deep learning tools, identification of the likelihood ratio has typically been reserved as a computational problem. However, this approach can obscure overtraining or excessive sensitivity to tuned features of the simulation that may not be well-defined theoretically. Here, we present the first analysis of binary discrimination for signal and background distributions for which their likelihood ratio is infrared and collinear safe, and can therefore be calculated order-by-order in perturbation theory. We present explicit, general formulas for receiver operator characteristic curves and the area under it through next-to-leading order. These results can then establish absolute upper bounds on discrimination performance because any realistic implementation will have measurement errors, undetected particles, or restrictions on fiducial phase space. As a demonstration of this formalism, we apply it to discrimination of highly-boosted Higgs decays from gluon splitting to bottom quarks. Effects at next-to-leading order are first sensitive to the flow of color in the jet and significantly modify discrimination performance at leading-order. In the limit of infinite boost, these events can be perfectly discriminated because only the gluon will radiate at finite angles from the bottom quarks, and we find that large effects persist at energies accessible at the Large Hadron Collider. Next-to-leading order is therefore required to qualitatively understand results using machine-learning methods.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
J. Neyman and E.S. Pearson, On the Problem of the Most Efficient Tests of Statistical Hypotheses, Phil. Trans. Roy. Soc. Lond. A 231 (1933) 289 [INSPIRE].
C. Bierlich et al., A comprehensive guide to the physics and usage of PYTHIA 8.3, SciPost Phys. Codeb. 2022 (2022) 8 [arXiv:2203.11601] [INSPIRE].
J. Bellm et al., Herwig 7.2 release note, Eur. Phys. J. C 80 (2020) 452 [arXiv:1912.06509] [INSPIRE].
Sherpa collaboration, Event Generation with Sherpa 2.2, SciPost Phys. 7 (2019) 034 [arXiv:1905.09127] [INSPIRE].
A.J. Larkoski, I. Moult and B. Nachman, Jet Substructure at the Large Hadron Collider: A Review of Recent Advances in Theory and Mach. Learn., Phys. Rept. 841 (2020) 1 [arXiv:1709.04464] [INSPIRE].
D. Guest, K. Cranmer and D. Whiteson, Deep Learning and its Application to LHC Physics, Ann. Rev. Nucl. Part. Sci. 68 (2018) 161 [arXiv:1806.11484] [INSPIRE].
K. Albertsson et al., Machine Learning in High Energy Physics Community White Paper, J. Phys. Conf. Ser. 1085 (2018) 022008 [arXiv:1807.02876] [INSPIRE].
A. Radovic et al., Machine learning at the energy and intensity frontiers of particle physics, Nature 560 (2018) 41 [INSPIRE].
G. Carleo et al., Machine learning and the physical sciences, Rev. Mod. Phys. 91 (2019) 045002 [arXiv:1903.10563] [INSPIRE].
D. Bourilkov, Machine and Deep Learning Applications in Particle Physics, Int. J. Mod. Phys. A 34 (2020) 1930019 [arXiv:1912.08245] [INSPIRE].
M.D. Schwartz, Modern Machine Learning and Particle Physics, arXiv:2103.12226 [https://doi.org/10.1162/99608f92.beeb1183] [INSPIRE].
G. Karagiorgi et al., Machine Learning in the Search for New Fundamental Physics, arXiv:2112.03769 [INSPIRE].
A. Boehnlein et al., Colloquium: Machine learning in nuclear physics, Rev. Mod. Phys. 94 (2022) 031003 [arXiv:2112.02309] [INSPIRE].
P. Shanahan et al., Snowmass 2021 Computational Frontier CompF03 Topical Group Report: Machine Learning, arXiv:2209.07559 [INSPIRE].
G.F. Sterman and S. Weinberg, Jets from Quantum Chromodynamics, Phys. Rev. Lett. 39 (1977) 1436 [INSPIRE].
R.K. Ellis, W.J. Stirling and B.R. Webber, QCD and collider physics 8. Cambridge University Press (2011) [https://doi.org/10.1017/cbo9780511628788].
G. Soyez et al., Pileup subtraction for jet shapes, Phys. Rev. Lett. 110 (2013) 162001 [arXiv:1211.2811] [INSPIRE].
A.J. Larkoski and E.M. Metodiev, A Theory of Quark vs. Gluon Discrimination, JHEP 10 (2019) 014 [arXiv:1906.01639] [INSPIRE].
G. Kasieczka, S. Marzani, G. Soyez and G. Stagnitto, Towards Machine Learning Analytics for Jet Substructure, JHEP 09 (2020) 195 [arXiv:2007.04319] [INSPIRE].
S. Catani et al., New clustering algorithm for multi-jet cross-sections in e+e− annihilation, Phys. Lett. B 269 (1991) 432 [INSPIRE].
S. Catani, Y.L. Dokshitzer, M.H. Seymour and B.R. Webber, Longitudinally invariant Kt clustering algorithms for hadron hadron collisions, Nucl. Phys. B 406 (1993) 187 [INSPIRE].
S.D. Ellis and D.E. Soper, Successive combination jet algorithm for hadron collisions, Phys. Rev. D 48 (1993) 3160 [hep-ph/9305266] [INSPIRE].
Y.L. Dokshitzer, G.D. Leder, S. Moretti and B.R. Webber, Better jet clustering algorithms, JHEP 08 (1997) 001 [hep-ph/9707323] [INSPIRE].
M. Wobisch and T. Wengler, Hadronization corrections to jet cross-sections in deep inelastic scattering, in the proceedings of the Workshop on Monte Carlo Generators for HERA Physics (Plenary Starting Meeting), Hamburg, Germany, April 27–30 (1998) [hep-ph/9907280] [INSPIRE].
J.M. Butterworth, A.R. Davison, M. Rubin and G.P. Salam, Jet substructure as a new Higgs search channel at the LHC, Phys. Rev. Lett. 100 (2008) 242001 [arXiv:0802.2470] [INSPIRE].
J. Lin, M. Freytsis, I. Moult and B. Nachman, Boosting H → \( b\overline{b} \) with Machine Learning, JHEP 10 (2018) 101 [arXiv:1807.10768] [INSPIRE].
K. Datta, A. Larkoski and B. Nachman, Automating the Construction of Jet Observables with Machine Learning, Phys. Rev. D 100 (2019) 095016 [arXiv:1902.07180] [INSPIRE].
E.A. Moreno et al., Interaction networks for the identification of boosted H → \( b\overline{b} \) decays, Phys. Rev. D 102 (2020) 012010 [arXiv:1909.12285] [INSPIRE].
A. Chakraborty, S.H. Lim and M.M. Nojiri, Interpretable deep learning for two-prong jet classification with jet spectra, JHEP 07 (2019) 135 [arXiv:1904.02092] [INSPIRE].
CMS collaboration, Identification of heavy, energetic, hadronically decaying particles using machine-learning techniques, 2020 JINST 15 P06005 [arXiv:2004.08262] [INSPIRE].
Y.-L. Chung, S.-C. Hsu and B. Nachman, Disentangling Boosted Higgs Boson Production Modes with Machine Learning, 2021 JINST 16 P07002 [arXiv:2009.05930] [INSPIRE].
B. Tannenwald et al., Benchmarking Machine Learning Techniques with Di-Higgs Production at the LHC, arXiv:2009.06754 [INSPIRE].
J. Guo, J. Li, T. Li and R. Zhang, Boosted Higgs boson jet reconstruction via a graph neural network, Phys. Rev. D 103 (2021) 116025 [arXiv:2010.05464] [INSPIRE].
M. Abbas, A. Khan, A.S. Qureshi and M.W. Khan, Extracting Signals of Higgs Boson From Background Noise Using Deep Neural Networks, arXiv:2010.08201 [INSPIRE].
C. Jang et al., Learning to increase matching efficiency in identifying additional b-jets in the \( \overline{tt}b\overline{b} \) process, Eur. Phys. J. Plus 137 (2022) 870 [arXiv:2103.09129] [INSPIRE].
C.K. Khosa and S. Marzani, Higgs boson tagging with the Lund jet plane, Phys. Rev. D 104 (2021) 055043 [arXiv:2105.03989] [INSPIRE].
A.J. Larkoski and J. Thaler, Unsafe but Calculable: Ratios of Angularities in Perturbative QCD, JHEP 09 (2013) 137 [arXiv:1307.1699] [INSPIRE].
M. Cacciari, G.P. Salam and G. Soyez, The anti-kt jet clustering algorithm, JHEP 04 (2008) 063 [arXiv:0802.1189] [INSPIRE].
CMS collaboration, Identification of heavy-flavour jets with the CMS detector in pp collisions at 13 TeV, 2018 JINST 13 P05011 [arXiv:1712.07158] [INSPIRE].
ATLAS collaboration, ATLAS flavour-tagging algorithms for the LHC Run 2 pp collision dataset, Eur. Phys. J. C 83 (2023) 681 [arXiv:2211.16345] [INSPIRE].
S. Caletti, A.J. Larkoski, S. Marzani and D. Reichelt, Practical jet flavour through NNLO, Eur. Phys. J. C 82 (2022) 632 [arXiv:2205.01109] [INSPIRE].
F. Caola et al., Flavored jets with exact anti-kt kinematics and tests of infrared and collinear safety, Phys. Rev. D 108 (2023) 094010 [arXiv:2306.07314] [INSPIRE].
JADE collaboration, Experimental Studies on Multi-Jet Production in e+e− Annihilation at PETRA Energies, Z. Phys. C 33 (1986) 23 [INSPIRE].
JADE collaboration, Experimental Investigation of the Energy Dependence of the Strong Coupling Strength, Phys. Lett. B 213 (1988) 235 [INSPIRE].
A. Banfi, G.P. Salam and G. Zanderighi, Infrared safe definition of jet flavor, Eur. Phys. J. C 47 (2006) 113 [hep-ph/0601139] [INSPIRE].
S. Caletti, A.J. Larkoski, S. Marzani and D. Reichelt, A fragmentation approach to jet flavor, JHEP 10 (2022) 158 [arXiv:2205.01117] [INSPIRE].
M. Czakon, A. Mitov and R. Poncelet, Infrared-safe flavoured anti-kT jets, JHEP 04 (2023) 138 [arXiv:2205.11879] [INSPIRE].
R. Gauld, A. Huss and G. Stagnitto, Flavor Identification of Reconstructed Hadronic Jets, Phys. Rev. Lett. 130 (2023) 161901 [arXiv:2208.11138] [INSPIRE].
V.N. Gribov and L.N. Lipatov, Deep inelastic e p scattering in perturbation theory, Sov. J. Nucl. Phys. 15 (1972) 438 [INSPIRE].
V.N. Gribov and L.N. Lipatov, e+e− pair annihilation and deep inelastic e p scattering in perturbation theory, Sov. J. Nucl. Phys. 15 (1972) 675 [INSPIRE].
L.N. Lipatov, The parton model and perturbation theory, Yad. Fiz. 20 (1974) 181 [INSPIRE].
Y.L. Dokshitzer, Calculation of the Structure Functions for Deep Inelastic Scattering and e+e− Annihilation by Perturbation Theory in Quantum Chromodynamics, Sov. Phys. JETP 46 (1977) 641 [INSPIRE].
G. Altarelli and G. Parisi, Asymptotic Freedom in Parton Language, Nucl. Phys. B 126 (1977) 298 [INSPIRE].
A.J. Larkoski, G.P. Salam and J. Thaler, Energy Correlation Functions for Jet Substructure, JHEP 06 (2013) 108 [arXiv:1305.0007] [INSPIRE].
A. Gehrmann-De Ridder and E.W.N. Glover, A Complete O(ααs) calculation of the photon +1 jet rate in e+e− annihilation, Nucl. Phys. B 517 (1998) 269 [hep-ph/9707224] [INSPIRE].
M. Ritzmann and W.J. Waalewijn, Fragmentation in Jets at NNLO, Phys. Rev. D 90 (2014) 054029 [arXiv:1407.3272] [INSPIRE].
R.K. Ellis, D.A. Ross and A.E. Terrano, The Perturbative Calculation of Jet Structure in e+e− Annihilation, Nucl. Phys. B 178 (1981) 421 [INSPIRE].
S. Frixione, Z. Kunszt and A. Signer, Three jet cross-sections to next-to-leading order, Nucl. Phys. B 467 (1996) 399 [hep-ph/9512328] [INSPIRE].
S. Catani and M.H. Seymour, A general algorithm for calculating jet cross-sections in NLO QCD, Nucl. Phys. B 485 (1997) 291 [hep-ph/9605323] [INSPIRE].
J.M. Campbell and E.W.N. Glover, Double unresolved approximations to multiparton scattering amplitudes, Nucl. Phys. B 527 (1998) 264 [hep-ph/9710255] [INSPIRE].
S. Catani and M. Grazzini, Infrared factorization of tree level QCD amplitudes at the next-to-next-to-leading order and beyond, Nucl. Phys. B 570 (2000) 287 [hep-ph/9908523] [INSPIRE].
A. Buckley, G. Callea, A.J. Larkoski and S. Marzani, An Optimal Observable for Color Singlet Identification, SciPost Phys. 9 (2020) 026 [arXiv:2006.10480] [INSPIRE].
G.P. Lepage, A New Algorithm for Adaptive Multidimensional Integration, J. Comput. Phys. 27 (1978) 192 [INSPIRE].
G.P. Lepage, VEGAS: an adaptive multidimensional integration program, CLNS-80/447 (1980) [INSPIRE].
T. Hahn, CUBA: A library for multidimensional numerical integration, Comput. Phys. Commun. 168 (2005) 78 [hep-ph/0404043] [INSPIRE].
Particle Data Group collaboration, Review of Particle Physics, PTEP 2022 (2022) 083C01 [INSPIRE].
C.W. Bauer and B.O. Lange, Scale setting and resummation of logarithms in pp → V + jets, arXiv:0905.4739 [INSPIRE].
A. Denner, S. Dittmaier, T. Kasprzik and A. Muck, Electroweak corrections to W + jet hadroproduction including leptonic W-boson decays, JHEP 08 (2009) 075 [arXiv:0906.1656] [INSPIRE].
M. Rubin, G.P. Salam and S. Sapeta, Giant QCD K-factors beyond NLO, JHEP 09 (2010) 084 [arXiv:1006.2144] [INSPIRE].
V. Del Duca et al., Tree-level splitting amplitudes for a quark into four collinear partons, JHEP 02 (2020) 189 [arXiv:1912.06425] [INSPIRE].
V. Del Duca et al., Tree-level splitting amplitudes for a gluon into four collinear partons, JHEP 10 (2020) 093 [arXiv:2007.05345] [INSPIRE].
ATLAS collaboration, Measurements of Higgs bosons decaying to bottom quarks from vector boson fusion production with the ATLAS experiment at \( \sqrt{s} \) = 13 TeV, Eur. Phys. J. C 81 (2021) 537 [arXiv:2011.08280] [INSPIRE].
ATLAS collaboration, Measurements of WH and ZH production in the H → \( b\overline{b} \) decay channel in pp collisions at 13 TeV with the ATLAS detector, Eur. Phys. J. C 81 (2021) 178 [arXiv:2007.02873] [INSPIRE].
CMS collaboration, A portrait of the Higgs boson by the CMS experiment ten years after the discovery, Nature 607 (2022) 60 [arXiv:2207.00043] [INSPIRE].
J. Gallicchio and M.D. Schwartz, Seeing in Color: Jet Superstructure, Phys. Rev. Lett. 105 (2010) 022001 [arXiv:1001.5027] [INSPIRE].
D. Curtin, R. Essig and B. Shuve, Boosted Multijet Resonances and New Color-Flow Variables, Phys. Rev. D 88 (2013) 034019 [arXiv:1210.5523] [INSPIRE].
A. Butter et al., The Machine Learning landscape of top taggers, SciPost Phys. 7 (2019) 014 [arXiv:1902.09914] [INSPIRE].
W.T. Giele and E.W.N. Glover, Higher order corrections to jet cross-sections in e+e− annihilation, Phys. Rev. D 46 (1992) 1980 [INSPIRE].
Acknowledgments
I thank Gregor Kasieczka for discussions and providing the motivation for this analysis and Simone Marzani for comments. This work was supported in part by the UC Southern California Hub, with funding from the UC National Laboratories division of the University of California Office of the President.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
ArXiv ePrint: 2309.14417
Rights and permissions
Open Access . This article is distributed under the terms of the Creative Commons Attribution License (CC-BY 4.0), which permits any use, distribution and reproduction in any medium, provided the original author(s) and source are credited.
About this article
Cite this article
Larkoski, A.J. Binary discrimination through next-to-leading order. J. High Energ. Phys. 2024, 57 (2024). https://doi.org/10.1007/JHEP03(2024)057
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/JHEP03(2024)057