skip to main content

This content will become publicly available on January 1, 2023

The Dark Machines Anomaly Score Challenge: Benchmark Data and Model Independent Event Classification for the Large Hadron Collider
We describe the outcome of a data challenge conducted as part of the Dark Machines (https://www.darkmachines.org) initiative and the Les Houches 2019 workshop on Physics at TeV colliders. The challenged aims to detect signals of new physics at the Large Hadron Collider (LHC) using unsupervised machine learning algorithms. First, we propose how an anomaly score could be implemented to define model-independent signal regions in LHC searches. We define and describe a large benchmark dataset, consisting of >1 billion simulated LHC events corresponding to 10\, fb^{-1} 10 f b − 1 of proton-proton collisions at a center-of-mass energy of 13 TeV. We then review a wide range of anomaly detection and density estimation algorithms, developed in the context of the data challenge, and we measure their performance in a set of realistic analysis environments. We draw a number of useful conclusions that will aid the development of unsupervised new physics searches during the third run of the LHC, and provide our benchmark dataset for future studies at https://www.phenoMLdata.org. Code to reproduce the analysis is provided at https://github.com/bostdiek/DarkMachines-UnsupervisedChallenge.
Authors:
; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; more »
Award ID(s):
Publication Date:
NSF-PAR ID:
10323045
Journal Name:
SciPost Physics
Volume:
12
Issue:
1
ISSN:
2542-4653
Sponsoring Org:
National Science Foundation
##### More Like this
1. ; ; (Ed.)
A new paradigm for data-driven, model-agnostic new physics searches at colliders is emerging, and aims to leverage recent breakthroughs in anomaly detection and machine learning. In order to develop and benchmark new anomaly detection methods within this framework, it is essential to have standard datasets. To this end, we have created the LHC Olympics 2020, a community challenge accompanied by a set of simulated collider events. Participants in these Olympics have developed their methods using an R&D dataset and then tested them on black boxes: datasets with an unknown anomaly (or not). This paper will review the LHC Olympics 2020more »
2. A bstract A search for new physics with non-resonant signals in dielectron and dimuon final states in the mass range above 2 TeV is presented. This is the first search for non-resonant signals in dilepton final states at the LHC to use a background estimate from the data. The data, corresponding to an integrated luminosity of 139 fb − 1 , were recorded by the ATLAS experiment in proton-proton collisions at a center-of-mass energy of $$\sqrt{s}$$ s = 13 TeV during Run 2 of the Large Hadron Collider. The benchmark signal signature is a two-quark and two-lepton contactmore »
3. A bstract A search for a light pseudoscalar Higgs boson (a) decaying from the 125 GeV (or a heavier) scalar Higgs boson (H) is performed using the 2016 LHC proton-proton collision data at $$\sqrt{s}$$ s = 13 TeV, corresponding to an integrated luminosity of 35 . 9 fb − 1 , collected by the CMS experiment. The analysis considers gluon fusion and vector boson fusion production of the H, followed by the decay H → aa → μμττ , and considers pseudoscalar masses in the range 3 . 6 < m a < 21 GeV. Because of themore »
4. A bstract The results of a search for new phenomena in final states with b -jets and missing transverse momentum using 139 fb − 1 of proton-proton data collected at a centre-of-mass energy $$\sqrt{s}$$ s = 13 TeV by the ATLAS detector at the LHC are reported. The analysis targets final states produced by the decay of a pair-produced supersymmetric bottom squark into a bottom quark and a stable neutralino. The analysis also seeks evidence for models of pair production of dark matter particles produced through the decay of a generic scalar or pseudoscalar mediator state in associationmore »
5. A bstract A search is presented for new particles produced at the LHC in proton-proton collisions at $$\sqrt{s}$$ s = 13 TeV, using events with energetic jets and large missing transverse momentum. The analysis is based on a data sample corresponding to an integrated luminosity of 101 fb − 1 , collected in 2017–2018 with the CMS detector. Machine learning techniques are used to define separate categories for events with narrow jets from initial-state radiation and events with large-radius jets consistent with a hadronic decay of a W or Z boson. A statistical combination is made with anmore »