Anomaly Detection with Multiple Reference Datasets in High Energy Physics

Chen, Mayee; Nachman, Benjamin; Sala, Frederic

Citation Details

An important class of techniques for resonant anomaly detection in high energy physics builds models that can distinguish between reference and target datasets, where only the latter has appreciable signal. Such techniques, including Classification Without Labels (CWOLA) and Simulation Assisted Likelihood-free Anomaly Detection (SALAD) rely on a single reference dataset. They cannot take advantage of commonly-available multiple datasets and thus cannot fully exploit available information. In this work, we propose generalizations of CWOLA and SALAD for settings where multiple reference datasets are available, building on weak supervision techniques. We demonstrate improved performance in a number of settings with real and synthetic data. As an added benefit, our generalizations enable us to provide finite-sample guarantees, improving on existing asymptotic analyses. more »

Award ID(s):: 2106707

PAR ID:: 10427110

Author(s) / Creator(s):: Chen, Mayee; Nachman, Benjamin; Sala, Frederic

Date Published:: 2022-01-01

Journal Name:: NeurIPS Workshop on Machine Learning and the Physical Sciences (ML4PS)

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this