Abstract
We develop a self-supervised method for density-based anomaly detection using contrastive learning, and test it using event-level anomaly data from CMS ADC2021. The AnomalyCLR technique is data-driven and uses augmentations of the background data to mimic non-Standard-Model events in a model-agnostic way. It uses a permutation-invariant Transformer Encoder architecture to map the objects measured in a collider event to the representation space, where the data augmentations define a representation space which is sensitive to potential anomalous features. An AutoEncoder trained on background representations then computes anomaly scores for a variety of signals in the representation space. With AnomalyCLR we find significant improvements on performance metrics for all signals when compared to the raw data baseline.
| Original language | English |
|---|---|
| Article number | 056 |
| Pages (from-to) | 1-19 |
| Number of pages | 19 |
| Journal | SciPost physics core |
| Volume | 7 |
| Issue number | 3 |
| Early online date | 16 Aug 2024 |
| DOIs | |
| Publication status | Published online - 16 Aug 2024 |
Bibliographical note
Publisher Copyright:Copyright B. M. Dillon et al.
Funding
BMD acknowledges funding from the Alexander von Humboldt Foundation. LF, TM, and TP are funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under grant 396021762- TRR 257: Particle Physics Phenomenology after the Higgs Discovery and Germany’s Excellence Strategy EXC 2181/1- 390900948 (the Heidelberg STRUCTURES Excellence Cluster).
| Funders | Funder number |
|---|---|
| Alexander von Humboldt Stiftung | |
| 396021762 - TRR 257, EXC 2181/1 - 390900948 |
Keywords
- machine-learning
- physics
- self-supervised
- LHC
- anomaly detection
- autoencoder
- transformer
Fingerprint
Dive into the research topics of 'Anomalies, representations, and self-supervision'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver