2023
DOI: 10.1007/jhep01(2023)008
|View full text |Cite
|
Sign up to set email alerts
|

RanBox: anomaly detection in the copula space

Abstract: The unsupervised search for overdense regions in high-dimensional feature spaces, where locally high population densities may be associated with anomalous contaminations to an otherwise more uniform population, is of relevance to applications ranging from fundamental research to industrial use cases. Motivated by the specific needs of searches for new phenomena in particle collisions, we propose a novel approach that targets signals of interest populating compact regions of the feature space. The method consis… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
0
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(4 citation statements)
references
References 29 publications
0
0
0
Order By: Relevance
“…Recently, there have been many proposals for automating AD methods with machine learning [63][64][65][66][67][68][69][70][71][72][73][74][75][76][77][78][79][80][81][82] (see refs. [81][82][83][84] for overviews of the field).…”
Section: Introductionmentioning
confidence: 99%
“…Recently, there have been many proposals for automating AD methods with machine learning [63][64][65][66][67][68][69][70][71][72][73][74][75][76][77][78][79][80][81][82] (see refs. [81][82][83][84] for overviews of the field).…”
Section: Introductionmentioning
confidence: 99%
“…In spite of this wealth of possible practical applications, the fundamental question still needs to be studied, namely what defines an anomaly search at the LHC? For large and stochas-tic datasets, the concept of outliers is difficult to define unambiguously, because any LHC jet or event configuration will occur with a finite probability, especially after we include detector imperfections [51][52][53][54][55]. In this situation, a simple, working definition of anomalous data is an event which lies in a low-density phase space region.…”
Section: Introductionmentioning
confidence: 99%
“…While massive top jets stick out among low-mass QCD jets, we cannot expect a QCD jet to be special in a sample of top jets [55] since its simpler features will be interpolated by the neural network. A better-suited definition is based on low-probability regions in the background phase space distributions [56][57][58][59][60]. Here we can encode the phase space density, for instance, using cluster algorithms, VAEs, or a normalizing flow [15], but none of these methods are especially successful at identifying anomalous jets once the signal becomes more challenging than top jets.…”
Section: Introductionmentioning
confidence: 99%