2020
DOI: 10.48550/arxiv.2007.14400
Preprint

ABCDisCo: Automating the ABCD Method with Machine Learning

Gregor Kasieczka,
Benjamin Nachman,
Matthew D. Schwartz
et al.

Abstract: The ABCD method is one of the most widely used data-driven background estimation techniques in high energy physics. Cuts on two statistically-independent classifiers separate signal and background into four regions, so that background in the signal region can be estimated simply using the other three control regions. Typically, the independent classifiers are chosen "by hand" to be intuitive and physically motivated variables. Here, we explore the possibility of automating the design of one or both of these cl…

Cited by 5 publications (7 citation statements). References 52 publications (101 reference statements).
“…In both approaches, the data are used directly to transport predictions from a control region to the signal sensitive region instead of relying on simulation for this extrapolation. In the ABCD method, two classifiers f and g and two working points a and b are constructed and then four regions called A, B, C and D are defined by f ≶ a and g ≶ b (for a machine learning version of ABCD, see [21–23]). If f and g are independent, then one can relate the background prediction in the region f > a and g > b to the other three regions.…”
Section: Model Dependence in HEP Data Analysis
mentioning (confidence: 99%)
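The relation quoted above is easy to state concretely. Below is a minimal sketch (not code from the paper) of the ABCD background estimate, using the common labeling in which A is the doubly-passing signal region and B, C, D are the control regions; the function name and the toy closure check are illustrative assumptions.

```python
import numpy as np

def abcd_estimate(f, g, a, b):
    """Estimate the background yield in the signal region A = {f > a, g > b}
    from the three control regions, assuming f and g are independent for
    background events, so that N_A ~ N_B * N_C / N_D."""
    f = np.asarray(f)
    g = np.asarray(g)
    n_b = np.sum((f > a) & (g <= b))   # region B: pass f, fail g
    n_c = np.sum((f <= a) & (g > b))   # region C: fail f, pass g
    n_d = np.sum((f <= a) & (g <= b))  # region D: fail both cuts
    if n_d == 0:
        raise ValueError("Region D is empty; the ABCD estimate is undefined.")
    return n_b * n_c / n_d

# Toy check: two independent uniform "scores", for which closure holds by construction.
rng = np.random.default_rng(0)
f_scores = rng.uniform(size=100_000)
g_scores = rng.uniform(size=100_000)
predicted = abcd_estimate(f_scores, g_scores, a=0.9, b=0.9)
observed = np.sum((f_scores > 0.9) & (g_scores > 0.9))
print(f"predicted A: {predicted:.1f}, observed A: {observed}")
```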
“…One can simply remove mass-sensitive features from the training, but powerful classifiers can learn the mass indirectly through subtle correlations with other useful features. A variety of decorrelation techniques exist to solve this problem [21, 26–39]. In the context of neural networks, one can add terms to the loss function to achieve automatic decorrelation:…”
Section: Model Dependence in HEP Data Analysis
mentioning (confidence: 99%)
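To make the quoted idea of a decorrelation term in the loss concrete, the sketch below adds a squared distance-correlation penalty, in the spirit of the DisCo regularizer used by the ABCDisCo paper, to a standard classification loss. The helper names, the λ value, and the choice to evaluate the penalty on background-labeled events only are illustrative assumptions, not taken verbatim from any of the cited works.

```python
import torch
import torch.nn.functional as F

def distance_correlation_sq(x, y, eps=1e-9):
    """Sample (squared) distance correlation between two 1-D tensors (biased estimator)."""
    x = x.view(-1, 1)
    y = y.view(-1, 1)
    a = torch.cdist(x, x, p=1)  # pairwise |x_i - x_j|
    b = torch.cdist(y, y, p=1)
    # double-center the distance matrices
    A = a - a.mean(dim=0, keepdim=True) - a.mean(dim=1, keepdim=True) + a.mean()
    B = b - b.mean(dim=0, keepdim=True) - b.mean(dim=1, keepdim=True) + b.mean()
    dcov2 = (A * B).mean()
    dvar_x = (A * A).mean()
    dvar_y = (B * B).mean()
    return dcov2 / (torch.sqrt(dvar_x * dvar_y) + eps)

def decorrelated_loss(logits, labels, protected, lam=10.0):
    """Classification loss plus a penalty discouraging statistical dependence
    between the classifier output and a protected feature (e.g. a mass),
    evaluated on background (label == 0) events only.
    labels: float tensor of 0/1 with the same shape as logits."""
    bce = F.binary_cross_entropy_with_logits(logits, labels)
    bkg = labels < 0.5
    scores = torch.sigmoid(logits[bkg])
    penalty = distance_correlation_sq(scores, protected[bkg])
    return bce + lam * penalty
```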
“…A key challenge facing such methods is that the machine learning classifiers must be relatively independent from the resonant feature, for otherwise artificial bumps can be formed. Many automated decorrelation methods have been proposed to ensure that classifiers are decorrelated from particular features by construction [39–50], but they may not apply in all cases. In particular, weakly supervised approaches that learn directly on the signal region cannot be simply combined with a decorrelation scheme because such an approach could degrade the performance in the presence of a signal.…”
Section: Introduction
mentioning (confidence: 99%)
“…In particular, the advent of unsupervised and weakly-supervised Machine Learning (ML) techniques has allowed for the development of broad model independent NP search and characterisation strategies [1]. Simultaneously, there have been important efforts to reduce reliance of LHC measurements on Monte Carlo (MC) simulations of hadronic processes [2–6].…”
mentioning (confidence: 99%)
“…So the model would over-parameterize the data making the inclusion of mixtures redundant. Therefore the key insight is to instead write down a mixture model in terms of p(j_n|z_n) and p(b_n|z_n), such that the correlations between N_j and N_b in the dataset are parameterized by the class label alone. The number of parameters in this model is 2 × (d_j + d_b − 2) + 1.…”
mentioning (confidence: 99%)
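For context on the parameter count quoted above, the sketch below assumes d_j and d_b are the numbers of possible values of the two discrete observables N_j and N_b, and that z is a binary class label. Under that reading, each of the two mixture components carries (d_j − 1) + (d_b − 1) categorical parameters, and one shared mixing fraction gives 2 × (d_j + d_b − 2) + 1 in total. All names here are hypothetical, not taken from the cited work.

```python
import numpy as np

def n_parameters(d_j, d_b, n_components=2):
    # Each component holds one categorical distribution per observable:
    # (d_j - 1) + (d_b - 1) free parameters, plus (n_components - 1) mixture fractions.
    return n_components * (d_j + d_b - 2) + (n_components - 1)

def joint_probability(pi, p_j_given_z, p_b_given_z):
    """p(j, b) = sum_z pi[z] * p(j|z) * p(b|z): the two observables are
    correlated only through the shared class label z.
    pi: (K,), p_j_given_z: (K, d_j), p_b_given_z: (K, d_b)."""
    return np.einsum("k,kj,kb->jb", pi, p_j_given_z, p_b_given_z)

print(n_parameters(d_j=5, d_b=4))  # 2 * (5 + 4 - 2) + 1 = 15 free parameters
```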