iDECODe: In-distribution Equivariance for Conformal Out-of-distribution Detection

Kaur, Ramneet; Jha, Susmit; Roy, Anirban; Park, Sangdon; Dobriban, Edgar; Sokolsky, Oleg; Lee, Insup

doi:10.48550/arxiv.2201.02331

Cited by 2 publications

(2 citation statements)

References 23 publications


“…Though originally based on the premise of exchangeable (e.g., independently and identically distributed) training and test data, the framework has since been generalized to handle various forms of distribution shift, including covariate shift [62,44], label shift [47], arbitrary distribution shifts in an online setting [20], and test distributions that are nearby the training distribution [15]. Conformal approaches have also been used to detect distribution shift [64,28,38,8,4,48,30].…”

Section: Prior Work (mentioning)

confidence: 99%

Conformal prediction for the design problem

Fannjiang, Bates, Angelopoulos et al. 2022

Preprint


In many real-world deployments of machine learning, we use a prediction algorithm to choose what data to test next. For example, in the protein design problem, we have a regression model that predicts some real-valued property of a protein sequence, which we use to propose new sequences believed to exhibit higher property values than observed in the training data. Since validating designed sequences in the wet lab is typically costly, it is important to know how much we can trust the model's predictions. In such settings, however, there is a distinct type of distribution shift between the training and test data: one where the training and test data are statistically dependent, as the latter is chosen based on the former. Consequently, the model's error on the test data (that is, the designed sequences) has some non-trivial relationship with its error on the training data. Herein, we introduce a method to quantify predictive uncertainty in such settings. We do so by constructing confidence sets for predictions that account for the dependence between the training and test data. The confidence sets we construct have finite-sample guarantees that hold for any prediction algorithm, even when a trained model chooses the test-time input distribution. As a motivating use case, we demonstrate how our method quantifies uncertainty for the predicted fitness of designed proteins using several real data sets.


“…As discussed in [35], the mathematical structure of these methods is closely related to that of tolerance regions [18,32,38]. Inductive conformal anomaly detection [16,25] builds on ICP to guarantee a bounded false detection rate. In different literature, there are different terminology for the two user-specified inputs.…”

Section: Related Work (mentioning)

confidence: 99%

PAC-Wrap: Semi-Supervised PAC Anomaly Detection

Li, Ji, Dobriban et al. 2022

Preprint

Self Cite


Anomaly detection is essential for preventing hazardous outcomes in safety-critical applications like autonomous driving. Given their safety-criticality, these applications benefit from provable bounds on various errors in anomaly detection. To achieve this goal in the semi-supervised setting, we propose to provide Probably Approximately Correct (PAC) guarantees on the false negative and false positive detection rates for anomaly detection algorithms. Our method (PAC-Wrap) can wrap around virtually any existing semi-supervised and unsupervised anomaly detection method, endowing it with rigorous guarantees. Our experiments with various anomaly detectors and datasets indicate that PAC-Wrap is broadly effective.

CCS Concepts: • Security and privacy → Intrusion/anomaly detection and malware mitigation; • Theory of computation → Sample complexity and generalization bounds; • Computing methodologies → Semi-supervised learning settings.
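The citation statement above notes that inductive conformal anomaly detection guarantees a bounded false detection rate. As a rough illustration of that general idea only (not the procedure of iDECODe or PAC-Wrap), the following minimal sketch computes a conformal p-value from held-out calibration scores and flags an input when the p-value falls below a user-chosen level. The function names, the threshold name `epsilon`, and the example scores are all assumptions made for illustration; any nonconformity score could be substituted.

```python
def conformal_p_value(calibration_scores, test_score):
    """Conformal p-value: share of scores (calibration plus the test point
    itself) that are at least as large as the test score.

    Scores are assumed to be nonconformity scores, where larger means
    "less like the in-distribution data".
    """
    n = len(calibration_scores)
    at_least = sum(1 for s in calibration_scores if s >= test_score)
    return (at_least + 1) / (n + 1)


def is_anomaly(calibration_scores, test_score, epsilon):
    """Flag the test point when its p-value is below epsilon.

    If the calibration and test points are exchangeable, the probability of
    flagging an in-distribution point is at most epsilon.
    """
    return conformal_p_value(calibration_scores, test_score) < epsilon


# Hypothetical nonconformity scores from a held-out calibration set.
calibration = [0.1, 0.2, 0.3, 0.4]

p_typical = conformal_p_value(calibration, 0.25)   # (2 + 1) / (4 + 1) = 0.6
p_extreme = conformal_p_value(calibration, 10.0)   # (0 + 1) / (4 + 1) = 0.2
flag_typical = is_anomaly(calibration, 0.25, epsilon=0.25)
flag_extreme = is_anomaly(calibration, 10.0, epsilon=0.25)
```

With only four calibration scores the smallest attainable p-value is 0.2, so in practice the calibration set needs to be large enough for the chosen `epsilon` to be reachable.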
