Facility Locations Utility for Uncovering Classifier Overconfidence

Maurer, Karsten; Bennette, Walter

doi:10.1109/icmla.2019.00071

Cited by 6 publications

(13 citation statements)

References 13 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Algorithmic approaches for finding high-confidence errors from machine learning classifiers mainly consist of the following components: 1) a utility function to measure the usefulness of queried points, 2) some search strategy to help maximize the utility function [4]- [6] In Lakkaraju (2017) [4], they defined a utility function that gave a uniform value for each error discovered above some threshold (65% for binary classification), and applied a penalty associated with the cost of the human providing the true label for the queried points. The suggested search was then to cluster data points from the unlabeled evaluation dataset using some feature space (which, in the case of black-box classifiers, may be different than the original feature space), and perform multiarmed bandit sampling to maximize the utility.…”

Section: A High-confidence Errorsmentioning

confidence: 99%

“…Meaning, the methods could be discovering errors accidentally, not by discovering commonalities between errors to increase the rate of error discovery. More recent works have instead focused on discovering classification errors at rates greater than expected, to encourage search methods that discover something about a model's weaknesses to increase the rate of error discovery [6], [7]. Of particular interest to this work is a search technique that leverages adversarial machine learning to point users towards high confidence errors.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Generalized Adversarial Distances to Efficiently Discover Classifier Errors

Bennette¹,

Dufek²,

Maurer³

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

Given a black-box classification model and an unlabeled evaluation dataset from some application domain, efficient strategies need to be developed to evaluate the model. Random sampling allows a user to estimate metrics like accuracy, precision, and recall, but may not provide insight to highconfidence errors. High-confidence errors are rare events for which the model is highly confident in its prediction, but is wrong. Such errors can represent costly mistakes and should be explicitly searched for. In this paper we propose a generalization to the Adversarial Distance search that leverages concepts from adversarial machine learning to identify predictions for which a classifier may be overly confident. These predictions are useful instances to sample when looking for high-confidence errors because they are prone to a higher rate of error than expected. Our generalization allows Adversarial Distance to be applied to any classifier or data domain. Experimental results show that the generalized method finds errors at rates greater than expected given the confidence of the sampled predictions, and outperforms competing methods.

show abstract

Section: A High-confidence Errorsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Generalized Adversarial Distances to Efficiently Discover Classifier Errors

Bennette¹,

Dufek²,

Maurer³

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…Maurer and Bennette (2019) [7] present an extension to [4] and [5] that identifies the flaw of valuing error discovery at the rate expected given model confidence. Meaning, the work identifies the fact that errors should be expected for confidence levels below 100%.…”

Section: A High-confidence Errorsmentioning

confidence: 99%

“…We consider the problem of discovering high-confidence errors at rates greater than what a model's confidence would suggest, which was recently introduced by [7]. Given a blackbox classifier, M , with M (x) = (ŷ x , px ), where x is an instance from an unlabeled evaluation set X, ŷx is the model's prediction, px is the model's confidence, and y x is the true label assigned by some oracle, the task is to find a query set of data points, Q ⊆ X, that maximize the Standardized Discovery Ratio (SDR).…”

Section: Problem Formulationmentioning

confidence: 99%

Harnessing Adversarial Distances to Discover High-Confidence Errors

Bennette¹,

Maurer²,

Sisti³

2020

Preprint

Self Cite

View full text Add to dashboard Cite

Given a deep neural network image classification model that we treat as a black box, and an unlabeled evaluation dataset, we develop an efficient strategy by which the classifier can be evaluated. Randomly sampling and labeling instances from an unlabeled evaluation dataset allows traditional performance measures like accuracy, precision, and recall to be estimated. However, random sampling may miss rare errors for which the model is highly confident in its prediction, but wrong. These highconfidence errors can represent costly mistakes, and therefore should be explicitly searched for. Past works have developed search techniques to find classification errors above a specified confidence threshold, but ignore the fact that errors should be expected at confidence levels anywhere below 100%. In this work, we investigate the problem of finding errors at rates greater than expected given model confidence. Additionally, we propose a query-efficient and novel search technique that is guided by adversarial perturbations to find these mistakes in black box models. Through rigorous empirical experimentation, we demonstrate that our Adversarial Distance search discovers high-confidence errors at a rate greater than expected given model confidence.

show abstract

“…Artificially intelligent agents are also overconfident (Tesauro, Gondek, Lenchner, Fan, & Prager, 2013), especially when assumptions underlying their models are violated (Attenberg, Ipeirotis, & Provost, 2011). The failure to account for unknown unknowns represents an important reason why they are overconfident (Maurer & Bennette, 2018). Any set of training data, however large, will be smaller than the universe of potential problems.…”

Section: An Epistemological Conundrummentioning

confidence: 99%

Overprecision is a property of thinking systems

Da¹

2019

Preprint

View full text Add to dashboard Cite

The ability to accurately calibrate one’s confidence is essential for effective decisions. Overprecision is the excessive faith that one knows the truth. This paper proposes a new theory to explain it. The theory holds that overprecision in judgment results from individuals’ neglect of all the ways in which they could be wrong. When there are an infinite number of ways to be wrong, it is impossible to consider them all. Overprecision in judgment, it turns out, is primarily attributable to being wrong and not knowing it. This explanation can account for the persistence of overprecision not only among people but also artificially intelligent agents.

show abstract

Facility Locations Utility for Uncovering Classifier Overconfidence

Cited by 6 publications

References 13 publications

Generalized Adversarial Distances to Efficiently Discover Classifier Errors

Generalized Adversarial Distances to Efficiently Discover Classifier Errors

Harnessing Adversarial Distances to Discover High-Confidence Errors

Overprecision is a property of thinking systems

Contact Info

Product

Resources

About