2021
DOI: 10.1029/2021ms002573
|View full text |Cite
|
Sign up to set email alerts
|

Controlled Abstention Neural Networks for Identifying Skillful Predictions for Classification Problems

Abstract: The earth system is exceedingly complex and often chaotic in nature, making prediction incredibly challenging: we cannot expect to make perfect predictions all of the time. Instead, we look for specific states of the system that lead to more predictable behavior than others, often termed “forecasts of opportunity.” When these opportunities are not present, scientists need prediction systems that are capable of saying “I don't know.” We introduce a novel loss function, termed the “NotWrong loss,” that allows ne… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

1
1
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 22 publications
1
1
0
Order By: Relevance
“…Note that Eq. 4 is very similar to the abstention loss of Thulasidasan (2020, Chapter 4) and Barnes and Barnes (2021) for classification networks.…”
Section: Abstention Losssupporting
confidence: 57%
“…Note that Eq. 4 is very similar to the abstention loss of Thulasidasan (2020, Chapter 4) and Barnes and Barnes (2021) for classification networks.…”
Section: Abstention Losssupporting
confidence: 57%
“…In this initial study, we restrict attention to UE methods that can be computed deterministically from any of the model's ouputs or intermediate signals, without depending on the available data. This includes the conventional softmax response (also known as maximum class probability, MCP) and the entropy of the softmax output, for all models, as well as mutual information, predictive variance and mean variance, computed from the multiple realizations of the model's softmax output, in the case of ensemble models-but excludes any method that trains an auxiliary model to provide UE, such as [Geifman and El-Yaniv 2019, DeVries and Taylor 2018, Barnes and Barnes 2021, Corbière et al 2021. We focus on the task of selective classification [Geifman and El-Yaniv 2017] and, following [Galil et al 2022], evaluate all methods using the RC curve.…”
Section: Introductionmentioning
confidence: 99%