2023
DOI: 10.48550/arxiv.2302.09852
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Unsupervised Layer-wise Score Aggregation for Textual OOD Detection

Abstract: Out-of-distribution (OOD) detection is a rapidly growing field due to new robustness and security requirements driven by an increased number of AI-based systems. Existing OOD textual detectors often rely on anomaly scores (e.g., Mahalanobis distance) computed on the embedding output of the last layer of the encoder. In this work, we observe that OOD detection performance varies greatly depending on the task and layer output. More importantly, we show that the usual choice (the last layer) is rarely the best on… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 71 publications
(107 reference statements)
0
1
0
Order By: Relevance
“…This further demonstrates that the transductive loss can be useful in few-shot NLP. In the future, we would like to explore the application of transductive inference to other NLP tasks such as sequence generation (Pichler et al, 2022;Colombo et al, 2019Colombo et al, , 2021d and classification tasks (Chapuis et al, 2020;Colombo et al, 2022d,b;Himmi et al, 2023) as well as NLG evaluation (Colombo et al, 2021e, 2022c(Colombo et al, 2021e, , 2021c and Safe AI Picot et al, 2022a,b;Darrin et al, 2022Darrin et al, , 2023…”
Section: B41 Results Per Languagementioning
confidence: 99%
“…This further demonstrates that the transductive loss can be useful in few-shot NLP. In the future, we would like to explore the application of transductive inference to other NLP tasks such as sequence generation (Pichler et al, 2022;Colombo et al, 2019Colombo et al, , 2021d and classification tasks (Chapuis et al, 2020;Colombo et al, 2022d,b;Himmi et al, 2023) as well as NLG evaluation (Colombo et al, 2021e, 2022c(Colombo et al, 2021e, , 2021c and Safe AI Picot et al, 2022a,b;Darrin et al, 2022Darrin et al, , 2023…”
Section: B41 Results Per Languagementioning
confidence: 99%