2020
DOI: 10.48550/arxiv.2012.12643
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

On Calibration of Scene-Text Recognition Models

Abstract: In this work, we study the problem of word-level confidence calibration for scene-text recognition (STR). Although the topic of confidence calibration has been an active research area for the last several decades, the case of structured and sequence prediction calibration has been scarcely explored. We analyze several recent STR methods and show that they are consistently overconfident. We then focus on the calibration of STR models on the word rather than the character level. In particular, we demonstrate tha… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 21 publications
0
3
0
Order By: Relevance
“…While the reliability of the automation ability of a DU solution is deemed quintessential for generating business value in practice [11], DU research rarely reports any confidence evalu-ation. Some exceptions are in closely related task domains like scene text recognition [82] and QA [38,106].…”
Section: Discussionmentioning
confidence: 99%
“…While the reliability of the automation ability of a DU solution is deemed quintessential for generating business value in practice [11], DU research rarely reports any confidence evalu-ation. Some exceptions are in closely related task domains like scene text recognition [82] and QA [38,106].…”
Section: Discussionmentioning
confidence: 99%
“…Many methods have been developed to mitigate overconfidence in ML [20][21][22][23][24][25][26][27][28][29][30][31][32][33][34], for example by allowing an agnostic output, by suitable post-processing [7,[35][36][37][38][39][40], or through early stopping [41]. Additional relevant research includes that of [42][43][44][45][46][46][47][48][49][50][51][52], and these will provide us with informative benchmarks.…”
Section: Related Workmentioning
confidence: 99%
“…Some authors considered the sequential characteristics and attempted to incorporate them into the sequential framework [19]. The varying sequential length is utilized as the setup basis of temperature values based on the temperature scaling technique [10,22,36]. Attention scores calculated during the sequence modeling are also utilized as the weights of temperature values [20].…”
Section: Introductionmentioning
confidence: 99%