2018 IEEE Winter Conference on Applications of Computer Vision (WACV) 2018
DOI: 10.1109/wacv.2018.00030
|View full text |Cite
|
Sign up to set email alerts
|

Confidence Prediction for Lexicon-Free OCR

Abstract: Having a reliable accuracy score is crucial for real world applications of OCR, since such systems are judged by the number of false readings. Lexicon-based OCR systems, which deal with what is essentially a multi-class classification problem, often employ methods explicitly taking into account the lexicon, in order to improve accuracy. However, in lexicon-free scenarios, filtering errors requires an explicit confidence calculation. In this work we show two explicit confidence measurement techniques, and show … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
3
3

Relationship

0
10

Authors

Journals

citations
Cited by 18 publications
(10 citation statements)
references
References 13 publications
0
10
0
Order By: Relevance
“…Note, however, that like the other rejection approaches, this carves out a small region around the origin, and potentially between two classes as "none of the above," and still leaves infinite open space risk and hence does not solve OSR. In addition, there is also active research in network uncertainty estimation (Gal and Ghahramani 2016;Lakshminarayanan, Pritzel, and Blundell 2017;Mor and Wolf 2018). The authors of such claim thresholding their uncertainity can reject outliers.…”
Section: Open Set Deep Networkmentioning
confidence: 99%
“…Note, however, that like the other rejection approaches, this carves out a small region around the origin, and potentially between two classes as "none of the above," and still leaves infinite open space risk and hence does not solve OSR. In addition, there is also active research in network uncertainty estimation (Gal and Ghahramani 2016;Lakshminarayanan, Pritzel, and Blundell 2017;Mor and Wolf 2018). The authors of such claim thresholding their uncertainity can reject outliers.…”
Section: Open Set Deep Networkmentioning
confidence: 99%
“…The LSTM model. We implement the OCR model described in [22], which consists of a convolutional layer, followed by a 3-layer deep bidirectional LSTM, and optimizes for CTC loss. The trained model achieves a precision score of 85.7% on the test set of ORAND-CAR-A, which would have achieved first place in the HDSRC 2014 competition.…”
Section: Lstm On Handwritten Numbersmentioning
confidence: 99%
“…The prediction accuracy of the OCR engine such as Tesseract [9] poses a limitation to our toolkit. The accuracy of Tesseract OCR is not always 100% [14], [15]. To handle this limitation and to enhance the accuracy, we used regionbased segmentation and image preprocessing techniques such as noise removal, canny edge detection, and contours finding.…”
Section: Limitationsmentioning
confidence: 99%