2016
DOI: 10.1007/978-3-319-50835-1_66
|View full text |Cite
|
Sign up to set email alerts
|

OCR as a Service: An Experimental Evaluation of Google Docs OCR, Tesseract, ABBYY FineReader, and Transym

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
29
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 59 publications
(29 citation statements)
references
References 22 publications
0
29
0
Order By: Relevance
“…Image recognition solves the classification problem: it associates images with classes to which they belong. The recognition quality of English text is at a very high-level [9]. Since the size of hieroglyphs alphabet exceeds 10000 (while the alphabet for Latin-base languages is usually less than 100) the recognition of hieroglyphs introduces new challenges to both training and recognizing processes.…”
Section: Hieroglyphs Recognitionmentioning
confidence: 99%
See 1 more Smart Citation
“…Image recognition solves the classification problem: it associates images with classes to which they belong. The recognition quality of English text is at a very high-level [9]. Since the size of hieroglyphs alphabet exceeds 10000 (while the alphabet for Latin-base languages is usually less than 100) the recognition of hieroglyphs introduces new challenges to both training and recognizing processes.…”
Section: Hieroglyphs Recognitionmentioning
confidence: 99%
“…% of . To compare the results we use the recently released version of Tesseract 4.0.0 [9,25], which reaches 37.67% of . Comparing to Tesseract 4.0.0 our performance is better but still is not sufficiently good.…”
Section: Trainingmentioning
confidence: 99%
“…For instance, previous work focused on digitization the content of documents into a machine-readable format using optical character recognition (OCR). We refer to [1], [2] for a review of current OCR systems. Existing datasets include the ICDAR Robust Reading Competitions 2011, 2013, 2015, 2017 .…”
Section: Introductionmentioning
confidence: 99%
“…OCR in both handwritten and typewritten texts has been extensively studied in the domain of pattern recognition and computer vision. Prominent off-the-shelf OCR tools [4] include Google Cloud OCR, 1 Tesseract, 2 ABBYY FineReader, 3 and Transym. 4 All of these aforementioned software packages, with some configuration customization, can generate sufficiently machine-readable OCR output given a high-quality input image.…”
Section: Introductionmentioning
confidence: 99%
“…Prominent off-the-shelf OCR tools [4] include Google Cloud OCR, 1 Tesseract, 2 ABBYY FineReader, 3 and Transym. 4 All of these aforementioned software packages, with some configuration customization, can generate sufficiently machine-readable OCR output given a high-quality input image. OCR results, however, can be significantly dependent on scanning noise, page layout, and image resolution [5].…”
Section: Introductionmentioning
confidence: 99%