2019
DOI: 10.48550/arxiv.1906.01969
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Efficient, Lexicon-Free OCR using Deep Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
7
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(7 citation statements)
references
References 0 publications
0
7
0
Order By: Relevance
“…This 32 px limit is supported by Tong et al, who report that a height of 32 px produced the same accuracy as 48 px and 64 px, but that 16 px was too little to correctly recognize small characters[Ton+20]. On the other hand, Namsyl and Konya also state that "relatively long, free-form text lines [warrant] the use of taller samples"[NK19]. This leads us to the situation with current state of the art (non-HTR) CRNN systems geared towards historical typefaces, where line heights of 40 px [MLK20; MGK19] and, more commonly, 48 px [WRP18a; Bre+13; KHK19] are to be found 4 .…”
mentioning
confidence: 82%
See 3 more Smart Citations
“…This 32 px limit is supported by Tong et al, who report that a height of 32 px produced the same accuracy as 48 px and 64 px, but that 16 px was too little to correctly recognize small characters[Ton+20]. On the other hand, Namsyl and Konya also state that "relatively long, free-form text lines [warrant] the use of taller samples"[NK19]. This leads us to the situation with current state of the art (non-HTR) CRNN systems geared towards historical typefaces, where line heights of 40 px [MLK20; MGK19] and, more commonly, 48 px [WRP18a; Bre+13; KHK19] are to be found 4 .…”
mentioning
confidence: 82%
“…We omitted P = 0 because preliminary results showed that this did not work well at all. Following (G3) and the fact that at least 10 layers have been used in fully convolutional networks for text recognition [NK19], we focus more on depth (K) than on width (N and R). This is now a general practice in deep learning, as the "depth of a neural network is exponentially more valuable than the width of a neural network" [WR17].…”
Section: Findings ψmentioning
confidence: 99%
See 2 more Smart Citations
“…The problem of Optical Character Recognition (OCR) has been the scope of research for many years [1]- [3] due to the need for an efficient method to digitize printed documents, prevent their loss and gradual unavoidable wear, as well as increase their accessibility and portability. The challenges that face Arabic OCR systems stem from the cursive and continuous nature of Arabic scripts.…”
Section: Introductionmentioning
confidence: 99%