2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR) 2017
DOI: 10.1109/icdar.2017.20
|View full text |Cite
|
Sign up to set email alerts
|

Are Multidimensional Recurrent Layers Really Necessary for Handwritten Text Recognition?

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
194
1

Year Published

2018
2018
2020
2020

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 223 publications
(197 citation statements)
references
References 10 publications
2
194
1
Order By: Relevance
“…The current state-of-the-art in many text recognition tasks additionally integrate CNNs for an improved low-level feature extraction prior to the recurrent layers. This approach is applied to offline HTR by [4]. Recently there have been developments towards fully convolutional architectures, i.e.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The current state-of-the-art in many text recognition tasks additionally integrate CNNs for an improved low-level feature extraction prior to the recurrent layers. This approach is applied to offline HTR by [4]. Recently there have been developments towards fully convolutional architectures, i.e.…”
Section: Related Workmentioning
confidence: 99%
“…Generally, the de facto standard for HTR tasks have been systems based on Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) [3], [4], which are utilizing the Connectionist Temporal Classification (CTC) [5] objective function. However, CTC-based architectures are subject to inherent limitations like strict monotonic input-output alignments and an output sequence length that is bound by the, possibly subsampled, input length.…”
Section: Introductionmentioning
confidence: 99%
“…We show that FCNs with dilated 1×3 convolutions detect long text lines as single entities significantly better than FCNs that use only un-dilated 3×3 convolutions. We choose to detect text at the line level as this is the input expected by state-of-the-art handwriting recognition methods [13]. Figure 2.…”
Section: Detectionmentioning
confidence: 99%
“…Bluche et al [12], [13] described an end-to-end system that uses an MD-RNN along with an LSTM to encode multiple lines of text. Although the described system shows promise to automatically recognise multiple lines, it may not be a practical solution as it requires a large amount of computational power [11]. Winglinton et al [14] utilised a region proposal network to find the starting positions of text lines and a line follower network was trained to trace the line of text.…”
Section: B Text Recognitionmentioning
confidence: 99%