2015 13th International Conference on Document Analysis and Recognition (ICDAR) 2015
DOI: 10.1109/icdar.2015.7333803
|View full text |Cite
|
Sign up to set email alerts
|

Paragraph text segmentation into lines with Recurrent Neural Networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
27
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 45 publications
(27 citation statements)
references
References 13 publications
0
27
0
Order By: Relevance
“…Time taken (sec) Memory (gb) Ours 0.389 ± 0.0888 2.6 Bluche [4] 21.2 ± 5.6 N/a Bluche [5] 0.62 ± 0.14 N/a Wigington [6] 0.546 ± 0.447 9.7 Since our memory usage is substantially smaller, it is possible to run multiple images at the same time; effectively reducing the time required by a third.…”
Section: Methodsmentioning
confidence: 99%
“…Time taken (sec) Memory (gb) Ours 0.389 ± 0.0888 2.6 Bluche [4] 21.2 ± 5.6 N/a Bluche [5] 0.62 ± 0.14 N/a Wigington [6] 0.546 ± 0.447 9.7 Since our memory usage is substantially smaller, it is possible to run multiple images at the same time; effectively reducing the time required by a third.…”
Section: Methodsmentioning
confidence: 99%
“…A huge variety of degradations as well as different resolutions and orientations should be present. Since the landscape of document analysis has changed over the last years, and machine learning based algorithms get more and more popular not only for KWS [19] and HTR [20] but also for LA [21]- [23], the dataset should consist of hundreds of pages to provide an appropriate amount of training samples.…”
Section: Introductionmentioning
confidence: 99%
“…Related to the main line of this PhD Thesis, we found the work of [Moysset et al 2015]. In their approach MDLSTM are trained with Connectionist Temporal Classification (CTC) for the localization of line frontiers.…”
Section: Text Line As Sequencesmentioning
confidence: 99%
“…The image is converted into sequence of feature vectors by dividing the image into D non-overlapping vertical regions, then projection based features are extracted for each block and finally stacked for each row. In this work, they used a HMM to estimate the sequence alignment and the input features of the model are handcrafted by projection profiles, while in [Moysset et al 2015] used the CTC and MDLSTM that are directly applied on the raw image.…”
Section: Text Line As Sequencesmentioning
confidence: 99%
See 1 more Smart Citation