2020
DOI: 10.1007/978-3-030-57058-3_31
|View full text |Cite
|
Sign up to set email alerts
|

Fast and Lightweight Text Line Detection on Historical Documents

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 11 publications
0
3
0
Order By: Relevance
“…This problem has been little studied in the literature. For example, when confronted to touching bounding boxes, Melnikov and Zagaynov [31] suggested to remove the ascenders and descenders by reducing the height of the annotated boxes by 30% at the top and the bottom. Then, they downscaled the rasterized polygons to the input resolution to train their model.…”
Section: Evaluation Methodologymentioning
confidence: 99%
See 1 more Smart Citation
“…This problem has been little studied in the literature. For example, when confronted to touching bounding boxes, Melnikov and Zagaynov [31] suggested to remove the ascenders and descenders by reducing the height of the annotated boxes by 30% at the top and the bottom. Then, they downscaled the rasterized polygons to the input resolution to train their model.…”
Section: Evaluation Methodologymentioning
confidence: 99%
“…Object-level IoU P/R F1 P/R R@.85/.95 mAP@.65 mAP Barakat [23] Mechi [24] Renton [25] Doc-UFCN [6] dhSegment [7] Yang [27] Tarride [2] Soullard [30] Melnikov [31] To tackle this problem, metrics originally designed in the Information Retrieval community [34] have been adapted to images and used during the PASCAL VOC Challenge 2012 to compute the Precision at objectlevel. During this competition, the detection task was evaluated based on the Precision-Recall curve at objectlevel, where the detections are considered as true or false positives according to their area of overlap with the ground-truth objects.…”
Section: Pixel-levelmentioning
confidence: 99%
“…These approaches, however, make assumptions about the document, such as constant text orientation, and rely on only processing a single text column at time. Other authors rely on segmenting parts the text body, such as baseline or x-height area (see Figure 1), using CNNs first [13,12]. To convert the detected text x-height areas to text line polygons, Pastor-Pellicer et al [17] use local extreme points of binarized text to assign ascender and descender lines to each detected text line.…”
Section: Introductionmentioning
confidence: 99%