2021
DOI: 10.1007/978-3-030-68793-9_30
|View full text |Cite
|
Sign up to set email alerts
|

ICPR 2020 Competition on Text Block Segmentation on a NewsEye Dataset

Abstract: We present a competition on text block segmentation within the framework of the International Conference on Pattern Recognition (ICPR) 2020. The main goal of this competition is to automatically analyse the structure of historical newspaper pages with a subsequent evaluation of the participants' algorithms performance. In contrast to many existing segmentation methods, instead of working on pixels, the present study has a focus on clustering baselines/text lines into text blocks. Therefore, we introduce a new … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
3
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 21 publications
0
3
0
Order By: Relevance
“…The detection of text lines has been widely explored in historical manuscript text books [26,9] and other historical documents of different natures, such as newspapers [25], meteorological tables [1] finding aids [33], as well as many other supports. With index tables, one can consider the issue as a two-class image segmentation task: we separate text lines from the background.…”
Section: Document Image Analysismentioning
confidence: 99%
See 1 more Smart Citation
“…The detection of text lines has been widely explored in historical manuscript text books [26,9] and other historical documents of different natures, such as newspapers [25], meteorological tables [1] finding aids [33], as well as many other supports. With index tables, one can consider the issue as a two-class image segmentation task: we separate text lines from the background.…”
Section: Document Image Analysismentioning
confidence: 99%
“…These models have been evaluated and compared on COCO challenges [16], and are fully integrated in popular toolkits such as Detectron2 or LayoutParser. Mask-RCNN has been prior used for document understanding such as on historical newspapers [25]. In contrast, YOLACT and YOLACT++ [10] are single-step approaches focusing on efficiency and increasing the number of frames per second (FPS), a metric that indicates the number of images processed in one second.…”
Section: Document Image Analysismentioning
confidence: 99%
“…-Text Recognition (Michael et al, 2019) and Article Separation (Michael et al, 2020), extracting the layout of newspapers (e.g. articles and graphical regions) from digitized newspapers and transforming the content to textual format, providing full articles through automatic layout analysis, text recognition and article separation.…”
Section: The Newseye Projectmentioning
confidence: 99%
“…Recent studies further improve by introducing border or counter awareness [47,42,56,8], local refinement [51,11], deformation convolution [39,43], Bezier curve [22], etc. Besides, document layout analysis [7,54,12,26,24] have been studied for years that usually take reading order of texts in document as consideration.…”
Section: Scene Text Detectionmentioning
confidence: 99%