2019
DOI: 10.1145/3355610
|View full text |Cite
|
Sign up to set email alerts
|

Document Layout Analysis

Abstract: Document layout analysis (DLA) is a preprocessing step of document understanding systems. It is responsible for detecting and annotating the physical structure of documents. DLA has several important applications such as document retrieval, content categorization, text recognition, and the like. The objective of DLA is to ease the subsequent analysis/recognition phases by identifying the document-homogeneous blocks and by determining their relationships. The DLA pipeline consists of several phases that could v… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
25
0
1

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 123 publications
(26 citation statements)
references
References 113 publications
0
25
0
1
Order By: Relevance
“…For example, the Projection Profile method showed a 100% success rate with the confidence of ±0.3 • in the studies by Mahajan and Apoorva [26]. A comparison made by Binmakhashen and Mahmoud [3], about the Singh et al [38] studies show a small error rate of 0.05 • for documents with angles ≤ 150 • using the Hough Transform technique in document analysis.…”
Section: Main Techniques Used In Omr Processingmentioning
confidence: 97%
See 2 more Smart Citations
“…For example, the Projection Profile method showed a 100% success rate with the confidence of ±0.3 • in the studies by Mahajan and Apoorva [26]. A comparison made by Binmakhashen and Mahmoud [3], about the Singh et al [38] studies show a small error rate of 0.05 • for documents with angles ≤ 150 • using the Hough Transform technique in document analysis.…”
Section: Main Techniques Used In Omr Processingmentioning
confidence: 97%
“…Tilt detection and correction are closely linked to the segmentation phase, that is, to extracting regions from the document. The input images must be defined in a standard format, for example, all images with the skew angle at 0 • [3]. Some approaches use the base of the lines of writing to detect the skew angle [15,29].…”
Section: Main Techniques Used In Omr Processingmentioning
confidence: 99%
See 1 more Smart Citation
“…Tesseract offers a number of pre-processing mechanisms for document images, however, it does not implement the full range of state-of-the-art OCR. Image pre-processing as proposed and implemented by the OCR-D project (Binmakhashen and Mahmoud, 2019;Neudecker et al, 2019), is beneficial to additionally extend the tool with the latest developments in OCR.…”
Section: Foundations and Related Workmentioning
confidence: 99%
“…Although difficult, layout analysis is however essential for historical newspaper understanding and exploitation, and their quality has a direct impact on downstream processes [Binmakhashen and Mahmoud, 2019]. From an information retrieval and user viewpoint, being able to query at the level of meaningful segments such as articles -instead of whole pages-, and to facet over different types of segments are undeniable advantages.…”
Section: Introductionmentioning
confidence: 99%