2020
DOI: 10.1007/978-3-030-57321-8_23
|View full text |Cite
|
Sign up to set email alerts
|

A Clustering Backed Deep Learning Approach for Document Layout Analysis

Abstract: Large organizations generate documents and records on a daily basis, often to such an extent that processing them manually becomes unduly time consuming. Because of this, automated processing systems for documents are desirable, as they would reduce the time spent handling them. Unfortunately, documents are often not designed to be machine-readable, so parsing them is a difficult problem. Image segmentation techniques and deep-learning architectures have been proposed as a solution to this, but have difficulty… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 7 publications
0
3
0
Order By: Relevance
“…As such, document layout analysis (DLA) is used as a standard preprocessing and an essential prerequisite for developing any document image processing and analysis system. Thus, DLA has emerged as a priority topic and active research domain [3] and has increasingly become a significant interest in numerous research studies [4][5][6][7][8][9]. DLA algorithms can be carried out top-down or bottom-up with respect to their processing order [10].…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…As such, document layout analysis (DLA) is used as a standard preprocessing and an essential prerequisite for developing any document image processing and analysis system. Thus, DLA has emerged as a priority topic and active research domain [3] and has increasingly become a significant interest in numerous research studies [4][5][6][7][8][9]. DLA algorithms can be carried out top-down or bottom-up with respect to their processing order [10].…”
Section: Introductionmentioning
confidence: 99%
“…Performing consecutive or cumulative connected component (CC) and pixel analyses on a document image was a typical dominant technique enforced to initially identify regions and then classify them, as adopted by the majority of proposed DLA systems [17,19,22]. Furthermore, advanced deep learning models were also used for empowering different DLA frameworks [7][8][9]23].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation