2014
DOI: 10.1007/s10044-014-0412-8
|View full text |Cite
|
Sign up to set email alerts
|

Image-based logical document structure recognition

Abstract: The paper presents a complete solution for recognition of textual and graphic structures in various types of documents acquired from the Internet. In the proposed approach, the document structure recognition problem is divided into sub-problems. The first one is localizing logical structure elements within the document. The second one is recognizing segmented logical structure elements. The input to the method is an image of document page, the output is the XML file containing all graphic and textual elements … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(3 citation statements)
references
References 37 publications
0
3
0
Order By: Relevance
“…In [30], the authors rely on features derived from the geometry of the document and perform hierarchical graph coloring to retrieve the structure of postal mails. In [39] text-lines are grouped based on alignment, distance and graphical features like font, thickness and color to form homogeneous zones. It is also common to gradually merge connected components to obtain text-blocks in printed documents [4,37] Clustering methods are also applied to find text-lines using generic features, such as orientation features [40,71].…”
Section: Bottom-up or Data-driven Strategiesmentioning
confidence: 99%
“…In [30], the authors rely on features derived from the geometry of the document and perform hierarchical graph coloring to retrieve the structure of postal mails. In [39] text-lines are grouped based on alignment, distance and graphical features like font, thickness and color to form homogeneous zones. It is also common to gradually merge connected components to obtain text-blocks in printed documents [4,37] Clustering methods are also applied to find text-lines using generic features, such as orientation features [40,71].…”
Section: Bottom-up or Data-driven Strategiesmentioning
confidence: 99%
“…Carry out automatic image segmentation. Image segmentation is meant to separate distinct elements in an image from other elements [26]. After these distinctive elements have been separated, further operations can be performed, such as identifying individual elements or measuring their size.…”
Section: Blood Vein Detection Algorithmmentioning
confidence: 99%
“…Nowadays, academic papers are widely available from popular databases such as Google Scholar 1 and CiNii 2 in Japan. To make the best use of papers, there has been much research into the recognition of logical structures in documents [9] and keyword extraction from academic papers [3]. In particular, tables are often used to show statistics and experimental results in academic papers, while graphical structures, rather than tabular structures, are better suited to visually comparing many values at once.…”
Section: Introductionmentioning
confidence: 99%