Proceedings of the 2014 Indian Conference on Computer Vision Graphics and Image Processing 2014
DOI: 10.1145/2683483.2683550
|View full text |Cite
|
Sign up to set email alerts
|

Table Extraction from Document Images using Fixed Point Model

Abstract: The paper presents a novel learning-based framework to identify tables from scanned document images. The approach is designed as a structured labeling problem, which learns the layout of the document and labels its various entities as table header, table trailer, table cell and non-table region. We develop features which encode the foreground block characteristics and the contextual information. These features are provided to a fixed point model which learns the inter-relationship between the blocks. The fixed… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
3
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 20 publications
(3 citation statements)
references
References 30 publications
0
3
0
Order By: Relevance
“…The system shows a 75% overall detection rate which was not very promising. Bansal et al [6] presented a learningbased framework which identifies tables from scanned document images. They proposed a scheme for analyzing and labeling different document elements, their contexts, and finally to define and understand the table boundaries from the context informations.…”
Section: Related Work and Motivationmentioning
confidence: 99%
“…The system shows a 75% overall detection rate which was not very promising. Bansal et al [6] presented a learningbased framework which identifies tables from scanned document images. They proposed a scheme for analyzing and labeling different document elements, their contexts, and finally to define and understand the table boundaries from the context informations.…”
Section: Related Work and Motivationmentioning
confidence: 99%
“…They identify a set of features that can be used to segregated headers from tabular data and build a classifier to detect table headers. In [2], researchers design learning-based framework to identify tables, it is a structured labeling problem, which learns the layout of the document and labels its various entities as table header, table trailer, table cell and non-table region. They develop features which encode the foreground block characteristics and the contextual information.…”
Section: Introductionmentioning
confidence: 99%
“…use an SVM model to classify the attributes of each line to determine whether the line belongs to the table Barlas et al (2014). use an MLP model to classify the connected component in the document and determined whether it belongs to tables Bansal et al (2014). use the leptonica library Bloomberg (1991) to segment the document, and then construct features containing surrounding information for each region.…”
mentioning
confidence: 99%