2021
DOI: 10.1007/978-3-030-68790-8_50
|View full text |Cite
|
Sign up to set email alerts
|

GFTE: Graph-Based Financial Table Extraction

Abstract: Tabular data is a crucial form of information expression, which can organize data in a standard structure for easy information retrieval and comparison. However, in financial industry and many other fields tables are often disclosed in unstructured digital files, e.g. Portable Document Format (PDF) and images, which are difficult to be extracted directly. In this paper, to facilitate deep learning based table extraction from unstructured digital files, we publish a standard Chinese dataset named FinTab, which … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
21
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 33 publications
(21 citation statements)
references
References 29 publications
0
21
0
Order By: Relevance
“…The early techniques about table detection and recognition can be found in the comprehensive survey [37]. With the great success of deep neural network in computer vision field, works began to focus on the image-based table with more general structures [21,30,24,9,13,36,23,14,33]. According to the basic components granularities, we roughly divide previous methods into two types: global-object-based methods and local-object-based methods.…”
Section: Related Workmentioning
confidence: 99%
See 4 more Smart Citations
“…The early techniques about table detection and recognition can be found in the comprehensive survey [37]. With the great success of deep neural network in computer vision field, works began to focus on the image-based table with more general structures [21,30,24,9,13,36,23,14,33]. According to the basic components granularities, we roughly divide previous methods into two types: global-object-based methods and local-object-based methods.…”
Section: Related Workmentioning
confidence: 99%
“…After that, a group of methods [36,23,38] tries to recover the cell relations based on some heuristic rules and algorithms. Another type of methods [11,2,14,24,26] treat the detected boxes as nodes in a graph and attempt to predict the relations based on techniques of Graph Neural Networks [29]. [14] predicts the relations between nodes in three classes (the horizontal connection, the vertical connection, no connection) using several features such as visual features, text positions, word embedding, etc.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations