2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
DOI: 10.1109/cvpr52688.2022.00459
|View full text |Cite
|
Sign up to set email alerts
|

PubTables-1M: Towards comprehensive table extraction from unstructured documents

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
11
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 54 publications
(11 citation statements)
references
References 15 publications
0
11
0
Order By: Relevance
“…With the flourish of transformer [43], several DETR [2]like TSR methods are proposed to extract boundary in object detection paradigm. In work [40], a set of table elements are detected by DETR to model the hierarchical structure. TRUST [9] has the similar spirit with SP-LERGE [41], where the features of row/column separators are extracted for a further vertex-based merging module to predict the linking relations between adjacent cells, which thus still suffers from boundary ambiguity problem.…”
Section: Candidate Components Extraction Separator Proposals Generati...mentioning
confidence: 99%
“…With the flourish of transformer [43], several DETR [2]like TSR methods are proposed to extract boundary in object detection paradigm. In work [40], a set of table elements are detected by DETR to model the hierarchical structure. TRUST [9] has the similar spirit with SP-LERGE [41], where the features of row/column separators are extracted for a further vertex-based merging module to predict the linking relations between adjacent cells, which thus still suffers from boundary ambiguity problem.…”
Section: Candidate Components Extraction Separator Proposals Generati...mentioning
confidence: 99%
“…extraction as defined in [5] that adds layout information and encourages the development of end-to-end systems that can tackle multiple tasks at once;…”
Section: • We Define the Task Of Contextualized Table Extraction An E...mentioning
confidence: 99%
“…PubLayNet is a collection of 358, 353 PDF pages with five types of regions annotated (title, text, list, table, image) [4]. PubTables-1M [5] is a collection of 947, 642 fully annotated tables, including information for table detection, recognition, and functional analysis (such as identifying column headers, projected rows, and table cells). The datasets are built to address different tasks, as summarized in Table 1.…”
Section: Subset Of Publaynet and Pubtables-1mmentioning
confidence: 99%
See 1 more Smart Citation
“…Recent advances with the transformer architecture [35] have led to improvements within document image analysis fields, including table understanding. The TableTransformer [32] is a model for detecting tables and extracting table structure from images and PDF documents. SegFormer [36] and later DocSegTr [8] are other attempts to use the transformer architecture for general document and image segmentation.…”
Section: Document Image Analysismentioning
confidence: 99%