Proceedings of the 2019 International Conference on Management of Data 2019
DOI: 10.1145/3299869.3319867
|View full text |Cite
|
Sign up to set email alerts
|

Visual Segmentation for Information Extraction from Heterogeneous Visually Rich Documents

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 19 publications
(8 citation statements)
references
References 32 publications
0
7
0
1
Order By: Relevance
“…These NLP-based approaches typically operate on text sequences and do not incorporate visual and layout information. Later studies [6,16,32] firstly tend to explore layout information to aid entity extraction from VRDs. Post-OCR [13] reconstructs the text sequences based on their bounding boxes.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…These NLP-based approaches typically operate on text sequences and do not incorporate visual and layout information. Later studies [6,16,32] firstly tend to explore layout information to aid entity extraction from VRDs. Post-OCR [13] reconstructs the text sequences based on their bounding boxes.…”
Section: Related Workmentioning
confidence: 99%
“…Post-OCR [13] reconstructs the text sequences based on their bounding boxes. VS2 [32] leverages the heterogeneous layout to perform the extraction in visual logical blocks. A range of other methods [6,16,51] represent a document as a 2D grid with text tokens to obtain the contextual embedding.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…how the summaries came to be, the answer still remains illusive. Explaining how a machine-generated summary was constructed, has become a necessity under the newly introduced General Data Privacy Regulation Act (ITGP, 2017), especially for applications in enterprise (Sarkhel and Nandi, 2019;Keymanesh et al, 2020) and biomedical domain (Moradi and Ghadiri, 2018;Sarkhel et al, 2018). Some recent efforts have proposed using interpretable heatmaps (Baan et al, 2019) generated from the attention distribution over an input sequence for interpreting model behaviour.…”
Section: Prototype Summarymentioning
confidence: 99%
“…However, as far as the state of the art is concerned, there are many machine learning models which must be trained for general named entities to be robust (Peters et al, 2018;Akbik et al, 2018;Devlin et al, 2018). To further increase training efficiency, we can use the documents of a previously defined layout, so that the models could learn how to extract a particular piece of information (Zhao et al, 2019;Denk and Reisswig, 2019;Liu et al, 2019a;Sarkhel and Nandi, 2019). On the other hand, more general extractors are still needed to deal with a variety of information.…”
Section: Introductionmentioning
confidence: 99%