Document Domain Randomization for Deep Learning Document Layout Extraction

Ling, Meng; Chen, Jian; Möller, Torsten B.; Isenberg, Petra; Isenberg, Tobias; Sedlmair, Michael; Laramee, Robert S.; Shen, Han; Wu, Jian; Giles, C. Lee

doi:10.1007/978-3-030-86549-8_32

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2024

Publication Types

Select...

Article1

Other1

Relationship

Self Cite0

Independent2

Authors

Journals

Cited by 2 publications

References 34 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

Datasets and annotations for layout analysis of scientific articles

Gemelli,

Marinai,

Pisaneschi

et al. 2024

IJDAR

View full text Add to dashboard Cite

For a long time now, datasets containing scientific articles have been crucial to the analysis and recognition of document images. These document collections have frequently served as a testing ground for cutting-edge methods for optical character recognition, layout analysis, and document understanding in general. We thoroughly analyze and compare many datasets proposed for layout analysis of scientific documents, ranging from small collections of scanned papers to modern large-scale datasets containing digital-born papers, which have been proposed to train deep learning-based methods. Furthermore, we outline a detailed taxonomy of the annotation procedures used considering manual, automatic, and generative approaches, and we analyze their benefits and drawbacks. This survey is meant to provide the reader with a review of the most used benchmarks together with detailed information on data, annotations, and complexity, helping scholars to identify the most suitable dataset for their tasks of interest. We also discuss possible open problems to further enhance datasets to support research in the layout analysis of scientific articles.

show abstract