Proceedings of the 27th ACM International Conference on Information and Knowledge Management 2018
DOI: 10.1145/3269206.3269265

Extracting Figures and Captions from Scientific Publications

Abstract: Figures and captions convey essential information in scientific publications. As such, there is a growing interest in mining published figures and in utilizing their respective captions as a source of knowledge. There is also much interest in image captioning systems that can automatically generate captions for images, whose training requires large datasets of image-caption pairs. Notably, the first fundamental step of obtaining figures and captions from publications is neither well-studied nor yet well-addres…

Cited by 5 publications
(3 citation statements)
References 15 publications
“…The prevalence of text with visuals has led researchers to explore how readers specifically understand information in figures with accompanying text in several domains. Li et al [25] conducted studies to demonstrate that figures with text can convey essential information and better aid understanding than just text alone for scientific publications in a biomedical domain. Odell et al [34] demonstrated that having text that accurately describes important findings in medical diagnostic images can increase physicians' speed and accuracy on Bayesian reasoning tasks while making life-critical judgments for patients.…”
Section: Cognitive Understanding Of Charts (mentioning)
confidence: 99%
“…In future developments, the authors envision the implementation of new components to further enable multi-faceted content exploration. Such as the development of a storytelling component that connects geographic places and metadata to maps [2] and software that extracts images and caption from books [5] and integrate them into the ARCA knowledge graph.…”
Section: Discussion (mentioning)
confidence: 99%
“…This is a substantial oversight, given that figures often encapsulate the critical results of a study, and these data are not often found in the text sections in a given publication. There are only a few cases where data extraction from figures in scientific papers has been attempted. In particular, Nandy et al digitized thermalgravimetric analysis (TGA) data for approximately three thousand metal–organic framework papers. While their success is noteworthy, to expand this to a broader scope in materials science, there exists a pressing need to develop tools that can facilitate obtaining data found in figures.…”
Section: Introduction (mentioning)
confidence: 99%