2010
DOI: 10.2139/ssrn.3199484
|View full text |Cite
|
Sign up to set email alerts
|

Structured Literature Image Finder: Parsing Text and Figures in Biomedical Literature

Abstract: The SLIF project combines text-mining and image processing to extract structured information from biomedical literature.SLIF extracts images and their captions from published papers. The captions are automatically parsed for relevant biological entities (protein and cell type names), while the images are classified according to their type (e.g., micrograph or gel). Fluorescence microscopy images are further processed and classified according to the depicted subcellular localization.The results of this process … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2010
2010
2018
2018

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 9 publications
0
4
0
Order By: Relevance
“…[22]. A decade later, Ahmed et al [3,4] improved the model for mining captioned figures. The latest version combines text-mining and image processing to extract structured information from biomedical literature.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…[22]. A decade later, Ahmed et al [3,4] improved the model for mining captioned figures. The latest version combines text-mining and image processing to extract structured information from biomedical literature.…”
Section: Related Workmentioning
confidence: 99%
“…For each category, we randomly reserved 25% of the images for a testing set and trained the SVM model on the remaining 75%. We used sklearn's grid search method to tune the SVM parameters (kernel, gamma, and penalty parameter) 3 . Once the model parameters are tuned, we evaluate the model by using the testing set and then trained the final model with all images.…”
Section: Figure Classificationmentioning
confidence: 99%
“…An existing approach towards this problem involves finding the entities from texts (or even a search box) and associating them with images from medical literature (using caption processing, image processing, and topic discovery), and can be found in the Structured Literature Image Finder [1]. The training data used manually annotated images from different subcellular locations.…”
Section: The Role Of Visual Rhetoric In Semantic Multimedia and Relatmentioning
confidence: 99%
“…The structured literature image finder (SLIF) system applies an existing optical character recognition (OCR) system to recognize figure text and identify potential image pointers. SLIF then parses text and figures in biomedical literature by matching image pointers in images and captions [ 7 ]. Other researchers have also applied existing OCR tools to extract figure text and then incorporate the figure text for applications, e.g., image and document retrieval [ 5 , 11 ].…”
Section: Introductionmentioning
confidence: 99%