2020
DOI: 10.1186/s13321-020-00465-0

A review of optical chemical structure recognition tools

Abstract: Structural information about chemical compounds is typically conveyed as 2D images of molecular structures in scientific documents. Unfortunately, these depictions are not a machine-readable representation of the molecules. With a backlog of decades of chemical literature in printed form not properly represented in open-access databases, there is a high demand for the translation of graphical molecular depictions into machine-readable formats. This translation process is known as Optical Chemical Structure Rec…

Cited by 61 publications (73 citation statements)
References 41 publications
“…The problem of optical chemical structure recognition has long been studied in computational chemistry [6][7][8][9][10][11][12], and the current state of the art was recently summarized in a review paper [13]. Most of the published approaches up to 2019 were rule-based methods. Such systems typically work by first vectorizing the input image (maybe even first identifying the image within a pdf page).…”
Section: Related Work (mentioning)
confidence: 99%
“…on Japanese Patent Office (JPO) data, obtained from Rajan et al. [13]. Note that this data set contains many textual labels, including Japanese characters, and irregular features, including line thickness variations.…”
Section: JPO a Collection of 365 Images and Molecule Descriptions Based (mentioning)
confidence: 99%
“…Over the course of the last three decades, there has been an active development in the field of Optical Chemical Structure Recognition (OCSR). OCSR is the translation of an image of a chemical structure into a machine-readable representation [4]. Most OCSR tools are only capable of processing images with pure chemical structure depictions.…”
Section: Introduction (mentioning)
confidence: 99%
“…Automatic extraction of a molecule from an image of its 2D chemical structure to a machine-readable format, termed optical chemical structure recognition, first emerged in the 1990s [13][14][15][16][17][18]. These systems were developed with the intent of mining ChemDraw type diagrams in the chemical literature to utilize the wealth of largely untapped chemical information that lies within publications. Over the following decades, more complex systems were developed, often based on the principles of their predecessors.…”
Section: Introduction (mentioning)
confidence: 99%
“…Over the following decades, more complex systems were developed, often based on the principles of their predecessors [18][19][20][21][22][23][24][25][26][27][28][29]. OSRA was the first chemical structure recognition open-source software, allowing new programs to be developed by direct extension.…”
Section: Introduction (mentioning)
confidence: 99%