2012
DOI: 10.1186/1471-2105-13-s17-s9
|View full text |Cite
|
Sign up to set email alerts
|

ChemEx: information extraction system for chemical data curation

Abstract: Background Manual chemical data curation from publications is error-prone, time consuming, and hard to maintain up-to-date data sets. Automatic information extraction can be used as a tool to reduce these problems. Since chemical structures usually described in images, information extraction needs to combine structure image recognition and text mining together. Results We have developed ChemEx, a chemical information extraction system. ChemEx processes … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2013
2013
2022
2022

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 25 publications
(17 citation statements)
references
References 19 publications
0
16
0
Order By: Relevance
“…The data were collected at two different static magnetic field strengths (500 and 800 MHz) generating a series of 2 x 21 2D 1 H-15 N correlation maps measured as a function of CPMG frequency (S20 Fig). The spectra were processed using standard approaches and the program chemex [86,87] assuming a 2-state reaction [40,88], which has been applied to a few other domains [31,38]. Global fitted values of k ex and p bound were extracted (224 s -1 and 0.08) from these data.…”
Section: Methodsmentioning
confidence: 99%
“…The data were collected at two different static magnetic field strengths (500 and 800 MHz) generating a series of 2 x 21 2D 1 H-15 N correlation maps measured as a function of CPMG frequency (S20 Fig). The spectra were processed using standard approaches and the program chemex [86,87] assuming a 2-state reaction [40,88], which has been applied to a few other domains [31,38]. Global fitted values of k ex and p bound were extracted (224 s -1 and 0.08) from these data.…”
Section: Methodsmentioning
confidence: 99%
“…There is growing interest in hybrid machine learning and dictionary systems such as the one described in [ 10 ], which obtains interesting performance on chemical entity recognition in patent texts. The authors of [ 55 ] use different approaches for different entity types (machine learning for chemical names, dictionary-based for organism and assay entities); given the complementary application, this is not a hybrid approach in the strict sense. A contrastive overview that also covers rule-based approaches is given in [ 56 ].…”
Section: Discussionmentioning
confidence: 99%
“…Due to OSRA being an open-source tool, other developers were able to implement it in their projects and to develop improvements by using it in the backend. ChemEx is a tool that combines OSRA in combination with a text mining workflow in order to mine natural-product-related data from scientific publications [ 19 ]. In 2020, the tool ChemSchematicResolver was published with Python bindings for OSRA (PyOSRA).…”
Section: Rule-based Systemsmentioning
confidence: 99%