2019
DOI: 10.1021/acs.jcim.9b00537
|View full text |Cite
|
Sign up to set email alerts
|

Development and Application of a Data-Driven Reaction Classification Model: Comparison of an Electronic Lab Notebook and Medicinal Chemistry Literature

Abstract: Reaction classification has often been considered an important task for many different applications, and has traditionally been accomplished using hand-coded rule-based approaches. However, the availability of large collections of reactions enables data-driven approaches to be developed. We present the development and validation of a 336-class machine learning-based classification model integrated within a Conformal Prediction (CP) framework to associate reaction class predictions with confidence estimations. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
44
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 44 publications
(45 citation statements)
references
References 29 publications
1
44
0
Order By: Relevance
“…The process starts with a set of classified reactions. We use the hierarchical reaction classification system SHREC [13] which is compatible with the reaction vector approach and is summarised briefly below. SHREC is distributed across four hierarchical levels ranging from general reaction categories to increasingly more specific subclasses.…”
Section: Overviewmentioning
confidence: 99%
See 3 more Smart Citations
“…The process starts with a set of classified reactions. We use the hierarchical reaction classification system SHREC [13] which is compatible with the reaction vector approach and is summarised briefly below. SHREC is distributed across four hierarchical levels ranging from general reaction categories to increasingly more specific subclasses.…”
Section: Overviewmentioning
confidence: 99%
“…Thus, each entry in the training set represents one or more starting materials and a multi-label classification represented as a binary vector formation reactions. The reaction classifier was trained on data extracted from the USPD which was originally annotated with NameRxn classes, and a table showing the mapping of the original NameRxn labels to the four-level SHREC is shown in the Supporting Information of Ghiandoni et al [13]. Note that the four-level hierarchical labelling in SHREC is not exhaustive in terms of nomenclature due to its bias towards the USPD and NameRxn.…”
Section: Overviewmentioning
confidence: 99%
See 2 more Smart Citations
“…Unfortunately, the limited set of reaction classes used (only 50 most important ones) makes it difficult to judge how this algorithm would perform on a more comprehensive set. Recent work by Ghiandoni et al 13 introduced an alternative hierarchical classification scheme and random forest classifier for reaction classification. However, the model requires reaction center information as input, which limits the applicability of the method.…”
Section: Introductionmentioning
confidence: 99%