2020
DOI: 10.1021/acs.jcim.0c00193
|View full text |Cite
|
Sign up to set email alerts
|

Impact of Chemist-In-The-Loop Molecular Representations on Machine Learning Outcomes

Abstract: The development of molecular descriptors is a central challenge in cheminformatics. Most approaches use algorithms that extract atomic environments or end-to-end machine learning. However, a looming question is that how do these approaches compare with the critical eye of trained chemists. The CAS fingerprint engages expert chemists to curate chemical motifs, which they deem could influence bioactivity. In this paper, we benchmark the CAS fingerprint against commonly used fingerprints using a well-established … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
12
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 14 publications
(12 citation statements)
references
References 16 publications
0
12
0
Order By: Relevance
“…Even though this review has no intention of providing an exhaustive historical account of the development of descriptors for atomic structures, it is worth providing a brief overview. A “data-driven” philosophy emerged early in the field of chemical and molecular science, where the combinatorial extent of the space of possible molecules, and the possibility of accessing this space with comparatively simple synthetic strategies, encouraged the development of quantitative structure–property relationship (QSPR) techniques, attempting to map descriptors of molecular structurebased on cheminformatics fingerprints, , chemical-intuition driven descriptors, molecular graphs, or indicators obtained from quantum chemical calculationsto the behavior of a selected compound, usually focusing on properties of direct applicative interest such as solubility, toxicity, or pharmacological activity. , …”
Section: Representations For Materials and Moleculesmentioning
confidence: 99%
“…Even though this review has no intention of providing an exhaustive historical account of the development of descriptors for atomic structures, it is worth providing a brief overview. A “data-driven” philosophy emerged early in the field of chemical and molecular science, where the combinatorial extent of the space of possible molecules, and the possibility of accessing this space with comparatively simple synthetic strategies, encouraged the development of quantitative structure–property relationship (QSPR) techniques, attempting to map descriptors of molecular structurebased on cheminformatics fingerprints, , chemical-intuition driven descriptors, molecular graphs, or indicators obtained from quantum chemical calculationsto the behavior of a selected compound, usually focusing on properties of direct applicative interest such as solubility, toxicity, or pharmacological activity. , …”
Section: Representations For Materials and Moleculesmentioning
confidence: 99%
“…The US is the leading country of origin: 15 of the 34 papers in Tables 1−3 are affiliated with US organizations. Other countries with significant numbers of important AI documents are Germany (6) and Switzerland (5). Among organizations, the Massachusetts Institute of Technology (US) and the University of Basel (Switzerland) were the two biggest contributors.…”
Section: Publicationsmentioning
confidence: 99%
“…Even though this review has no intention of providing a comprehensive discussion of the development of descriptors for atomic structures, it is worth providing a brief overview. A "data-driven" philosophy emerged early in the field of chemical and molecular science, where the combinatorial extent of the space of possible molecules, 31 and the possibility of accessing this space with comparatively simple synthetic strategies, encouraged the development of quantitative structure/property relationships (QSPR) techniques, attempting to map 32 descriptors of molecular structure -based on cheminformatics fingerprints, 33,34 chemical-intuition driven descriptors 35 , molecular graphs, 36 or indicators obtained from quantum chemical calculations 37 -to the behavior of a selected compound, usually focusing on properties of direct applicative interest [38][39][40] such as solubility, toxicity, 41 or pharmacological activity. 42,43 Cartesian coordinates This approach should be contrasted with that of "bottom-up" predictions, that aim to use models of the interactions between the atomic constituents of a material to simulate the behavior of the system on an atomic time and length scale.…”
Section: Representations For Materials and Moleculesmentioning
confidence: 99%
“…135 to learn interatomic forces. The fact that equivariant features of the form (35) follow O(3) transformation rules means that they can be combined using established relationships in the quantum theory of angular momentum. In particular, the coupled-basis representation used in the definition of Eqs.…”
Section: G Equivariant Representations and Tensorial Featuresmentioning
confidence: 99%
See 1 more Smart Citation