2023
DOI: 10.1093/bib/bbad422
|View full text |Cite
|
Sign up to set email alerts
|

From intuition to AI: evolution of small molecule representations in drug discovery

Miles McGibbon,
Steven Shave,
Jie Dong
et al.

Abstract: Within drug discovery, the goal of AI scientists and cheminformaticians is to help identify molecular starting points that will develop into safe and efficacious drugs while reducing costs, time and failure rates. To achieve this goal, it is crucial to represent molecules in a digital format that makes them machine-readable and facilitates the accurate prediction of properties that drive decision-making. Over the years, molecular representations have evolved from intuitive and human-readable formats to bespoke… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2024
2024
2025
2025

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 15 publications
(4 citation statements)
references
References 129 publications
0
4
0
Order By: Relevance
“…These machine learning techniques for quantitative structure-property relationship are of ever-increasing interest and mostly just the starting point of every state-of-the-art lab, whether for efficiency enhancing design-of-experiment approaches or autonomous experimental platforms. 3 , 4 Therefore, the ability to utilize such foundational techniques should be of great interest to any scientist planning to optimize or explore new compound in the chemical space of molecules.…”
Section: Expected Outcomesmentioning
confidence: 99%
“…These machine learning techniques for quantitative structure-property relationship are of ever-increasing interest and mostly just the starting point of every state-of-the-art lab, whether for efficiency enhancing design-of-experiment approaches or autonomous experimental platforms. 3 , 4 Therefore, the ability to utilize such foundational techniques should be of great interest to any scientist planning to optimize or explore new compound in the chemical space of molecules.…”
Section: Expected Outcomesmentioning
confidence: 99%
“…In modern chemistry and data analytics, the proper encoding of molecular and reactional data holds paramount significance for efficient storage, retrieval, and processing. Accurate representation of chemical structures is essential for maintaining the integrity and reliability of vast databases used in various fields such as drug discovery, material science, and computational chemistry. Proper molecular data encoding ensures a seamless exchange of information between different software tools and facilitates complex analyses and simulations.…”
Section: Introductionmentioning
confidence: 99%
“…20,21 Concurrently, due to the intrinsic link between molecular structure and its corresponding function, traditional molecular descriptors have primarily focused on capturing structural properties. 22 The combination of two-dimensional representations of molecular structures (such as various molecular fingerprints) with GNNs can further enhance the model's performance. 23 The introduction of the transformer in 2017 has revolutionized deep learning, particularly in natural language processing, computer vision, and biological sequence modeling.…”
Section: Introductionmentioning
confidence: 99%
“…Graph neural networks (GNNs) are a type of deep learning model specifically designed for processing various types of graph-structured data . Due to their excellent performance and ability to learn natural and expressive features from molecular graph representations using node and edge relationships, GNNs have been widely applied in different areas of drug design. , Concurrently, due to the intrinsic link between molecular structure and its corresponding function, traditional molecular descriptors have primarily focused on capturing structural properties . The combination of two-dimensional representations of molecular structures (such as various molecular fingerprints) with GNNs can further enhance the model’s performance …”
Section: Introductionmentioning
confidence: 99%