2020
DOI: 10.1039/d0cp00305k
|View full text |Cite
|
Sign up to set email alerts
|

Are 2D fingerprints still valuable for drug discovery?

Abstract: Recently, molecular fingerprints extracted from three-dimensional (3D) structures using advanced mathematics, such as algebraic topology, differential geometry, and graph theory have been paired with efficient machine learning, especially deep learning algorithms to outperform other methods in drug discovery applications and competitions. This raises the question of whether classical 2D fingerprints are still valuable in computer-aided drug discovery. This work considers 23 datasets associated with four typica… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
108
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
3
2

Relationship

1
9

Authors

Journals

citations
Cited by 101 publications
(108 citation statements)
references
References 60 publications
0
108
0
Order By: Relevance
“…Only 2D molecular descriptors were utilized to train machine learning models. There are fields that 3D molecular descriptors perform better then 2D ones [20]. Some other applications of machine learning in predictions of organic compounds emission wavelengths were published [21][22].…”
Section: Discussionmentioning
confidence: 99%
“…Only 2D molecular descriptors were utilized to train machine learning models. There are fields that 3D molecular descriptors perform better then 2D ones [20]. Some other applications of machine learning in predictions of organic compounds emission wavelengths were published [21][22].…”
Section: Discussionmentioning
confidence: 99%
“…The first approach, used as the baseline, employs the Extended Connectivity Fingerprint (ECFP) as molecular representation. These bit vectors are widely used in the prediction of physicochemical properties, biological activity or toxicity of chemical compounds [24]. The model output is a real number, which is the estimated pIC 50 .…”
Section: The Predictor Modelmentioning
confidence: 99%
“…Most 2D descriptors are calculated with absolute accuracy while the 3D descriptors carry the errors of the methodological approximations they have been calculated with (Raevsky et al, 2019). Admitting that the 3D descriptors provide more detailed information, such as atomic distances and energy data of the compounds, there is yet no clear evidence about their impacts on the solubility predictions (Balakin et al 2006;Gao et al, 2020;Yan et al, 2004;Salahinejad et al, 2013). Although a large number of chemical descriptors are available, it is usually preferred to use a modest number of relevant descriptors to avoid redundancy and overfitting issues during the training of ML models (Wang and Hou 2011).…”
Section: The Relevance Of Chemical Descriptorsmentioning
confidence: 99%