2023
DOI: 10.1093/bib/bbac626
|View full text |Cite
|
Sign up to set email alerts
|

Reducing false positive rate of docking-based virtual screening by active learning

Abstract: Machine learning-based scoring functions (MLSFs) have become a very favorable alternative to classical scoring functions because of their potential superior screening performance. However, the information of negative data used to construct MLSFs was rarely reported in the literature, and meanwhile the putative inactive molecules recorded in existing databases usually have obvious bias from active molecules. Here we proposed an easy-to-use method named AMLSF that combines active learning using negative molecula… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(4 citation statements)
references
References 45 publications
0
4
0
Order By: Relevance
“…Accurate prediction of protein–ligand binding affinity remains one of the grand challenges of computational chemistry and biology. 1–3 With the ever increasing amount of high-resolution experimentally determined protein–ligand structures, 4 the binding affinity prediction methods have switched from physics-based 5–11 to empirical scoring functions 12–15 and knowledge-based, 16,17 and in the last decade to machine learning 18–27 and deep learning based methods. 28–38 Especially, deep learning is an end-to-end method that is ideally suited to find hidden nonlinear relationships 39 between 3D protein–ligand complex structures and binding affinity.…”
Section: Introductionmentioning
confidence: 99%
“…Accurate prediction of protein–ligand binding affinity remains one of the grand challenges of computational chemistry and biology. 1–3 With the ever increasing amount of high-resolution experimentally determined protein–ligand structures, 4 the binding affinity prediction methods have switched from physics-based 5–11 to empirical scoring functions 12–15 and knowledge-based, 16,17 and in the last decade to machine learning 18–27 and deep learning based methods. 28–38 Especially, deep learning is an end-to-end method that is ideally suited to find hidden nonlinear relationships 39 between 3D protein–ligand complex structures and binding affinity.…”
Section: Introductionmentioning
confidence: 99%
“…In two cases where RMSD exceeded 1Å (PDB:5i96 and 5vv0), all the differences were in the part of the molecules exposed to the solvent, while poses inside the pocket were determined with high accuracy. Although the structures used appeared to predict binding correctly when tested with "native" ligands, the presence of false positives in virtual screens is inevitable [33]. Testing binding properties experimentally for all identified hits was not possible in the context of this study.…”
Section: Resultsmentioning
confidence: 89%
“…To remove the false positive results, the use of more than one docking software and clustering of the predicted conformations are common practices in the field of in silico research of bioactive compounds. For the same reason, to improve the molecular docking studies, another additional method used in the modern drug design research is represented by the molecular dynamics studies [60][61][62][63][64][65].…”
Section: Molecular Dockingmentioning
confidence: 99%