2021
DOI: 10.1186/s13321-021-00560-w
|View full text |Cite
|
Sign up to set email alerts
|

The impact of cross-docked poses on performance of machine learning classifier for protein–ligand binding pose prediction

Abstract: Structure-based drug design depends on the detailed knowledge of the three-dimensional (3D) structures of protein–ligand binding complexes, but accurate prediction of ligand-binding poses is still a major challenge for molecular docking due to deficiency of scoring functions (SFs) and ignorance of protein flexibility upon ligand binding. In this study, based on a cross-docking dataset dedicatedly constructed from the PDBbind database, we developed several XGBoost-trained classifiers to discriminate the near-na… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
39
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
2
1

Relationship

2
8

Authors

Journals

citations
Cited by 30 publications
(39 citation statements)
references
References 68 publications
0
39
0
Order By: Relevance
“… a “_Re” represents that the model is trained based on redocked poses in PDBbind-CrossDocked-Refined while “_Cross” represents that the model is trained based on cross-docked poses. Except for DeepDockM and RTMScore, the data of methods is copied from ref . b The top20 success rate of each docking method for a certain data set, and it represents the upper limit of the value in each column. c “re” represents the performance on redocked poses, while “cross” denotes that on cross-docked poses. …”
Section: Resultsmentioning
confidence: 99%
“… a “_Re” represents that the model is trained based on redocked poses in PDBbind-CrossDocked-Refined while “_Cross” represents that the model is trained based on cross-docked poses. Except for DeepDockM and RTMScore, the data of methods is copied from ref . b The top20 success rate of each docking method for a certain data set, and it represents the upper limit of the value in each column. c “re” represents the performance on redocked poses, while “cross” denotes that on cross-docked poses. …”
Section: Resultsmentioning
confidence: 99%
“…Additionally, artificially redocked poses also help models minimize the importance of ligand size. Many attempts have been made to distinguish between crystal and decoy poses, as well as to predict binding affinity with redocked poses 24,[83][84][85][86] . However, considerable care should be taken in loss function and model design because of data imbalance.…”
Section: Conclusion and Discussionmentioning
confidence: 99%
“…Additionally, artificially redocked poses also help models minimize the dependency on ligand size. Many attempts have been made to distinguish between crystal and decoy poses, and to predict binding affinity with redocked poses 24,[84][85][86][87] . However, caution should be taken in loss function and model design because of data imbalance.…”
Section: Conclusion and Discussionmentioning
confidence: 99%