2020
DOI: 10.1039/c9ra09211k
|View full text |Cite
|
Sign up to set email alerts
|

Improved method of structure-based virtual screening based on ensemble learning

Abstract: Virtual screening has become a successful alternative and complementary technique to experimental high-throughput screening technologies for drug design. This paper proposed a target-specific virtual screening method based on ensemble learning named ENS-VS.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 12 publications
(11 citation statements)
references
References 46 publications
0
11
0
Order By: Relevance
“…In addition to evaluating this default ensemble, we also show results for the General ensemble, which combines the simplest model, Default2018, with the smallest training set, redocked poses from the 2016 PDBbind General set, and the Dense ensemble, which combines the largest model with the largest training set, CrossDocked2020 [ 45 ]. The variations in architecture and training data allow us to compare the effects of these aspects of the CNN scoring functions on virtual screening performance, while the ensembles themselves are expected to improve average predictive accuracy by reducing the effects of bias from individual learners [ 47 ] and in theory allow us to approximate the uncertainty in our predictions [ 48 , 49 ].…”
Section: Methodsmentioning
confidence: 99%
“…In addition to evaluating this default ensemble, we also show results for the General ensemble, which combines the simplest model, Default2018, with the smallest training set, redocked poses from the 2016 PDBbind General set, and the Dense ensemble, which combines the largest model with the largest training set, CrossDocked2020 [ 45 ]. The variations in architecture and training data allow us to compare the effects of these aspects of the CNN scoring functions on virtual screening performance, while the ensembles themselves are expected to improve average predictive accuracy by reducing the effects of bias from individual learners [ 47 ] and in theory allow us to approximate the uncertainty in our predictions [ 48 , 49 ].…”
Section: Methodsmentioning
confidence: 99%
“…In addition to evaluating this default ensemble, we also show results for the General ensemble, which combines the simplest model, Default2018, with the smallest training set, redocked poses from the 2016 PDBbind General set, and the Dense ensemble, which combines the largest model with the largest training set, CrossDocked2020 [44]. The variations in architecture and training data allow us to compare the effects of these aspects of the CNN scoring functions on virtual screening performance, while the ensembles themselves are expected to improve average predictive accuracy by reducing the effects of bias from individual learners [47] and in theory allow us to approximate the uncertainty in our predictions [48,49].…”
Section: Modelsmentioning
confidence: 99%
“…However, cross-docking comparative studies, more indicative of a real prospective study than self-docking, are much less common and indicate significantly lower pose prediction accuracy than self-docking. In addition, retrospective virtual screening studies rarely demonstrate area under the receiver operating curve (AUROC) values exceeding 0.8, although machine learning techniques have been proposed to improve this accuracy. , Among the challenges noted are the large chemical diversity in ligands, numerous classes of proteins with varying structural features, as well as biases in available testing sets. , …”
Section: Introductionmentioning
confidence: 99%