2013
DOI: 10.1021/ci300463g
|View full text |Cite
|
Sign up to set email alerts
|

Boosting Virtual Screening Enrichments with Data Fusion: Coalescing Hits from Two-Dimensional Fingerprints, Shape, and Docking

Abstract: Virtual screening is an effective way to find hits in drug discovery, with approaches ranging from fast information-based similarity methods to more computationally intensive physics-based docking methods. However, the best approach to use for a given project is not clear in advance of the screen. In this work, we show that combining results from multiple methods using a standard score (Z-score) can significantly improve virtual screening enrichments over any of the single screening methods. We show that an au… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
91
0
1

Year Published

2014
2014
2023
2023

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 77 publications
(94 citation statements)
references
References 75 publications
2
91
0
1
Order By: Relevance
“…Since the rank alone lacks information about the data spread, we instead used Z scores for the normalization of raw scores. 52 In addition, our Z scores are based on the median value instead of the average value for each scoring function, which reduces the sensitivity of Z scores to outliers.…”
Section: ■ Results and Discussionmentioning
confidence: 99%
“…Since the rank alone lacks information about the data spread, we instead used Z scores for the normalization of raw scores. 52 In addition, our Z scores are based on the median value instead of the average value for each scoring function, which reduces the sensitivity of Z scores to outliers.…”
Section: ■ Results and Discussionmentioning
confidence: 99%
“…Furthermore, our results were compared with recent similar studies such as rank- based group fusion by Chen et al [42] and standard score (Z-score) by Sastry et al [39]. In Chen et al study, the mean recall of their RKP method for MDDR1 data set range from 94.20 to 94.30, while in our method the minimum value of the upper band is 95.27 for Top10 and the maximum value is 99.95 for the Top100 method.…”
Section: Resultsmentioning
confidence: 78%
“…• Once all predictors have calculated their predictions, the predictions are combined ("fused", as in [7]) into one matrix.…”
Section: The Chemogenomics Pipelinementioning
confidence: 99%
“…Inside Janssen Pharmaceutica, a method is used which takes advantage of the fact that compounds with a similar structure often interact similarly with the same proteins. The Chemogenomics project inside Janssen Pharmaceutica attempts to identify candidate compounds by deriving information from existing compound-protein databases by means of machine learning methods [3,6,7]. To this end, a number of "predictor" programs have been developed.…”
Section: Introductionmentioning
confidence: 99%