Applying DEKOIS 2.0 in structure-based virtual screening to probe the impact of preparation procedures and score normalization

Ibrahim, Tamer M.; Bauer, Matthias R.; Boeckler, Frank M.

doi:10.1186/s13321-015-0074-6

Cited by 25 publications

(34 citation statements)

References 50 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Although we do not claim that our collection is comprehensive, there are no commonly used, state-of-the-art evaluation data sets available. Standardized benchmark sets are accessible for broadly applied modeling approaches such as pharmacophore searches [61] and molecular docking [16,62]. In contrast, the high diversity of the applied benchmark sets for binding site comparison makes it difficult to draw definitive conclusions in comparing the different tools.…”

Section: Benchmark Data Setsmentioning

confidence: 99%

“…Usually, published binding site comparison algorithms have been benchmarked using specific data sets, which are highly correlated with distinct application domains. However, standardized benchmark data sets, as known for other in silico methodologies [14][15][16], have never been developed for cavity comparison tools. This often precludes the selection of a suitable tool.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

A benchmark driven guide to binding site comparison: An exhaustive evaluation using tailor-made data sets (ProSPECCTs)

2018

View full text Add to dashboard Cite

The automated comparison of protein-ligand binding sites provides useful insights into yet unexplored site similarities. Various stages of computational and chemical biology research can benefit from this knowledge. The search for putative off-targets and the establishment of polypharmacological effects by comparing binding sites led to promising results for numerous projects. Although many cavity comparison methods are available, a comprehensive analysis to guide the choice of a tool for a specific application is wanting. Moreover, the broad variety of binding site modeling approaches, comparison algorithms, and scoring metrics impedes this choice. Herein, we aim to elucidate strengths and weaknesses of binding site comparison methodologies. A detailed benchmark study is the only possibility to rationalize the selection of appropriate tools for different scenarios. Specific evaluation data sets were developed to shed light on multiple aspects of binding site comparison. An assembly of all applied benchmark sets (ProSPECCTs–Protein Site Pairs for the Evaluation of Cavity Comparison Tools) is made available for the evaluation and optimization of further and still emerging methods. The results indicate the importance of such analyses to facilitate the choice of a methodology that complies with the requirements of a specific scientific challenge.

show abstract

Section: Benchmark Data Setsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

A benchmark driven guide to binding site comparison: An exhaustive evaluation using tailor-made data sets (ProSPECCTs)

2018

View full text Add to dashboard Cite

show abstract

“…New databases were designed with an increasing complexity in the decoys selection methodologies (see section Benchmarking Databases). Nowadays, benchmarking databases are widely used to evaluate various VS tools (Kellenberger et al, 2004 ; Warren et al, 2006 ; McGaughey et al, 2007 ; von Korff et al, 2009 ; Braga and Andrade, 2013 ; Ibrahim et al, 2015a ; Pei et al, 2015 ) and to support the identification of hit/lead compounds using LBVS and SBVS (Allen et al, 2015 ; Ruggeri et al, 2015 ).…”

Section: The History Of Decoys Selectionmentioning

confidence: 99%

“…On the contrary, the possible presence of active compounds in the decoy compounds set may introduce an artificial underestimation of the enrichment (Verdonk et al, 2004 ; Good and Oprea, 2008 ) since decoys are usually assumed to be inactive rather than proved to be true inactive compounds (i.e., confirmed inactive through experimental bioassays). New databases were designed to minimize those biases (Rohrer and Baumann, 2009 ; Vogel et al, 2011 ; Mysinger et al, 2012 ; Ibrahim et al, 2015a ). Finally, many studies pointed out that the VS performance depends on the target and its structural properties (structural flexibility, binding site physicochemical properties, etc.…”

Section: Introductionmentioning

confidence: 99%

“…; Cummings et al, 2005 ). Taking this into consideration, and despite the growing number of protein families represented in databases, decoy datasets generation tools were made publicly available in order to allow any scientist to fine-tune target-dependant and reliable benchmarking datasets (Mysinger et al, 2012 ; Ibrahim et al, 2015a ).…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Decoys Selection in Benchmarking Datasets: Overview and Perspectives

et al. 2018

View full text Add to dashboard Cite

Virtual Screening (VS) is designed to prospectively help identifying potential hits, i.e., compounds capable of interacting with a given target and potentially modulate its activity, out of large compound collections. Among the variety of methodologies, it is crucial to select the protocol that is the most adapted to the query/target system under study and that yields the most reliable output. To this aim, the performance of VS methods is commonly evaluated and compared by computing their ability to retrieve active compounds in benchmarking datasets. The benchmarking datasets contain a subset of known active compounds together with a subset of decoys, i.e., assumed non-active molecules. The composition of both the active and the decoy compounds subsets is critical to limit the biases in the evaluation of the VS methods. In this review, we focus on the selection of decoy compounds that has considerably changed over the years, from randomly selected compounds to highly customized or experimentally validated negative compounds. We first outline the evolution of decoys selection in benchmarking databases as well as current benchmarking databases that tend to minimize the introduction of biases, and secondly, we propose recommendations for the selection and the design of benchmarking datasets.

show abstract