We present Fleksy, a new approach to consider both ligand and receptor flexibility in small molecule docking. Pivotal to our method is the use of a receptor ensemble to describe protein flexibility. To construct these ensembles, we use a backbone-dependent rotamer library and implement the concept of interaction sampling. The latter allows the evaluation of different orientations of ambivalent interaction partners. The docking stage consists of an ensemble-based soft-docking experiment using FlexX-Ensemble, followed by an effective flexible receptor-ligand complex optimization using Yasara. Fleksy produces a set of receptor-ligand complexes ranked using a consensus scoring function combining docking scores and force field energies. Averaged over three cross-docking datasets, containing 35 different receptor-ligand complexes in total, Fleksy reproduces the observed binding mode within 2.0 Å for 78% of the complexes. This compares favorably to the rigid receptor FlexX program, which on average reaches a success rate of 44% for these datasets.
Bioisosteres have been defined as structurally different molecules or substructures that can form comparable intermolecular interactions, and therefore, fragments that bind to similar protein structures exhibit a degree of bioisosterism. We present KRIPO (Key Representation of Interaction in POckets): a new method for quantifying the similarities of binding site subpockets based on pharmacophore fingerprints. The binding site fingerprints have been optimized to improve their performance for both intra- and interprotein family comparisons. A range of attributes of the fingerprints was considered in the optimization, including the placement of pharmacophore features, whether or not the fingerprints are fuzzified, and the resolution and complexity of the pharmacophore fingerprints (2-, 3-, and 4-point fingerprints). Fuzzy 3-point pharmacophore fingerprints were found to represent the optimal balance between computational resource requirements and the identification of potential replacements. The complete PDB was converted into a database comprising almost 300,000 optimized fingerprints of local binding sites together with their associated ligand fragments. The value of the approach is demonstrated by application to two crystal structures from the Protein Data Bank: (1) a MAP kinase P38 structure in complex with a pyridinylimidazole inhibitor (1A9U) and (2) a complex of thrombin with melagatran (1K22). Potentially valuable bioisosteric replacements for all subpockets of the two studied protein are identified.
Electronic properties located on the atoms of a molecule such as partial atomic charges as well as electronegativity and polarizability values are encoded by an autocorrelation vector accounting for the constitution of a molecule. This encoding procedure is able to distinguish between compounds being dopamine agonists and those being benzodiazepine receptor agonists even after projection into a two-dimensional self-organizing network. The two types of compounds can still be distinguished if they are buried in a dataset of 8323 compounds of a chemical supplier catalog comprising a wide structural variety. The maps obtained by this sequence of events, calculation of empirical physicochemical effects, encoding in a topological autocorrelation vector, and projection by a self-organizing neural network, can thus be used for searching for structural similarity, and, in particular, for finding new lead structures with biological activity.
Predictions of potential metabolites based on chemical structure are becoming increasingly important in drug discovery to guide medicinal chemistry efforts that address metabolic issues and to support experimental metabolite screening and identification. Herein we present a novel rule-based method, SyGMa (Systematic Generation of potential Metabolites), to predict the potential metabolites of a given parent structure. A set of reaction rules covering a broad range of phase 1 and phase 2 metabolism has been derived from metabolic reactions reported in the Metabolite Database to occur in humans. An empirical probability score is assigned to each rule representing the fraction of correctly predicted metabolites in the training database. This score is used to refine the rules and to rank predicted metabolites. The current rule set of SyGMa covers approximately 70 % of biotransformation reactions observed in humans. Evaluation of the rule-based predictions demonstrated a significant enrichment of true metabolites in the top of the ranking list: while in total, 68 % of all observed metabolites in an independent test set were reproduced by SyGMa, a large part, 30 % of the observed metabolites, were identified among the top three predictions. From a subset of cytochrome P450 specific metabolites, 84 % were reproduced overall, with 66 % in the top three predicted phase 1 metabolites. A similarity analysis of the reactions present in the database was performed to obtain an overview of the metabolic reactions predicted by SyGMa and to support ongoing efforts to extend the rules. Specific examples demonstrate the use of SyGMa in experimental metabolite identification and the application of SyGMa to suggest chemical modifications that improve the metabolic stability of compounds.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.