A procedure for analyzing and classifying publicly available crystal structures has been developed. It has been used to identify high-resolution protein-ligand complexes that can be assessed by reconstructing the electron density for the ligand using the deposited structure factors. The complexes have been clustered according to the protein sequences, and clusters have been discarded if they do not represent proteins thought to be of direct interest to the pharmaceutical or agrochemical industry. Rules have been used to exclude complexes containing non-drug-like ligands. One complex from each cluster has been selected where a structure of sufficient quality was available. The final Astex diverse set contains 85 diverse, relevant protein-ligand complexes, which have been prepared in a format suitable for docking and are to be made freely available to the entire research community (http://www.ccdc.cam.ac.uk). The performance of the docking program GOLD against the new set is assessed using a variety of protocols. Relatively unbiased protocols give success rates of approximately 80% for redocking into native structures, but it is possible to get success rates of over 90% with some protocols.
We implemented a novel approach to score water mediation and displacement in the protein-ligand docking program GOLD. The method allows water molecules to switch on and off and to rotate around their three principal axes. A constant penalty, sigma(p), representing the loss of rigid-body entropy, is added for water molecules that are switched on, hence rewarding water displacement. We tested the methodology in an extensive validation study. First, sigma(p) is optimized against a training set of 58 protein-ligand complexes. For this training set, our algorithm correctly predicts water mediation/displacement in approximately 92% of the cases. We observed small improvements in the quality of the predicted binding modes for water-mediated complexes. In the second part of this work, an entirely independent set of 225 complexes is used. For this test set, our algorithm correctly predicts water mediation/displacement in approximately 93% of the cases. Improvements in binding mode quality were observed for individual water-mediated complexes.
Inhibitors of the molecular chaperone heat shock protein 90 (Hsp90) are currently generating significant interest in clinical development as potential treatments for cancer. In a preceding publication (DOI: 10.1021/jm100059d ) we describe Astex's approach to screening fragments against Hsp90 and the subsequent optimization of two hits into leads with inhibitory activities in the low nanomolar range. This paper describes the structure guided optimization of the 2,4-dihydroxybenzamide lead molecule 1 and details some of the drug discovery strategies employed in the identification of AT13387 (35), which has progressed through preclinical development and is currently being tested in man.
Inhibitors of the chaperone Hsp90 are potentially useful as chemotherapeutic agents in cancer. This paper describes an application of fragment screening to Hsp90 using a combination of NMR and high throughput X-ray crystallography. The screening identified an aminopyrimidine with affinity in the high micromolar range and subsequent structure-based design allowed its optimization into a low nanomolar series with good ligand efficiency. A phenolic chemotype was also identified in fragment screening and was found to bind with affinity close to 1 mM. This fragment was optimized using structure based design into a resorcinol lead which has subnanomolar affinity for Hsp90, excellent cell potency, and good ligand efficiency. This fragment to lead campaign improved affinity for Hsp90 by over 1,000,000-fold with the addition of only six heavy atoms. The companion paper (DOI: 10.1021/jm100060b) describes how the resorcinol lead was optimized into a compound that is now in clinical trials for the treatment of cancer.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.