The medicinal chemistry community has become increasingly aware of the value of tracking calculated physical properties such as molecular weight, topological polar surface area, rotatable bonds, and hydrogen bond donors and acceptors. We hypothesized that the shift to high-throughput synthetic practices over the past decade may be another factor that may predispose molecules to fail by steering discovery efforts toward achiral, aromatic compounds. We have proposed two simple and interpretable measures of the complexity of molecules prepared as potential drug candidates. The first is carbon bond saturation as defined by fraction sp(3) (Fsp(3)) where Fsp(3) = (number of sp(3) hybridized carbons/total carbon count). The second is simply whether a chiral carbon exists in the molecule. We demonstrate that both complexity (as measured by Fsp(3)) and the presence of chiral centers correlate with success as compounds transition from discovery, through clinical testing, to drugs. In an attempt to explain these observations, we further demonstrate that saturation correlates with solubility, an experimental physical property important to success in the drug discovery setting.
Molecular docking programs are widely used modeling tools for predicting ligand binding modes and structure based virtual screening. In this study, six molecular docking programs (DOCK, FlexX, GLIDE, ICM, PhDOCK, and Surflex) were evaluated using metrics intended to assess docking pose and virtual screening accuracy. Cognate ligand docking to 68 diverse, high-resolution X-ray complexes revealed that ICM, GLIDE, and Surflex generated ligand poses close to the X-ray conformation more often than the other docking programs. GLIDE and Surflex also outperformed the other docking programs when used for virtual screening, based on mean ROC AUC and ROC enrichment values obtained for the 40 protein targets in the Directory of Useful Decoys (DUD). Further analysis uncovered general trends in accuracy that are specific for particular protein families. Modifying basic parameters in the software was shown to have a significant effect on docking and virtual screening results, suggesting that expert knowledge is critical for optimizing the accuracy of these methods.
Summary
We describe the proceedings and conclusions from a “Workshop on Applications of Protein Models in Biomedical Research” that was held at University of California at San Francisco on 11 and 12 July, 2008. At the workshop, international scientists involved with structure modeling explored (i) how models are currently used in biomedical research, (ii) what the requirements and challenges for different applications are, and (iii) how the interaction between the computational and experimental research communities could be strengthened to advance the field.
Structural comparisons of SCD with representative members of the metalloproteinase superfamily clearly highlight the conservation of key secondary structural elements, in spite of major variations in the sequences including insertions and deletions of functional domains. However, the three-dimensional structure of SCD, which is generally closely related to the collagenases, shows significant differences not only in the peripheral regions but also in the specificity pockets; these latter differences should facilitate the rational design of specific inhibitors.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.