When searching for new leads, testing molecules that are too "similar" is wasteful, but when investigating a lead, testing molecules that are "similar" to the lead is efficient. Two questions then arise. Which are the molecular descriptors that should be "similar"? How much "similarity" is enough? These questions are answered by demonstrating that, if a molecular descriptor is to be a valid and useful measure of "similarity" in drug discovery, a plot of differences in its values vs differences in biological activities for a set of related molecules will exhibit a characteristic trapezoidal distribution enhancement, revealing a "neighborhood behavior" for the descriptor. Applying this finding to 20 datasets allows 11 molecular diversity descriptors to be ranked by their validity for compound library design. In order of increasing frequency of usefulness, these are random numbers = log P = MR = strain energy < connectivity indices < 2D fingerprints (whole molecule) = atom pairs = autocorrelation indices < steric CoMFA fields = 2D fingerprints (side chain only) = H-bonding CoMFA fields.
A novel molecular descriptor (EVA) based upon calculated infrared range vibrational frequencies is evaluated for use in QSAR studies. The descriptor is invariant to both translation and rotation of the structures concerned. The method was applied to 11 QSAR datasets exhibiting both a range of biological endpoints and various degrees of structural diversity. This study demonstrates that robust QSAR models can be obtained using the EVA descriptor and examines the effect of EVA parameter changes on these models; recommendations are made as to the appropriate choice of parameters. The performance of EVA was found to be comparable in statistical terms to that of CoMFA, despite the fact that EVA does not require the generation of a structural alignment. Models derived using semiempirical (MOPAC AM1 and PM3) and AMBER mechanics calculated normal mode frequencies are compared, with the overall conclusion that the semiempirical methods perform equally well and both outperform the AMBER-based models.
The comparative molecular field analysis steric field of a single “topomeric” conformer is introduced as a molecular diversity descriptor particularly useful for combinatorial chemistry involving variations around a fixed “core”. Using this new descriptor, 736 commercially available thiols are divided into 231 bioisosteric clusters, whose compositions agree at least as well with medicinal chemical experience and intuition as do clusters derived from Tanimoto differences between 2D fragment occurrences. However, in practice topomeric steric fields complement 2D fingerprints, being the two most frequently useful descriptors yet found for neighborhood-based design of combinatorial libraries.
A new descriptor of molecular structure, EVA, for use in the derivation of robustly predictive QSAR relationships is described. It is based on theoretically derived normal coordinate frequencies, and has been used extensively and successfully in proprietary chemical discovery programmes within Shell Research. As a result of informal dissemination of the methodology, it is now being used successfully in related areas such as pharmaceutical drug discovery. Much of the experimental data used in development remain proprietary, and are not available for publication. This paper describes the method and illustrates its application to the calculation of nonproprietary data, log P(ow), in both explanatory and predictive modes. It will be followed by other publications illustrating its application to a range of data derived from biological systems.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.