Two extensive studies quantifying the ability of topomer shape similarity to forecast a variety of biological similarities are described. In a prospective trial of "lead hopping", using topomer similarity for virtual screening and queries from the patent literature, biological assays of 308 selected compounds (representing 0.03% of those available, per assay type) yielded 11 successful "lead hops" in the 13 assays attempted. The hit rate averaged over all assays was 39% ("activity"defined as inhibition > or =20% at 10 microM), significantly greater than an unexpectedly high negative control hit rate of 15%. The average "Tanimoto 2D fingerprint similarity" between query and "lead hop" structures (0.36) was little more than the Tanimoto similarity between random drug-like structures. Topomer shape and Tanimoto 2D fingerprint similarities were also compared retrospectively, in their tendencies to concentrate together potential and actual drugs reported to belong to the same "activity class", for twenty classes. Among the most similar 3% of structures (corresponding to "> or =0.85 Tanimoto" for these structures), an average of 62% of the topomer similar selection possessed a near neighbor belonging to the same activity class, roughly a one-third superiority over the "Tanimoto > or = 0.85" selection containing 48% actives in avoiding false positives. Conversely, the least similar 75% of structures contained 0.3% actives for topomer similarity vs 1.0% actives for Tanimoto 2D fingerprint similarity, a 3-fold superiority for topomers in avoiding false negatives.
SYBYL line notation (SLN) is a powerful way to represent molecular structures, reactions, libraries of structures, molecular fragments, formulations, molecular queries, and reaction queries. Nearly any chemical structure imaginable, including macromolecules, pharmaceuticals, catalysts, and even combinatorial libraries can be represented as an SLN string. The language provides a rich syntax for database queries comparable to SMARTS. It provides full Markush, R-Group, reaction, and macro atom capabilities in a single unified notation. It includes the ability to specify 3D conformations and 2D depictions. All the information necessary to recreate the structure in a modeling or drawing package is present in a single, concise string of ASCII characters. This makes SLN ideal for structure communication over global computer networks between applications sitting at remote sites. Unlike SMILES and its derivatives, SLN accomplishes this within a single unified syntax. Structures, queries, compounds, reactions, and virtual libraries can all be represented in a single notation.
AllChem is a system that is intended to make practical the generation and searching of an unprecedentedly vast number ( approximately 10(20)) of synthetically accessible and medicinally relevant structures. Also, by providing possible synthetic routes to a structure along with its design rationale, AllChem encourages simultaneous consideration of both costs and benefits during each lead discovery and optimization decision, thereby promising to be effective with synthetic chemists among its primary users. AllChem is still under intensive development so the following initial description necessarily has more the character of an interim progress report than of a finished research publication.
The hypothesis underlying topomer development is that describing molecular structures consistently may be at least as productive as describing them more realistically but incompletely. A general protocol is detailed for deterministically generating self-consistent shapes of molecular fragments from their topologies. These and other extensions to the topomer methodology are validated by repetition of earlier benchmark studies.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.