Andrey V. Skorenko scite author profile

Solubility of organic compounds in DMSO is an important issue for commercial and academic organizations handling large compound collections or performing biological screening. In particular, solubility data are critical for the optimization of storage conditions and for the selection of compounds for bioscreening compatible with the assay protocol. Solubility is largely determined by the solvation energy and the crystal disruption energy, and these molecular phenomena should be assessed in structure-solubility correlation studies. The authors summarize our long-term experimental observations and theoretical studies of physicochemical determinants of DMSO solubility of organic substances. They compiled a comprehensive reference database of proprietary data on compound solubility (55,277 compounds with good DMSO solubility and 10,223 compounds with poor DMSO solubility), calculated specific molecular descriptors (topological, electromagnetic, charge, and lipophilicity parameters), and applied an advanced machine-learning approach for training neural networks to address the solubility. Both supervised (feed-forward, back-propagated neural networks) and unsupervised (Kohonen neural networks) learning methods were used. The resulting neural network models were validated by successfully predicting DMSO solubility of compounds in independent test selections. (Journal of Biomolecular Screening 2004:22-31)

show abstract

Advanced Exact Structure Searching in Large Databases of Chemical Compounds

Trepalin¹,

Skorenko²,

Balakin³

et al. 2003

J. Chem. Inf. Comput. Sci.

View full text Add to dashboard Cite

Efficient recognition of tautomeric compound forms in large corporate or commercially available compound databases is a difficult and labor intensive task. Our data indicate that up to 0.5% of commercially available compound collections for bioscreening contain tautomers. Though in the large registry databases, such as Beilstein and CAS, the tautomers are found in an automated fashion using high-performance computational technologies, their real-time recognition in the nonregistry corporate databases, as a rule, remains problematic. We have developed an effective algorithm for tautomer searching based on the proprietary chemoinformatics platform. This algorithm reduces the compound to a canonical structure. This feature enables rapid, automated computer searching of most of the known tautomeric transformations that occur in databases of organic compounds. Another useful extension of this methodology is related to the ability to effectively search for different forms of compounds that contain ionic and semipolar bonds. The computations are performed in the Windows environment on a standard personal computer, a very useful feature. The practical application of the proposed methodology is illustrated by several examples of successful recovery of tautomers and different forms of ionic compounds from real commercially available nonregistry databases.

show abstract

Kohonen Maps for Prediction of Binding to Human Cytochrome P450 3a4

Balakin¹,

Ekins²,

Bugrim³

et al. 2004

Drug Metab Dispos

View full text Add to dashboard Cite

The drug development process utilizes the parallel assessment of activity at a therapeutic target as well as absorption, distribution, metabolism, excretion, and toxicity properties of molecules. The development of novel, reliable, and inexpensive computational methods for the early assessment of metabolism and toxicity is becoming increasingly an important part of this process. We have used a computational approach for the assessment of drugs and drug-like compounds which bind to the cytochromes P450 (P450s) with experimentally determined Km values. The physicochemical properties of these compounds were calculated using molecular descriptor software and then analyzed using Kohonen self-organizing maps. This approach was applied to generate a P450-specific classification of nearly 500 drug compounds. We observed statistically significant differences in the molecular properties of low Km molecules for various P450s and suggest a relationship between 33 of these compounds and their CYP3A4-inhibitory activity. A test set of additional CYP3A4 inhibitors was used, and 13 of 15 of these molecules were colocated in the regions of low Km values. This computational approach represents a novel method for use in the generation of metabolism models, enabling the scoring of libraries of compounds for their Km values to numerous P450s.

show abstract

Structure-Based versus Property-Based Approaches in the Design of G-Protein-Coupled Receptor-Targeted Libraries

Balakin¹,

Lang²,

Skorenko³

et al. 2003

J. Chem. Inf. Comput. Sci.

View full text Add to dashboard Cite

In this work, two alternative approaches to the design of small-molecule libraries targeted for several G-protein-coupled receptor (GPCR) classes were explored. The first approach relies on the selection of structural analogues of known active compounds using a substructural similarity method. The second approach, based on an artificial neural network classification procedure, searches for compounds that possess physicochemical properties typical of the GPCR-specific agents. As a reference base, 3365 GPCR-active agents belonging to nine different GPCR classes were used. General rules were developed which enabled us to assess possible areas where both approaches would be useful. The predictability of the neural network algorithm based on 14 physicochemical descriptors was found to exceed the predictability of the similarity-based approach. The structural diversity of high-scored subsets obtained with the neural network-based method exceeded the diversity obtained with the similarity-based approach. In addition, the descriptor distributions of the compounds selected by the neural network algorithm more closely approximate the corresponding distributions of the real, active compounds than did those selected using the alternative method.

show abstract

Synthesis, Structure and Properties of Some Thiophene‐Containing Sulfamido Acids.

et al. 2005

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.