This work is aimed at describing the workflow for a methodology that combines chemoinformatics and pharmacoepidemiology methods and at reporting the first predictive model developed with this methodology. The new model is able to predict complex networks of AIDS prevalence in the US counties, taking into consideration the social determinants and activity/structure of anti-HIV drugs in preclinical assays. We trained different Artificial Neural Networks (ANNs) using as input information indices of social networks and molecular graphs. We used a Shannon information index based on the Gini coefficient to quantify the effect of income inequality in the social network. We obtained the data on AIDS prevalence and the Gini coefficient from the AIDSVu database of Emory University. We also used the Balaban information indices to quantify changes in the chemical structure of anti-HIV drugs. We obtained the data on anti-HIV drug activity and structure (SMILE codes) from the ChEMBL database. Last, we used Box-Jenkins moving average operators to quantify information about the deviations of drugs with respect to data subsets of reference (targets, organisms, experimental parameters, protocols). The best model found was a Linear Neural Network (LNN) with values of Accuracy, Specificity, and Sensitivity above 0.76 and AUROC > 0.80 in training and external validation series. This model generates a complex network of AIDS prevalence in the US at county level with respect to the preclinical activity of anti-HIV drugs in preclinical assays. To train/validate the model and predict the complex network we needed to analyze 43,249 data points including values of AIDS prevalence in 2,310 counties in the US vs ChEMBL results for 21,582 unique drugs, 9 viral or human protein targets, 4,856 protocols, and 10 possible experimental measures.
In this communication we carry out an in-depth review of a very versatile QSPR-like method. The method name is MARCH-INSIDE (MARkov CHains Ivariants for Network Selection and DEsign) and is a simple but efficient computational approach to the study of QSPR-like problems in biomedical sciences. The method uses the theory of Markov Chains to generate parameters that numerically describe the structure of a system. This approach generates two principal types of parameters Stochastic Topological Indices (sto-TIs). The use of these parameters allows the rapid collection, annotation, retrieval, comparison and mining structures of molecular, macromolecular, supramolecular, and non-molecular systems within large databases. Here, we review and comment by the first time on the several applications of MARCH-INSIDE to predict drugs ADMET, Activity, Metabolizing Enzymes, and Toxico-Proteomics biomarkers discovery. The MARCH-INSIDE models reviewed are: a) drug-tissue distribution profiles, b) assembling drug-tissue complex networks, c) multi-target models for anti-parasite/anti-microbial activity, c) assembling drug-target networks, d) drug toxicity and side effects, e) web-server for drug metabolizing enzymes, f) models in drugs toxico-proteomics. We close the review with some legal remarks related to the use of this class of QSPR-like models.
Quantitative Structure-Activity Relationship (QSAR) models have been used in Pharmaceutical design and Medicinal Chemistry for the discovery of anti-parasite drugs. QSAR models predict biological activity using as input different types of structural parameters of molecules. Topological Indices (TIs) are a very interesting class of these parameters. We can derive TIs from graph representations based on only nodes (atoms) and edges (chemical bonds). TIs are not time-consuming in terms of computational resources because they depend only on atom-atom connectivity information. This information expressed in the molecular graphs can be tabulated in the form of adjacency matrices easy to manipulate with computers. Consequently, TIs allow the rapid collection, annotation, retrieval, comparison and mining of molecular structures within large databases. The interest in TIs has exploded because we can use them to describe also macromolecular and macroscopic systems represented by complex networks of interactions (links) between the different parts of a system (nodes) such as: drug-target, protein-protein, metabolic, host-parasite, brain cortex, parasite disease spreading, Internet, or social networks. In this work, we review and comment on the following topics related to the use of TIs in anti-parasite drugs and target discovery. The first topic reviewed was: Topological Indices and QSAR for antiparasitic drugs. This topic included: Theoretical Background, QSAR for anti-malaria drugs, QSAR for anti-Toxoplasma drugs. The second topic was: TOMO-COMD approach to QSAR of antiparasitic drugs. We included in this topic: TOMO-COMD theoretical background and TOMO-COMD models for antihelmintic activity, Trichomonas, anti-malarials, anti-trypanosome compounds. The third section was inserted to discuss Topological Indices in the context of Complex Networks. The last section is devoted to the MARCH-INSIDE approach to QSAR of antiparasitic drugs and targets. This begins with a theoretical background for drugs and parameters for proteins. Next, we reviewed MARCH-INSIDE models for Pharmaceutical Design of antiparasitic drugs including: flukicidal drugs and anti-coccidial drugs. We close MARCH-NSIDE topic with a review of multi-target QSAR of antiparasitic drugs, MARCH-INSIDE assembly of complex networks of antiparasitic drugs. We closed the MARCH-INSIDE section discussing the prediction of proteins in parasites and MARCH-INSIDE web-servers for Protein-Protein interactions in parasites: Plasmod-PPI and Trypano-PPI web-servers. We closed this revision with an important section devoted to review some legal issues related to QSAR models.
Quantitative Structure-Activity/Property Relationships (QSAR/QSPR) models have been largely used for different kind of problems in Medicinal Chemistry and other Biosciences as well. Nevertheless, the applications of QSAR models have been restricted to the study of small molecules in the past. In this context, many authors use molecular graphs, atoms (nodes) connected by chemical bonds (links) to represent and numerically characterize the molecular structure. On the other hand, Complex Networks are useful in solving problems in drug research and industry, developing mathematical representations of different systems. These systems move in a wide range from relatively simple graph representations of drug molecular structures (molecular graphs used in classic QSAR) to large systems. We can cite for instance, drug-target interaction networks, protein structure networks, protein interaction networks (PINs), or drug treatment in large geographical disease spreading networks. In any case, all complex networks have essentially the same components: nodes (atoms, drugs, proteins, microorganisms and/or parasites, geographical areas, drug policy legislations, etc.) and links (chemical bonds, drug-target interactions, drug-parasite treatment, drug use, etc.). Consequently, we can use the same type of numeric parameters called Topological Indices (TIs) to describe the connectivity patterns in all these kinds of Complex Networks irrespective the nature of the object they represent and use these TIs to develop QSAR/QSPR models beyond the classic frontiers of drugs small-sized molecules. The goal of this work, in first instance, is to offer a common background to all the manuscripts presented in this special issue. In so doing, we make a review of the most used software and databases, common types of QSAR/QSPR models, and complex networks involving drugs or their targets. In addition, we review both classic TIs that have been used to describe the molecular structure of drugs and/or larger complex networks. In second instance, we use for the first time a Markov chain model to generalize Spectral moments to higher order analogues coined here as the Stochastic Spectral Moments TIs of order k (πk). Lastly, we report for the first time different QSAR/QSPR models for different classes of networks found in drug research, nature, technology, and social-legal sciences using πk values. This work updates our previous reviews Gonzalez-Diaz et al. Curr Top Med Chem. 2007; 7(10): 1015-29 and Gonzalez-Diaz et al. Curr Top Med Chem. 2008; 8(18):1676-90. It has been prepared in response to the kind invitation of the editor Prof. AB Reitz in commemoration of the 10th anniversary of this journal in 2010.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.