Motivation Identification of blood-brain barrier (BBB) permeability of a compound is a major challenge in neurotherapeutic drug discovery. Conventional approaches for BBB permeability measurement are expensive, time-consuming, and labor-intensive. BBB permeability is associated with diverse chemical properties of compounds. However, BBB permeability prediction models have been developed using small datasets and limited features, which are usually not practical due to their low coverage of chemical diversity of compounds. Aim of this study is to develop a BBB permeability prediction model using a large dataset for practical applications. This model can be used for facilitated compound screening in the early stage of brain drug discovery. Results A dataset of 7162 compounds with BBB permeability (5453 BBB+ and 1709 BBB-) was compiled from the literature, where BBB+ and BBB- denote BBB-permeable and non-permeable compounds, respectively. We trained a machine learning model based on Light Gradient Boosting Machine (LightGBM) algorithm and achieved an overall accuracy of 89%, an area under the curve (AUC) of 0.93, specificity of 0.77, and sensitivity of 0.93, when ten-fold cross-validation was performed. The model was further evaluated using 74 central nerve system (CNS) compounds (39 BBB+ and 35 BBB-) obtained from the literature and showed an accuracy of 90%, sensitivity of 0.85, and specificity of 0.94. Our model outperforms over existing BBB permeability prediction models. Availability The prediction server is available at http://ssbio.cau.ac.kr/software/bbb.
Motivation Blockade of the human ether-à-go-go-related gene (hERG) channel by small compounds causes a prolonged QT interval that can lead to severe cardiotoxicity and is a major cause of the many failures in drug development. Thus, evaluating the hERG-blocking activity of small compounds is important for successful drug development. To this end, various computational prediction tools have been developed, but their prediction performances in terms of sensitivity and negative predictive value (NPV) need to be improved to reduce false negative predictions. Results We propose a computational framework, DeepHIT, which predicts hERG blockers and non-blockers for input compounds. For the development of DeepHIT, we generated a large-scale gold-standard dataset, which includes 6632 hERG blockers and 7808 hERG non-blockers. DeepHIT is designed to contain three deep learning models to improve sensitivity and NPV, which, in turn, produce fewer false negative predictions. DeepHIT outperforms currently available tools in terms of accuracy (0.773), MCC (0.476), sensitivity (0.833) and NPV (0.643) on an external test dataset. We also developed an in silico chemical transformation module that generates virtual compounds from a seed compound, based on the known chemical transformation patterns. As a proof-of-concept study, we identified novel urotensin II receptor (UT) antagonists without hERG-blocking activity derived from a seed compound of a previously reported UT antagonist (KR-36676) with a strong hERG-blocking activity. In summary, DeepHIT will serve as a useful tool to predict hERG-induced cardiotoxicity of small compounds in the early stages of drug discovery and development. Availability and implementation https://bitbucket.org/krictai/deephit and https://bitbucket.org/krictai/chemtrans Supplementary information Supplementary data are available at Bioinformatics online.
Background: Drug candidates often cause an unwanted blockage of the potassium ion channel of the human ether-ago go related gene (hERG). The blockage leads to long QT syndrome (LQTS), which is a severe life-threatening cardiac side effect. Therefore, a virtual screening method to predict drug-induced hERG-related cardiotoxicity could facilitate drug discovery by filtering out toxic drug candidates. Result: In this study, we generated a reliable hERG-related cardiotoxicity dataset composed of 2130 compounds, which were carried out under constant conditions. Based on our dataset, we developed a computational hERG-related cardiotoxicity prediction model. The neural network model achieved an area under the receiver operating characteristic curve (AUC) of 0.764, with an accuracy of 90.1%, a Matthews correlation coefficient (MCC) of 0.368, a sensitivity of 0.321, and a specificity of 0.967, when tenfold cross-validation was performed. The model was further evaluated using ten drug compounds tested on guinea pigs and showed an accuracy of 80.0%, an MCC of 0.655, a sensitivity of 0.600, and a specificity of 1.000, which were better than the performances of existing hERG-toxicity prediction models. Conclusion: The neural network model can predict hERG-related cardiotoxicity of chemical compounds with a high accuracy. Therefore, the model can be applied to virtual high-throughput screening for drug candidates that do not cause cardiotoxicity. The prediction tool is available as a web-tool at http://ssbio.cau.ac.kr/CardPred.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.