In vitro chemical safety testing methods offer the potential for efficient and economical tools to provide relevant assessments of human health risk. To realize this potential, methods are needed to relate in vitro effects to in vivo responses, i.e., in vitro to in vivo extrapolation (IVIVE). Currently available IVIVE approaches need to be refined before they can be utilized for regulatory decision-making. To explore the capabilities and limitations of IVIVE within this context, the U.S. Environmental Protection Agency Office of Research and Development and the National Toxicology Program Interagency Center for the Evaluation of Alternative Toxicological Methods co-organized a workshop and webinar series. Here, we integrate content from the webinars and workshop to discuss activities and resources that would promote inclusion of IVIVE in regulatory decision-making. We discuss properties of models that successfully generate predictions of in vivo doses from effective in vitro concentration, including the experimental systems that provide input parameters for these models, areas of success, and areas for improvement to reduce model uncertainty. Finally, we provide case studies on the uses of IVIVE in safety assessments, which highlight the respective differences, information requirements, and outcomes across various approaches when applied for decision-making.
Background The logarithmic acid dissociation constant pKa reflects the ionization of a chemical, which affects lipophilicity, solubility, protein binding, and ability to pass through the plasma membrane. Thus, pKa affects chemical absorption, distribution, metabolism, excretion, and toxicity properties. Multiple proprietary software packages exist for the prediction of pKa, but to the best of our knowledge no free and open-source programs exist for this purpose. Using a freely available data set and three machine learning approaches, we developed open-source models for pKa prediction. Methods The experimental strongest acidic and strongest basic pKa values in water for 7912 chemicals were obtained from DataWarrior, a freely available software package. Chemical structures were curated and standardized for quantitative structure–activity relationship (QSAR) modeling using KNIME, and a subset comprising 79% of the initial set was used for modeling. To evaluate different approaches to modeling, several datasets were constructed based on different processing of chemical structures with acidic and/or basic pKas. Continuous molecular descriptors, binary fingerprints, and fragment counts were generated using PaDEL, and pKa prediction models were created using three machine learning methods, (1) support vector machines (SVM) combined with k-nearest neighbors (kNN), (2) extreme gradient boosting (XGB) and (3) deep neural networks (DNN). Results The three methods delivered comparable performances on the training and test sets with a root-mean-squared error (RMSE) around 1.5 and a coefficient of determination (R2) around 0.80. Two commercial pKa predictors from ACD/Labs and ChemAxon were used to benchmark the three best models developed in this work, and performance of our models compared favorably to the commercial products. Conclusions This work provides multiple QSAR models to predict the strongest acidic and strongest basic pKas of chemicals, built using publicly available data, and provided as free and open-source software on GitHub.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.