Predicting human clearance with high accuracy from in silico-derived parameters alone is highly desirable, as it is fast, saves in vitro resources, and is animal-sparing. We derived random forest (RF) models from 1340 compounds with human intravenous pharmacokinetic (PK) data, the largest data set publicly available today. To assess the general applicability of the RF models, we systematically removed structural-therapeutic class analogues and other compounds with structural similarity from the training sets. For a quasi-prospective test set of 343 compounds, we show that RF models devoid of structurally similar compounds in the training set predict human clearance with a geometric mean fold error (GMFE) of 3.3. While the observed GMFE illustrates how difficult it is to generate a useful model that is broadly applicable, we posit that our RF models yield a more realistic assessment of how well human clearance can be predicted prospectively. We deployed the conformal prediction formalism to assess the model applicability and to determine the prediction confidence intervals for each prediction. We observed that clearance can be predicted better for renally cleared compounds than for other clearance mechanisms. We show that applying a classification model for predicting renal clearance identifies a subset of compounds for which clearance can be predicted with higher accuracy, yielding a GMFE of 2.3. In addition, our in silico RF human clearance models compared well to models derived from scaling human hepatocytes or preclinical in vivo data.
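The abstract reports accuracy as a geometric mean fold error (GMFE) but does not spell out the formula. A minimal sketch follows, assuming the commonly used definition, GMFE = 10^(mean |log10(predicted / observed)|). The function name `gmfe` and the clearance values are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
import math


def gmfe(predicted, observed):
    """Geometric mean fold error, assuming the common definition
    10 ** mean(|log10(predicted / observed)|)."""
    if len(predicted) != len(observed) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    # Absolute log10 ratio treats over- and under-prediction symmetrically.
    abs_log_errors = [
        abs(math.log10(p / o)) for p, o in zip(predicted, observed)
    ]
    return 10 ** (sum(abs_log_errors) / len(abs_log_errors))


# Hypothetical clearance values (illustrative only, not data from the study).
observed_cl = [1.0, 10.0, 5.0]
predicted_cl = [2.0, 5.0, 5.0]
example_gmfe = gmfe(predicted_cl, observed_cl)
```

Under this definition, a GMFE of 3.3 means predictions are off by a factor of about 3.3 on average, in either direction, and a perfect model would score 1.0.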