Cardio-vascular safety beyond hERG: in silico modelling of a guinea pig right atrium assay

Fenu, Luca A.; Teisman, Ard; De Buck, Stefan; Sinha, Vikash; Gilissen, Emmanuel; Nijsen, Marjoleen; Mackie, Claire; Sanderson, Wendy

doi:10.1007/s10822-009-9306-z

Cited by 6 publications

(1 citation statement)

References 45 publications


“…The activity thresholds used to distinguish blockers and nonblockers ranged from 1 to 40 μM, suggesting a high variation in the training set compositions. While some considered data extracted from single or multiple assay/data sources, most studies used in-house data or proprietary data from the pharma industry that are not publicly accessible. A limited number of studies reported classification models based on hERG data extracted from publicly accessible bioactivity databases such as ChEMBL and PubChem. The heterogeneous activity data obtained from such databases were shown to possess a considerable level of experimental uncertainty, and recommendations were made regarding how such data must be curated before model development. Additional limitations such as small numbers of compounds used in modeling (often a few hundred), narrow or unreported applicability domains, and lack of proof of validation (e.g., Y-randomization tests) restrict the use of most previously published models.…”

Section: Introduction

confidence: 99%

The Catch-22 of Predicting hERG Blockade Using Publicly Accessible Bioactivity Data

Siramshetty, Chen, Devarakonda et al. 2018, J. Chem. Inf. Model.


Drug-induced inhibition of the human ether-à-go-go-related gene (hERG)-encoded potassium ion channels can lead to fatal cardiotoxicity. Several marketed drugs and promising drug candidates were recalled because of this concern. Diverse modeling methods ranging from molecular similarity assessment to quantitative structure-activity relationship analysis employing machine learning techniques have been applied to data sets of varying size and composition (number of blockers and nonblockers). In this study, we highlight the challenges involved in the development of a robust classifier for predicting the hERG end point using bioactivity data extracted from the public domain. To this end, three different modeling methods, nearest neighbors, random forests, and support vector machines, were employed to develop predictive models using different molecular descriptors, activity thresholds, and training set compositions. Our models demonstrated superior performance in external validations in comparison with those reported in the previous studies from which the data sets were extracted. The choice of descriptors had little influence on the model performance, with minor exceptions. The criteria used to filter bioactivity data, the activity threshold settings used to separate blockers from nonblockers, and the structural diversity of blockers in the training data set were found to be the crucial indicators of model performance. Training sets based on a binary threshold of 1 μM/10 μM to separate blockers (IC50/Ki ≤ 1 μM) from nonblockers (IC50/Ki > 10 μM) provided superior performance in comparison with those defined using a single threshold (1 μM or 10 μM). A major limitation in using the public domain hERG activity data is the abundance of blockers in comparison with nonblockers at usual activity thresholds, since not many studies report the latter.
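The binary 1 μM/10 μM labeling scheme described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' code. The function names, the sample compounds, and the choice to drop compounds whose activity falls between the two thresholds are assumptions made here for illustration; the abstract states only the two cutoffs. The single-threshold variant likewise assumes that values at or below the cutoff count as blockers.

```python
from typing import Optional

# Cutoffs taken from the abstract (IC50 or Ki, in micromolar).
BLOCKER_MAX_UM = 1.0       # activity <= 1 uM   -> blocker
NONBLOCKER_MIN_UM = 10.0   # activity > 10 uM   -> nonblocker


def label_dual_threshold(activity_um: float) -> Optional[str]:
    """Label a compound using the binary 1 uM / 10 uM scheme.

    Returns None for values in the gap (1 uM < activity <= 10 uM).
    Treating the gap as "excluded" is an assumption of this sketch.
    """
    if activity_um <= BLOCKER_MAX_UM:
        return "blocker"
    if activity_um > NONBLOCKER_MIN_UM:
        return "nonblocker"
    return None


def label_single_threshold(activity_um: float, threshold_um: float) -> str:
    """Label a compound using one cutoff (assumed: <= cutoff is a blocker)."""
    return "blocker" if activity_um <= threshold_um else "nonblocker"


def build_training_set(activities_um: dict) -> dict:
    """Keep only compounds that receive a label under the dual scheme."""
    labeled = {}
    for name, value in activities_um.items():
        label = label_dual_threshold(value)
        if label is not None:
            labeled[name] = label
    return labeled


# Hypothetical compounds with made-up activity values, for illustration only.
example = {"cmpd_A": 0.3, "cmpd_B": 4.0, "cmpd_C": 25.0}
training = build_training_set(example)
```

Under these assumptions, `cmpd_B` (4.0 μM) is left out of the dual-threshold training set, whereas a single 10 μM cutoff would label it a blocker and a single 1 μM cutoff would label it a nonblocker.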
