2018
DOI: 10.1093/toxsci/kfy121
|View full text |Cite
|
Sign up to set email alerts
|

Predicting Drug-Induced Liver Injury Using Ensemble Learning Methods and Molecular Fingerprints

Abstract: Drug-induced liver injury (DILI) is a major safety concern in the drug-development process, and various methods have been proposed to predict the hepatotoxicity of compounds during the early stages of drug trials. In this study, we developed an ensemble model using 3 machine learning algorithms and 12 molecular fingerprints from a dataset containing 1241 diverse compounds. The ensemble model achieved an average accuracy of 71.1 ± 2.6%, sensitivity (SE) of 79.9 ± 3.6%, specificity (SP) of 60.3 ± 4.8%, and area … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

3
97
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 73 publications
(100 citation statements)
references
References 29 publications
3
97
0
Order By: Relevance
“…Predicting liver toxicity from chemical structure has been a preoccupation of over two decades, first starting somewhat timidly with local models [44,45], to move later also to global models [46,47]. Many of the previous models used relatively small size datasets (less than 400 compounds) [48,49]; studies based on datasets larger than DILIrank have also been published [11,17,[50][51][52], but the authors of DILIrank used a methodology that, in theory at least, was superior and more consistent with the totality of available clinical data. The total DILIrank dataset includes 1036 compounds, but 254 were classified as of "ambiguous DILI-concern", because the causality evidence was limited; besides, it took great care in defining DILI negatives, which varied among four large sources previously studied, labeling some compounds previously considered of no DILI concern as of less DILI concern.…”
Section: Discussionmentioning
confidence: 99%
“…Predicting liver toxicity from chemical structure has been a preoccupation of over two decades, first starting somewhat timidly with local models [44,45], to move later also to global models [46,47]. Many of the previous models used relatively small size datasets (less than 400 compounds) [48,49]; studies based on datasets larger than DILIrank have also been published [11,17,[50][51][52], but the authors of DILIrank used a methodology that, in theory at least, was superior and more consistent with the totality of available clinical data. The total DILIrank dataset includes 1036 compounds, but 254 were classified as of "ambiguous DILI-concern", because the causality evidence was limited; besides, it took great care in defining DILI negatives, which varied among four large sources previously studied, labeling some compounds previously considered of no DILI concern as of less DILI concern.…”
Section: Discussionmentioning
confidence: 99%
“…Accurate and timely prediction of drug induced liver injury remains a challenging research topic with a great potential impact in drug R&D. A number of efforts have been made to build in silico models that can predict DILI [32][33][34]. Since the datasets and feature selection vary, a direct comparison between these methods and our approach may be inaccurate.…”
Section: Discussionmentioning
confidence: 99%
“…Recently, Ai et al introduced an ensemble model for model improvement to better predict DILI compounds. 22 They used 12 types of molecular fingerprints and three machine learning algorithms to create 36 corresponding classifiers and took the averaged probabilistic values returned by each classifier. According to reported findings, Ai et al’s model is the current best model to predict DILI compounds.…”
Section: Introductionmentioning
confidence: 99%