2022
DOI: 10.1021/acs.jcim.2c01088
|View full text |Cite
|
Sign up to set email alerts
|

Structural Analysis and Prediction of Hematotoxicity Using Deep Learning Approaches

Abstract: Hematotoxicity has been becoming a serious but overlooked toxicity in drug discovery. However, only a few in silico models have been reported for the prediction of hematotoxicity. In this study, we constructed a high-quality dataset comprising 759 hematotoxic compounds and 1623 nonhematotoxic compounds and then established a series of classification models based on a combination of seven machine learning (ML) algorithms and nine molecular representations. The results based on two data partitioning strategies a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
8

Relationship

1
7

Authors

Journals

citations
Cited by 13 publications
(5 citation statements)
references
References 61 publications
0
5
0
Order By: Relevance
“…In this context, SARpy software was employed to extract SAs, and the accuracy was calculated for SA analysis. We established specific thresholds for SA extraction in both data sets, wherein the occurrence frequency ( N P ) of a SA in toxic compounds had to be greater than or equal to 8, and the accuracy, defined as the ratio of toxic compounds containing the SA to all compounds containing the SA, had to be higher than 0.70 . In the end, we selected 10 SAs for each data set, which are presented in Tables and .…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…In this context, SARpy software was employed to extract SAs, and the accuracy was calculated for SA analysis. We established specific thresholds for SA extraction in both data sets, wherein the occurrence frequency ( N P ) of a SA in toxic compounds had to be greater than or equal to 8, and the accuracy, defined as the ratio of toxic compounds containing the SA to all compounds containing the SA, had to be higher than 0.70 . In the end, we selected 10 SAs for each data set, which are presented in Tables and .…”
Section: Resultsmentioning
confidence: 99%
“…DL has been increasingly applied in various research domains. While in many instances, DL models outperform traditional ML models in compound property prediction, , it is important to note that when the data set is limited in size, ML often yields better results than DL. ,, Furthermore, DL algorithms are considered to perform well on large data sets, as they can learn more complex features from the training set . In the models we constructed, due to the use of undersampling methods, the modeling data rely on a limited number of samples.…”
Section: Discussionmentioning
confidence: 99%
“…We further used the SHAP method to interpret the result analyzed by the retrained LSTM model . Among all fingerprints, MACCS_keys_86 had the strongest predictive value for our model, followed quite closely by MACCS_keys_53.…”
Section: Resultsmentioning
confidence: 99%
“…Furthermore, the chemical diversity of a group of molecules can be confirmed by comparing their molecular scaffolds (Bajorath, 2018). In this study, the chemical scaffold analysis was conducted using the Murcko scaffold (Long et al, 2023), which represents the only ligand and ring system remaining after removing all substituents. Derived from the Murcko scaffold, the carbon backbone is generated by converting all heteroatoms into carbon atoms and all bonds into single bonds (Bemis & Mark, 1996).…”
Section: Methodsmentioning
confidence: 99%