2020
DOI: 10.1186/s13321-020-00468-x
|View full text |Cite
|
Sign up to set email alerts
|

Structure–activity relationship-based chemical classification of highly imbalanced Tox21 datasets

Abstract: The specificity of toxicant-target biomolecule interactions lends to the very imbalanced nature of many toxicity datasets, causing poor performance in Structure–Activity Relationship (SAR)-based chemical classification. Undersampling and oversampling are representative techniques for handling such an imbalance challenge. However, removing inactive chemical compound instances from the majority class using an undersampling technique can result in information loss, whereas increasing active toxicant instances in … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
63
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 58 publications
(65 citation statements)
references
References 64 publications
1
63
1
Order By: Relevance
“…By MCC Test values our results for four Tox21 endpoints are 0.71, 0.63, 0.37, and 0.57 ( Table S3 ), what is higher than the corresponding values obtained by Abdelaziz et al 0.25, 0.08, 0.36, and 0.59 (respectively) [ 33 ], which are second the best overall results on the Tox21 Data Challenge. Moreover, MCC Test values obtained in this study are noticeably higher than in the study by Idakwo et al [ 34 ] (0.29, 016, 0.62, and 0.55) and for endpoint no. 3 in Uesawa et al [ 45 ] being 0.5 and 0.48 for two cases of dichotomization of toxicity of endpoint SR-MMP (Stress response panel - mitochondrial membrane potential).…”
Section: Resultscontrasting
confidence: 78%
See 4 more Smart Citations
“…By MCC Test values our results for four Tox21 endpoints are 0.71, 0.63, 0.37, and 0.57 ( Table S3 ), what is higher than the corresponding values obtained by Abdelaziz et al 0.25, 0.08, 0.36, and 0.59 (respectively) [ 33 ], which are second the best overall results on the Tox21 Data Challenge. Moreover, MCC Test values obtained in this study are noticeably higher than in the study by Idakwo et al [ 34 ] (0.29, 016, 0.62, and 0.55) and for endpoint no. 3 in Uesawa et al [ 45 ] being 0.5 and 0.48 for two cases of dichotomization of toxicity of endpoint SR-MMP (Stress response panel - mitochondrial membrane potential).…”
Section: Resultscontrasting
confidence: 78%
“…Even though most of the models in this study show a relatively low MCC, this is not uncommon in biological studies. A recent study by Idakwo et al [ 34 ] on the Tox21 data set, which became a popular data set for many QSAR and machine learning experiments [ 35 , 36 , 37 ], shows that some of the toxicological endpoints even when conducted on cell lines can have even negative values for MCC. It is therefore not unexpected that whole organism toxicity at low concentration ranges is hard to model given the MCC metrics which is expected to be more sensitive considering other often employed metrics, such as accuracy, BA, or real accuracy.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations