2021
DOI: 10.1093/bioinformatics/btab769
|View full text |Cite
|
Sign up to set email alerts
|

Explainable multimodal machine learning model for classifying pregnancy drug safety

Abstract: Motivation Teratogenic drugs can cause severe fetal malformation and therefore have critical impact on the health of the fetus, yet the teratogenic risks are unknown for most approved drugs. This article proposes an explainable machine learning model for classifying pregnancy drug safety based on multimodal data and suggests an orthogonal ensemble for modeling multimodal data. To train the proposed model, we created a set of labeled drugs by processing over 100 000 textual responses collected… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6

Relationship

1
5

Authors

Journals

citations
Cited by 11 publications
(14 citation statements)
references
References 29 publications
0
14
0
Order By: Relevance
“…Under stratified 10-fold cross-validation, our final model achieved Matthew's correlation coefficient (MCC) of 0.80, overall accuracy of 91%, sensitivity of 0.91, and specificity of 0.93, outperforming alternative proposed approaches 1,17 (Table 2).…”
Section: ■ Results and Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Under stratified 10-fold cross-validation, our final model achieved Matthew's correlation coefficient (MCC) of 0.80, overall accuracy of 91%, sensitivity of 0.91, and specificity of 0.93, outperforming alternative proposed approaches 1,17 (Table 2).…”
Section: ■ Results and Discussionmentioning
confidence: 99%
“…Teratogenic drugs lead to malformation of the fetus and are strongly associated with lifelong physical and mental disabilities. 1 Significant effort, therefore, is invested in trying to avoid exposure to potentially teratogenic drugs during pregnancy. This was particularly brought to the public's attention in the 1960s, when the drug Thalidomide, approved as a sedative for treating nausea and vomiting during pregnancy, was found to have led to severe birth defects and deaths.…”
Section: ■ Introductionmentioning
confidence: 99%
“…The modalities we evaluated are the different biomedical text embeddings, the drug’s molecular structure [ 29 ], and the DrugBank drug category. DrugBank’s drug category combines the drug’s ATC codes of different levels, and other manually curated drug information [ 30 ]. The DrugBank category is considered the most informative modality for drug safety classification, but it is not available during the early stages of a drug’s development.…”
Section: Resultsmentioning
confidence: 99%
“…The AUC and AUPR metrics, using 10-fold cross-validation were used for the comparison. The code published by [ 30 ] for feature engineering and model generation was used.…”
Section: Resultsmentioning
confidence: 99%
“…We constructed an ensemble supervised machine learning model based on the ‘stacking’ method, which refers to fitting multiple machine learning models on the same dataset and using secondary modeling to learn how to best combine their predictions ( Shtar et al., 2021 ). A single sub-model is called a first-level learner, while the combined model is called a second-level learner.…”
Section: Methodsmentioning
confidence: 99%