Prediction of Compound Cytotoxicity Based on Compound Structures and Cell Line Molecular Characteristics

Nakano, Takashi; Brown, J.

doi:10.2751/jcac.21.1

Cited by 5 publications

(8 citation statements)

References 14 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Previous models in cytotoxicity predictions have used many machine learning approaches such as Random Forests, 20,21 Bayesian learning, 22 and deep learning; 23 these have been trained on features such as physicochemical descriptors and molecular fingerprints as well as cell line descriptors of mRNA expression data. 24 Previously, assays developed based on morphology screens have been shown to identify similar sets of compounds compared to a standard cytotoxicity assay. 25 In recent years, models have started to also use other highdimensional readouts for toxicity prediction, such as the combination between molecular fingerprints and gene expression data.…”

Section: ■ Introductionmentioning

confidence: 99%

“…Such deviations could arise because the in vitro dose may be completely irrelevant to target organ exposure in vivo (among other possible factors). Previous models in cytotoxicity predictions have used many machine learning approaches such as Random Forests, , Bayesian learning, and deep learning; these have been trained on features such as physicochemical descriptors and molecular fingerprints as well as cell line descriptors of mRNA expression data . Previously, assays developed based on morphology screens have been shown to identify similar sets of compounds compared to a standard cytotoxicity assay .…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Comparison of Cellular Morphological Descriptors and Molecular Fingerprints for the Prediction of Cytotoxicity- and Proliferation-Related Assays

Seal

Yang

Vollmers

et al. 2021

Chem. Res. Toxicol.

View full text Add to dashboard Cite

Cell morphology features, such as those from the Cell Painting assay, can be generated at relatively low costs and represent versatile biological descriptors of a system and thereby compound response. In this study, we explored cell morphology descriptors and molecular fingerprints, separately and in combination, for the prediction of cytotoxicity- and proliferation-related in vitro assay endpoints. We selected 135 compounds from the MoleculeNet ToxCast benchmark data set which were annotated with Cell Painting readouts, where the relatively small size of the data set is due to the overlap of required annotations. We trained Random Forest classification models using nested cross-validation and Cell Painting descriptors, Morgan and ErG fingerprints, and their combinations. While using leave-one-cluster-out cross-validation (with clusters based on physicochemical descriptors), models using Cell Painting descriptors achieved higher average performance over all assays (Balanced Accuracy of 0.65, Matthews Correlation Coefficient of 0.28, and AUC-ROC of 0.71) compared to models using ErG fingerprints (BA 0.55, MCC 0.09, and AUC-ROC 0.60) and Morgan fingerprints alone (BA 0.54, MCC 0.06, and AUC-ROC 0.56). While using random shuffle splits, the combination of Cell Painting descriptors with ErG and Morgan fingerprints further improved balanced accuracy on average by 8.9% (in 9 out of 12 assays) and 23.4% (in 8 out of 12 assays) compared to using only ErG and Morgan fingerprints, respectively. Regarding feature importance, Cell Painting descriptors related to nuclei texture, granularity of cells, and cytoplasm as well as cell neighbors and radial distributions were identified to be most contributing, which is plausible given the endpoint considered. We conclude that cell morphological descriptors contain complementary information to molecular fingerprints which can be used to improve the performance of predictive cytotoxicity models, in particular in areas of novel structural space.

show abstract

Section: ■ Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Comparison of Cellular Morphological Descriptors and Molecular Fingerprints for the Prediction of Cytotoxicity- and Proliferation-Related Assays

Seal

Yang

Vollmers

et al. 2021

Chem. Res. Toxicol.

View full text Add to dashboard Cite

show abstract

“…This makes it a powerful approach to develop models that are able to predict drug synergy based on drug combination screening experiments and other relevant data. Several ML models for drug synergy prediction have been described in the literature [11,[17][18][19][20][21]. Many of these studies used tree-based ML methods, such as random forests (RFs) [17,18,20] or gradient boosting [18,19,21].…”

Section: Introductionmentioning

confidence: 99%

A systematic evaluation of deep learning methods for the prediction of drug synergy in cancer

2023

View full text Add to dashboard Cite

One of the main obstacles to the successful treatment of cancer is the phenomenon of drug resistance. A common strategy to overcome resistance is the use of combination therapies. However, the space of possibilities is huge and efficient search strategies are required. Machine Learning (ML) can be a useful tool for the discovery of novel, clinically relevant anti-cancer drug combinations. In particular, deep learning (DL) has become a popular choice for modeling drug combination effects. Here, we set out to examine the impact of different methodological choices on the performance of multimodal DL-based drug synergy prediction methods, including the use of different input data types, preprocessing steps and model architectures. Focusing on the NCI ALMANAC dataset, we found that feature selection based on prior biological knowledge has a positive impact—limiting gene expression data to cancer or drug response-specific genes improved performance. Drug features appeared to be more predictive of drug response, with a 41% increase in coefficient of determination (R2) and 26% increase in Spearman correlation relative to a baseline model that used only cell line and drug identifiers. Molecular fingerprint-based drug representations performed slightly better than learned representations—ECFP4 fingerprints increased R2 by 5.3% and Spearman correlation by 2.8% w.r.t the best learned representations. In general, fully connected feature-encoding subnetworks outperformed other architectures. DL outperformed other ML methods by more than 35% (R2) and 14% (Spearman). Additionally, an ensemble combining the top DL and ML models improved performance by about 6.5% (R2) and 4% (Spearman). Using a state-of-the-art interpretability method, we showed that DL models can learn to associate drug and cell line features with drug response in a biologically meaningful way. The strategies explored in this study will help to improve the development of computational methods for the rational design of effective drug combinations for cancer therapy.

show abstract

“…Several ML models for drug synergy prediction have been described in the literature [8,[11][12][13][14][15]. Many of these studies used tree-based ML methods, such as random forests (RFs) [11,12,14] or gradient boosting [12,13,15].…”

Section: Introductionmentioning

confidence: 99%

“…This makes it a powerful 2/22 approach to develop models that are able to predict drug synergy based on drug combination screening experiments and other relevant data. Several ML models for drug synergy prediction have been described in the literature [8,[11][12][13][14][15]. Many of these studies used tree-based ML methods, such as random forests (RFs) [11,12,14] or gradient boosting [12,13,15].…”

Section: Introductionmentioning

confidence: 99%

A systematic evaluation of deep learning methods for the prediction of drug synergy in cancer

Baptista

Ferreira

Rocha

2022

Preprint

View full text Add to dashboard Cite

One of the main obstacles to the successful treatment of cancer is the phenomenon of drug resistance. A common strategy to overcome resistance is the use of combination therapies. However, the space of possibilities is huge and efficient search strategies are required. Machine Learning (ML) can be a useful tool for the discovery of novel, clinically relevant anti-cancer drug combinations. In particular, deep learning (DL) has become a popular choice for modeling drug combination effects. Here, we set out to examine the impact of different methodological choices on the performance of multimodal DL-based drug synergy prediction methods, including the use of different input data types, preprocessing steps and model architectures. Focusing on the NCI ALMANAC dataset, we found that feature selection based on prior biological knowledge has a positive impact on performance. Drug features appeared to be more predictive of drug response. Molecular fingerprint-based drug representations performed slightly better than learned representations, and gene expression data of cancer or drug response-specific genes also improved performance. In general, fully connected feature-encoding subnetworks outperformed other architectures, with DL outperforming other ML methods. Using a state-of-the-art interpretability method, we showed that DL models can learn to associate drug and cell line features with drug response in a biologically meaningful way. The strategies explored in this study will help to improve the development of computational methods for the rational design of effective drug combinations for cancer therapy.Author summaryCancer therapies often fail because tumor cells become resistant to treatment. One way to overcome resistance is by treating patients with a combination of two or more drugs. Some combinations may be more effective than when considering individual drug effects, a phenomenon called drug synergy. Computational drug synergy prediction methods can help to identify new, clinically relevant drug combinations. In this study, we developed several deep learning models for drug synergy prediction. We examined the effect of using different types of deep learning architectures, and different ways of representing drugs and cancer cell lines. We explored the use of biological prior knowledge to select relevant cell line features, and also tested data-driven feature reduction methods. We tested both precomputed drug features and deep learning methods that can directly learn features from raw representations of molecules. We also evaluated whether including genomic features, in addition to gene expression data, improves the predictive performance of the models. Through these experiments, we were able to identify strategies that will help guide the development of new deep learning models for drug synergy prediction in the future.

show abstract

Prediction of Compound Cytotoxicity Based on Compound Structures and Cell Line Molecular Characteristics

Cited by 5 publications

References 14 publications

Comparison of Cellular Morphological Descriptors and Molecular Fingerprints for the Prediction of Cytotoxicity- and Proliferation-Related Assays

Comparison of Cellular Morphological Descriptors and Molecular Fingerprints for the Prediction of Cytotoxicity- and Proliferation-Related Assays

A systematic evaluation of deep learning methods for the prediction of drug synergy in cancer

A systematic evaluation of deep learning methods for the prediction of drug synergy in cancer

Contact Info

Product

Resources

About