Identifying spatial domain by adapting transcriptomics with histology through contrastive learning

Zeng, Yuansong; Yin, Rui; Luo, Mai; Chen, Jianing; Pan, Zixiang; Lu, Yutong; Yu, Weijiang; Yang, Yuedong

doi:10.1093/bib/bbad048

Cited by 23 publications

(12 citation statements)

References 36 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…However, for some proteins, such as Abeta, expression may be disconnected on the spot level, especially in late stages of progressive diseases, potentially making it challenging for Proust to accurately identify all regions where the protein occurs, as individual spots may absorb dissimilar neighboring information during model training. To address this issue, a potential solution is to enhance the graph structure with weighted edges based on the similarity of spot-level protein information or to incorporate inter-modality contrastive learning to maximize the mutual information between gene expression and proteomics [23, 34]. We are also interested in exploring the use of statistical inference here as a way to explore the uncertainty of the predicted spatial domains.…”

Section: Discussionmentioning

confidence: 99%

“…A second approach is to continue with only one omic data modality, but to incorporate spatial information to account for the correlation of molecular information between the spatial coordinates. Some examples of these methods include (i) unsupervised learning approaches (BayesSpace [18], Giotto [19], STAGATE [20], CCST [21]) and (ii) self-supervised learning approaches (GraphST [15], SpaceFlow [22], ConGI [23], CAST [24]). In particular, the methods using contrastive self-supervised learning aim to maximize the similarity between adjacent spatial coordinates and dissimilarity between non-adjacent spatial coordinates, while also showing great promise in their ability to detect discrete spatial domains using only one data modality.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Spatial domain detection using contrastive self-supervised learning for spatial multi-omics technologies

Yao,

Yu,

Caffo

et al. 2024

Preprint

View full text Add to dashboard Cite

Recent advances in spatially-resolved single-omics and multi-omics technologies have led to the emergence of computational tools to detect or predict spatial domains. Additionally, histological images and immunofluorescence (IF) staining of proteins and cell types provide multiple perspectives and a more complete understanding of tissue architecture. Here, we introduce Proust, a scalable tool to predict discrete domains using spatial multi-omics data by combining the low-dimensional representation of biological profiles based on graph-based contrastive self-supervised learning. Our scalable method integrates multiple data modalities, such as RNA, protein, and H&E images, and predicts spatial domains within tissue samples. Through the integration of multiple modalities, Proust consistently demonstrates enhanced accuracy in detecting spatial domains, as evidenced across various benchmark datasets and technological platforms.

show abstract

Section: Discussionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Spatial domain detection using contrastive self-supervised learning for spatial multi-omics technologies

Yao,

Yu,

Caffo

et al. 2024

Preprint

View full text Add to dashboard Cite

show abstract

“…Although methods using deep learning-based models have been developed to decipher spatial domains by combining histological images with gene expression data (Pham et al . 2020; Zeng et al . 2023), several potential drawbacks exist.…”

Section: Discussionmentioning

confidence: 99%

Uncover spatially informed shared variations for single-cell spatial transcriptomics with STew

Guo,

Vargas,

Fritz

et al. 2023

Preprint

View full text Add to dashboard Cite

Motivation: The recent spatial transcriptomics (ST) technologies have enabled characterization of gene expression patterns and spatial information, advancing our understanding of cell lineages within diseased tissues. Several analytical approaches have been proposed for ST data, but effectively utilizing spatial information to unveil the shared variation with gene expression remains a challenge. Results: We introduce STew, a Spatial Transcriptomic multi-viEW representative learning method, to jointly analyze spatial information and gene expression in a scalable manner, followed by a data-driven statistical framework to measure the goodness of model fit. Through benchmarking using Human DLPFC data with true manual annotations, STew achieved superior performance in both clustering accuracy and continuity of identified spatial domains compared with other methods. STew is also robust to generate consistent results insensitive to model parameters, including sparsity constraints. We next applied STew to various ST data acquired from 10x Visium and Slide-seqV2, encompassing samples from both mouse and human brain, which revealed spatially informed cell type clusters. We further identified a pro-inflammatory fibroblast spatial niche using ST data from psoriatic skins. Hence, STew is a generalized method to identify both spatially informed clusters and disease-relevant niches in complex tissues. Availability: Source code and the R software tool STew are available from github.com/fanzhanglab/STew.

show abstract

“…A contrastive cross-modal model encoding image patches and spatial transcriptomic profiles was created, similar to the model implemented by Zeng et al 21 . Input images patches of size 224x224 were encoded into embeddings of size 512 units, using the feature extraction portion of a CNN initialized with weights initialized from the ResNet model trained by Ciga et al Spatial transcriptomics profiles containing expression of the most spatially variable 1000 genes across Visium slides, selected to avoid overfitting on genes with imprecise expression, were encoded with three standard fully connected (FC) layers of size 512.…”

Section: Methodsmentioning

confidence: 99%

Assessment of Emerging Pretraining Strategies in Interpretable Multimodal Deep Learning for Cancer Prognostication

Azher¹,

Suvarna²,

Chen

et al. 2022

Preprint

View full text Add to dashboard Cite

Deep learning models have demonstrated the remarkable ability to infer cancer patient prognosis from molecular and anatomic pathology information. Studies in recent years have demonstrated that leveraging information from complementary multimodal data can improve prognostication, further illustrating the potential utility of such methods. Model interpretation is crucial for facilitating the clinical adoption of deep learning methods by fostering practitioner understanding and trust in the technology. However, while prior works have presented novel multimodal neural network architectures as means to improve prognostication performance, these approaches: 1) do not comprehensively leverage biological and histomorphological relationships and 2) make use of emerging strategies to "pretrain" models (i.e., train models on a slightly orthogonal dataset/modeling objective) which may aid prognostication by reducing the amount of information required for achieving optimal performance. Here, we develop an interpretable multimodal modeling framework that combines DNA methylation, gene expression, and histopathology (i.e., tissue slides) data, and we compare the performances of crossmodal pretraining, contrastive learning, and transfer learning versus the standard procedure in this context. Our models outperform the existing state-of-the-art method (average 11.54% C-index increase), and baseline clinically driven models. Our results demonstrate that the selection of pretraining strategies is crucial for obtaining highly accurate prognostication models, even more so than devising an innovative model architecture, and further emphasize the all-important role of the tumor microenvironment on disease progression.

show abstract

Identifying spatial domain by adapting transcriptomics with histology through contrastive learning

Cited by 23 publications

References 36 publications

Spatial domain detection using contrastive self-supervised learning for spatial multi-omics technologies

Spatial domain detection using contrastive self-supervised learning for spatial multi-omics technologies

Uncover spatially informed shared variations for single-cell spatial transcriptomics with STew

Assessment of Emerging Pretraining Strategies in Interpretable Multimodal Deep Learning for Cancer Prognostication

Contact Info

Product

Resources

About