Contrastive self-supervised clustering of scRNA-seq data

Ciortan, Madalina; Defrance, Matthieu

doi:10.1186/s12859-021-04210-8

Cited by 48 publications

(42 citation statements)

References 45 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Momentum contrastive self-supervised learning achieved comparable performance in visual representation learning of images as compared with supervised representation learning ( Chen et al., 2020b ; He et al., 2019 ). As compared with a similar method proposed by Ciotran and colleagues ( Ciortan and Defrance, 2021 ) used a network of 3 linear layers as feature encoder, we observed that Miscell achieved better performance ( Figure S6 ). This is probably because of better representation capacity of the feature encoder used by Miscell.…”

Section: Discussionmentioning

confidence: 53%

Miscell: An efficient self-supervised learning approach for dissecting single-cell transcriptome

Shen

Feng

et al. 2021

iScience

View full text Add to dashboard Cite

Summary We developed Miscell, a self-supervised learning approach with deep neural network as latent feature encoder for mining information from single-cell transcriptomes. We demonstrated the capability of Miscell with canonical single-cell analysis tasks including delineation of single-cell clusters and identification of cluster-specific marker genes. We evaluated Miscell along with three state-of-the-art methods on three heterogeneous datasets. Miscell achieved at least comparable or better performance than the other methods by significant margin on a variety of clustering metrics such as adjusted rand index, normalized mutual information, and V -measure score. Miscell can identify cell-type specific markers by quantifying the influence of genes on cell clusters via deep learning approach.

show abstract

Section: Discussionmentioning

confidence: 53%

Miscell: An efficient self-supervised learning approach for dissecting single-cell transcriptome

Shen

Feng

et al. 2021

iScience

View full text Add to dashboard Cite

show abstract

“…Most metrics, however, require the ground truth labelling, which were not available in this study. Besides, the clustering itself can be approached in many different ways, using the classical or the newly developed deep-learning based algorithms (Ciortan and Defrance, 2021). In this study, we only intended to fairly compare clustering results, obtained under identical conditions (same algorithm, grid search parameters, evaluation metrics, etc.)…”

Section: Appendix E Discussionmentioning

confidence: 99%

“…Finally, self-supervision has been successfully applied to cell segmentation, annotation and clustering (Lu et al, 2019;Santos-Pata et al, 2021). Most recently, a self-supervised contrastive learning framework has been proposed by Ciortan and Defrance (2021) to learn representations of scRNA-seq data. The authors follow the idea of SimCLR (Chen et al, 2020) and show state-of-the-art (SOTA) performance on clustering task.…”

Section: Appendix a Related Workmentioning

confidence: 99%

Comparing representations of biological data learned with different AI paradigms, augmenting and cropping strategies

Dmitrenko¹,

Masiero²,

Zamboni³

2022

Preprint

View full text Add to dashboard Cite

Recent advances in computer vision and robotics enabled automated large-scale biological image analysis. Various machine learning approaches have been successfully applied to phenotypic profiling. However, it remains unclear how they compare in terms of biological feature extraction. In this study, we propose a simple CNN architecture and implement 4 different representation learning approaches. We train 16 deep learning setups on the 770k cancer cell images dataset under identical conditions, using different augmenting and cropping strategies. We compare the learned representations by evaluating multiple metrics for each of three downstream tasks: i) distance-based similarity analysis of known drugs, ii) classification of drugs versus controls, iii) clustering within cell lines. We also compare training times and memory usage. Among all tested setups, multi-crops and random augmentations generally improved performance across tasks, as expected. Strikingly, selfsupervised (implicit contrastive learning) models showed competitive performance being up to 11 times faster to train. Self-supervised regularized learning required the most of memory and computation to deliver arguably the most informative features. We observe that no single combination of augmenting and cropping strategies consistently results in top performance across tasks and recommend prospective research directions.

show abstract

“…Following its success in computer vision, this strategy has been adopted in several applications in other research fields including classification of electrocardiograms 31 and clustering of scRNA-seq data. 32 It has been demonstrated that the development of modality-specific data augmentation is critical to the performance of models trained using contrastive learning.…”

Section: Introductionmentioning

confidence: 99%

Self-supervised clustering of mass spectrometry imaging data using contrastive learning

Bindu

Laskin

2022

Chem. Sci.

View full text Add to dashboard Cite

show abstract

Contrastive self-supervised clustering of scRNA-seq data

Cited by 48 publications

References 45 publications

Miscell: An efficient self-supervised learning approach for dissecting single-cell transcriptome

Miscell: An efficient self-supervised learning approach for dissecting single-cell transcriptome

Comparing representations of biological data learned with different AI paradigms, augmenting and cropping strategies

Self-supervised clustering of mass spectrometry imaging data using contrastive learning

Contact Info

Product

Resources

About