2020
DOI: 10.1101/2020.06.11.146845
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

SCIM: Universal Single-Cell Matching with Unpaired Feature Sets

Abstract: Motivation: Recent technological advances have led to an increase in the production and availability of single-cell data. The ability to integrate a set of multi-technology measurements would allow the identification of biologically or clinically meaningful observations through the unification of the perspectives afforded by each technology. In most cases, however, profiling technologies consume the used cells and thus pairwise correspondences between datasets are lost. Due to the sheer size single-cell datase… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 29 publications
0
4
0
Order By: Relevance
“…For each time point and line separately, we performed Canonical Correlation Analysis (CCA) on gene activities and gene expression data using the Seurat function based on 2000 features, which were selected using the Seurat function . In CCA space, we performed minimum-cost maximum-flow (MCMF) bipartite matching between the modalities as described (20) (https://github.com/ratschlab/scim). The function was used with , , and otherwise default parameters.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…For each time point and line separately, we performed Canonical Correlation Analysis (CCA) on gene activities and gene expression data using the Seurat function based on 2000 features, which were selected using the Seurat function . In CCA space, we performed minimum-cost maximum-flow (MCMF) bipartite matching between the modalities as described (20) (https://github.com/ratschlab/scim). The function was used with , , and otherwise default parameters.…”
Section: Methodsmentioning
confidence: 99%
“…S1, B to D). We constructed 'multi-omic metacells' containing information on both transcriptome and chromatin accessibility using minimum-cost, maximum-flow (MCMF) bipartite matching (20) within the CCA space (Fig. 1A).…”
Section: Single-cell Multiomic Reconstruction Of Cerebral Organoid Developmentmentioning
confidence: 99%
“…While features sets might not be easily comparable, the pathway structure underlying these feature sets are shared. This indicates that pathway-factorized latent representations, like those learned by pmVAE, could be used more easily integrate [5,9,28,43] these technologies.…”
Section: Discussionmentioning
confidence: 99%
“…Apart from the feature-converted data, Seurat v3 15 and bindSC 30 also devised heuristic strategies to utilize information in the original feature space, which probably explains their improved performance than methods that do not 16, 17 . At the cell level, known cell types have also been used via (semi-)supervised learning 47, 48 , but this approach incurs substantial limitations in terms of applicability since such supervision is typically unavailable and in many cases serves as the purpose of multi-omics integration per se 26 . Notably, one of these methods was proposed with a similar autoencoder architecture and adversarial alignment 48 , but it relied on matched cell types or clusters to orient the alignment.…”
Section: Discussionmentioning
confidence: 99%