2022
DOI: 10.1038/s43588-022-00251-y
|View full text |Cite
|
Sign up to set email alerts
|

Adversarial domain translation networks for integrating large-scale atlas-level single-cell datasets

Abstract: The rapid emergence of large-scale atlas-level single-cell RNA-seq datasets presents remarkable opportunities for broad and deep biological investigations through integrative analyses. However, harmonizing such datasets requires integration approaches to be not only computationally scalable, but also capable of preserving a wide range of fine-grained cell populations. We created Portal, a unified framework of adversarial domain translation to learn harmonized representations of datasets. With innovation in mod… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 25 publications
(3 citation statements)
references
References 71 publications
0
3
0
Order By: Relevance
“…Therefore, we added Harmony 37 , which was demonstrated to have the best performance and shortest running time in previous batch-effect removal benchmark study in scRNA-seq data 38 . In the meantime, we also included Portal 39 , a recently published integration method which has not been benchmarked and can take gene scores as input. After performing the integration between reference and target datasets, we apply MLP as classifier to transfer cell labels according to the integrated output (details in Methods section).…”
Section: Resultsmentioning
confidence: 99%
“…Therefore, we added Harmony 37 , which was demonstrated to have the best performance and shortest running time in previous batch-effect removal benchmark study in scRNA-seq data 38 . In the meantime, we also included Portal 39 , a recently published integration method which has not been benchmarked and can take gene scores as input. After performing the integration between reference and target datasets, we apply MLP as classifier to transfer cell labels according to the integrated output (details in Methods section).…”
Section: Resultsmentioning
confidence: 99%
“…By effectively designing the shared latent space and accounting for slice-specific and gene-specific effects, STitch3D is able to preserve biologically meaningful information that varies along the z-axis, while also removing batch effects across slices. Additionally, it has been shown by previous studies that integrating more data improves statistical power in single-cell data analysis [40,41,42,43]. Here, as a large amount of information is shared across slices, such as similar gene expression levels in adjacent slices, STitch3D's joint analysis of multiple slices also improves accuracy.…”
Section: Discussionmentioning
confidence: 98%
“…Cells that were deemed low quality (i.e., low in transcript and/or gene count) and technical doublets were removed from downstream analysis. Next, to ensure cell types are comparable across species, we applied Portal 135 to integrate data (one-to-one orthologs only) from different species. Portal projects data into a space that minimizes species differences, from which an integrated UMAP is generated to visualize cell clustering from different species.…”
Section: Methodsmentioning
confidence: 99%