2019
DOI: 10.1101/791947
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A Joint Model of RNA Expression and Surface Protein Abundance in Single Cells

Abstract: Cellular indexing of transcriptomes and epitopes by sequencing (CITE-seq) combines unbiased single-cell transcriptome measurements with surface protein quantification comparable to flow cytometry, the gold standard for cell type identification. However, current analysis pipelines cannot address the two primary challenges of CITE-seq data: combining both modalities in a shared latent space that harnesses the power of the paired measurements, and handling the technical artifacts of the protein measurement, which… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
13
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(13 citation statements)
references
References 21 publications
0
13
0
Order By: Relevance
“…In practice, users can choose more complex inference models if it is warranted for certain annotation types. Moreover, we expect prediction results to improve with more accurate and reproducible annotation methods, such as consistent cell type taxonomies provided by the Cell Ontology 48 project and better modeling of multimodal expression data 17 .…”
Section: Discussionmentioning
confidence: 99%
“…In practice, users can choose more complex inference models if it is warranted for certain annotation types. Moreover, we expect prediction results to improve with more accurate and reproducible annotation methods, such as consistent cell type taxonomies provided by the Cell Ontology 48 project and better modeling of multimodal expression data 17 .…”
Section: Discussionmentioning
confidence: 99%
“…We next benchmarked WNN analysis against two recently introduced methods for multimodal integration: Multi-omics factor analysis v2 (MOFA+) [25], which uses a statistical framework based on factor analysis, and totalVI [26], which combines deep neural networks with a hierarchical Bayesian model. Both methods integrate the modalities into a latent space, which we used to construct an integrated k-NN graph and a 2D UMAP visualization.…”
Section: Wnn Analysis Is a Robust And Flexible Approach For Multimodamentioning
confidence: 99%
“…These mixture models were fitted to the counts for each protein, while we used empty droplets to account for protein-specific background and mixture models to fit counts from all proteins within each droplet/cell to infer the technical component reflective of library size (Figs 1B-C). Another recent method defined protein expression as a mixture of biological cell statedependent foreground and noise-associated background 10 . It uses variational inference to learn the parameters of a probabilistic model that incorporates both latent cell-state variables and noise/technical factors.…”
Section: Discussionmentioning
confidence: 99%