2022
DOI: 10.1101/gr.276868.122
|View full text |Cite
|
Sign up to set email alerts
|

Cross-species cell-type assignment from single-cell RNA-seq data by a heterogeneous graph neural network

Abstract: Cross-species comparative analyses of single-cell RNA sequencing (scRNA-seq) data allow us to explore, at single-cell resolution, the origins of cellular diversity and evolutionary mechanisms that shape cellular form and function. Cell-type assignment is a crucial step to achieve that. However, the poorly annotated genome and limited known biomarkers hinder us from assigning cell identities for nonmodel species. Here, we design a heterogeneous graph neural network model CAME to learn aligned and interpretable … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
11
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 25 publications
(11 citation statements)
references
References 68 publications
0
11
0
Order By: Relevance
“…To explore the capability of GeneCompass on cross-species downstream tasks, we integrated GeneCompass with the SOTA method CAME 28 for cross-species cell type annotation. Gene embeddings generated by GeneCompass are utilized as initial gene-nodes features within CAME.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…To explore the capability of GeneCompass on cross-species downstream tasks, we integrated GeneCompass with the SOTA method CAME 28 for cross-species cell type annotation. Gene embeddings generated by GeneCompass are utilized as initial gene-nodes features within CAME.…”
Section: Resultsmentioning
confidence: 99%
“…We aim to perform integration and cell-type assignment while preserving biological variability by utilizing the universal gene embeddings from our generative pre-trained model. We apply GeneCompass to CAME 28 which is a heterogeneous graph neural network called GeneCompass-CAME, where cells and genes are modeled as heterogeneous nodes. Also, like CAME, we create the heterogeneous graph with six heterogeneous types: ‘cell to gene’, ‘gene to cell’, ‘cell to cell’, ‘gene to gene’, ‘cell self-loop’, ‘gene self-loop’, where we denote the corresponding weights (shared across species) as W cg , W gc , W cc , W gg , W c and W g , respectively.…”
Section: Methodsmentioning
confidence: 99%
“…However, some neuronal subtypes, such as MGEderived INs, are transcriptomically more conserved across evolution than other primary neurons, including cortical PNs [13,44]. In the future, these IN subtypes could be used as a way to validate SIMS to perform trans-species predictions [96]. Additional modifications, such as gene module extraction could provide increased accuracy for label transfer, as meta-modules could prove to be more conserved between evolutionary distant species than gene orthologs [92,97,98].…”
Section: Discussionmentioning
confidence: 99%
“…There are cases when two orthogonal graphs are being estimated. One such case is the comparison of cells between species (Liu et al 2023 ); the traditional Euclidean distance between cells is problematic because it is not clear which genes in species A should be compared to which genes in species B. However, assuming that the cells are lined up correctly, and with some knowledge of homology (based on gene sequences), it is possible to find which genes correlate and thus correspond.…”
Section: Nonlinear Models and Neural Networkmentioning
confidence: 99%