“…We aim to perform integration and cell-type assignment while preserving biological variability by utilizing the universal gene embeddings from our generative pre-trained model. We apply GeneCompass to CAME 28 which is a heterogeneous graph neural network called GeneCompass-CAME, where cells and genes are modeled as heterogeneous nodes. Also, like CAME, we create the heterogeneous graph with six heterogeneous types: ‘cell to gene’, ‘gene to cell’, ‘cell to cell’, ‘gene to gene’, ‘cell self-loop’, ‘gene self-loop’, where we denote the corresponding weights (shared across species) as W cg , W gc , W cc , W gg , W c and W g , respectively.…”