2024
DOI: 10.1101/2024.01.27.577455
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Parameter-Efficient Fine-Tuning Enhances Adaptation of Single Cell Large Language Model for Cell Type Identification

Fei He,
Ruixin Fei,
Mingyue Gao
et al.

Abstract: Single-cell sequencing transformed biology and medicine, providing an unprecedented high-resolution view at the cellular level. However, the vast variability inherent in single-cell sequencing data impedes its utility for in-depth downstream analysis. Inspired by the foundation models in natural language processing, recent advancements have led to the development of single-cell Large Language Models (scLLMs). These models are designed to discern universal patterns across diverse single-cell datasets, thereby e… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
4
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 31 publications
0
4
0
Order By: Relevance
“…Unsupervised clustering methods like Louvain produce inconsistent results based on resolution, UMAP dimensionality, or programming language (R vs. Python) 38 . Finding optimal parameters is time-consuming and often suboptimal 24 . In contrast, TACIT automates cell type annotation, mimicking manual gating with enhanced scalability and precision.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Unsupervised clustering methods like Louvain produce inconsistent results based on resolution, UMAP dimensionality, or programming language (R vs. Python) 38 . Finding optimal parameters is time-consuming and often suboptimal 24 . In contrast, TACIT automates cell type annotation, mimicking manual gating with enhanced scalability and precision.…”
Section: Discussionmentioning
confidence: 99%
“…Even with extensive parameter tuning combined with multi-step clustering to identify cell populations of interest, the desired results remain elusive 21,22 . Deep learning algorithms are increasingly utilized in spatial omics for cell type identification, but they require comprehensive and diverse training data to improve their accuracy and applicability to handle the complexities of spatial multiomics 23,24 .…”
Section: Introductionmentioning
confidence: 99%
“…Even with extensive parameter tuning combined with multi-step clustering to identify cell populations of interest, the desired results remain elusive 21,22 . Deep learning algorithms are increasingly utilized in spatial ‘omics for cell type identification, but it requires comprehensive and diverse training data to improve the accuracy and applicability of deep learning models in handling the complexities of spatial multiomics 23,24 .…”
Section: Introductionmentioning
confidence: 99%
“…The diversity and complexity of these tasks are useful to thoroughly probe the model's performance and to evaluate the robustness of the learned representation and the model's ability to generalize to complex predictive tasks. Current results are promising but not entirely replicated in independent benchmarks [45][46][47][48][49][50] . Notably, to date, none of these models account for spatial relationships of cells during training, with the exception of CellPLM 40 , which, however, is trained on a limited dataset of 9 million dissociated and 2 million spatial transcriptomics cells 40 and not fine-tuned on spatial tasks beyond gene imputation.We propose Nicheformer, a novel spatial omics foundation model to understand tissue dependencies.…”
mentioning
confidence: 99%