Large Language Models (LLMs) have made significant strides in both industrial and scientific domains. In this paper, we evaluate the performance of LLMs for single-cell sequencing data analysis through comprehensive experiments across eight downstream tasks pertinent to single-cell data. By comparing seven different single-cell LLMs with task-specific methods, we found that single-cell LLMs may not consistently outperform task-specific methods across all tasks. However, the emergent abilities of LLMs and their successful applications in cross-species and cross-modality transfer learning are promising. In addition, we present a systematic evaluation of the effects of hyper-parameters, initial settings, and training stability for single-cell LLMs based on our proposed scEval framework, and provide guidelines for pre-training and fine-tuning. Our work summarizes the current state of single-cell LLMs, and points out their constraints and avenues for future development.