Prompt Waywardness: The Curious Case of Discretized Interpretation of Continuous Prompts

Khashabi, Daniel; Lyu, Shane; Min, Sewon; Qin, Lianhui; Richardson, Kyle; Singh, Sameer; Welleck, Sean; Hajishirzi, Hannaneh; Khot, Tushar; Sabharwal, Ashish; Choi, Yejin

doi:10.48550/arxiv.2112.08348

Cited by 7 publications

(12 citation statements)

References 10 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Last, our interpretation of FFN outputs as updates to the output distribution relates to recent works that interpreted groups of LM parameters in the discrete vocabulary space (Geva et al, 2021;Khashabi et al, 2021), or viewed the representation as an information stream (Elhage et al, 2021).…”

Section: Related Workmentioning

confidence: 90%

Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space

Geva¹,

Caciularu²,

Wang³

et al. 2022

Preprint

View full text Add to dashboard Cite

Transformer-based language models (LMs) are at the core of modern NLP, but their internal prediction construction process is opaque and largely not understood. In this work, we make a substantial step towards unveiling this underlying prediction process, by reverseengineering the operation of the feed-forward network (FFN) layers, one of the building blocks of transformer models. We view the token representation as a changing distribution over the vocabulary, and the output from each FFN layer as an additive update to that distribution. Then, we analyze the FFN updates in the vocabulary space, showing that each update can be decomposed to sub-updates corresponding to single FFN parameter vectors, each promoting concepts that are often humaninterpretable. We then leverage these findings for controlling LM predictions, where we reduce the toxicity of GPT2 by almost 50%, and for improving computation efficiency with a simple early exit rule, saving 20% of computation on average. 1 * Equal contribution. † Work done during an internship at AI2. 1 https://github.com/aviclu/ffn-values.

show abstract

Section: Related Workmentioning

confidence: 90%

Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space

Geva¹,

Caciularu²,

Wang³

et al. 2022

Preprint

View full text Add to dashboard Cite

show abstract

“…Given optimization difficulties when training prompt embeddings, Diao et al [57] recently used black-box optimization to train prompt embeddings without requiring gradients. Several works have analyzed prompt tuning from the perspective of interpretability Khashabi et al [58] and its similarity to other PEFT methods He et al [29]. Prompt tuning has been applied to various applications for NLP including continual learning [59], model robustness [60,61], summarization [62], machine translation [63], co-training [64], probing language models [65,65], inverse prompting [66], and transfer learning [67].…”

Section: Related Workmentioning

confidence: 99%

Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning

Liu¹,

Tam²,

Muqeeth³

et al. 2022

Preprint

View full text Add to dashboard Cite

Few-shot in-context learning (ICL) enables pre-trained language models to perform a previously-unseen task without any gradient-based training by feeding a small number of training examples as part of the input. ICL incurs substantial computational, memory, and storage costs because it involves processing all of the training examples every time a prediction is made. Parameter-efficient fine-tuning (e.g. adapter modules, prompt tuning, sparse update methods, etc.) offers an alternative paradigm where a small set of parameters are trained to enable a model to perform the new task. In this paper, we rigorously compare few-shot ICL and parameter-efficient fine-tuning and demonstrate that the latter offers better accuracy as well as dramatically lower computational costs. Along the way, we introduce a new parameter-efficient fine-tuning method called (IA) 3 that scales activations by learned vectors, attaining stronger performance while only introducing a relatively tiny amount of new parameters. We also propose a simple recipe based on the T0 model [1] called T-Few that can be applied to new tasks without task-specific tuning or modifications. We validate the effectiveness of T-Few on completely unseen tasks by applying it to the RAFT benchmark [2], attaining super-human performance for the first time and outperforming the state-of-the-art by 6% absolute. All of the code used in our experiments is publicly available. 1 * Equal contribution. 1 https://github.com/r-three/t-few Preprint. Under review.

show abstract

“…Recent work (Brown et al, 2020;Jiang et al, 2020;Khashabi et al, 2021;Gao et al, 2021) shows it's possible to combine discrete text prompt z with input x to directly perform various NLP tasks using a pre-trained LM's generative distribution P LM (y|z, x), without needing to fine-tune the model. For instance, in classification, the LM can be a masked language model (MLM) such as BERT (Devlin et al, 2019), and y is the class-label token (a.k.a.…”

Section: The Discrete Prompt Optimization Problemmentioning

confidence: 99%

“…One of the most popular schemes of prompt optimization is to tune soft prompts (i.e., continuous embedding vectors) as they are amenable to gradient descent Li and Liang, 2021;Vu et al, 2021;Gu et al, 2021;Liu et al, 2021d;Mokady et al, 2021;An et al, 2022, etc.). However, the resulting continuous embedding learned with an LM is, by its nature, hard for humans to understand (Khashabi et al, 2021;Hambardzumyan et al, 2021;Mokady et al, 2021) and incompatible for use with other LMs. Besides, the required LM internal gradients are often expensive to compute, or simply unavailable for LMs deployed with only inference APIs (e.g., .…”

Section: Introductionmentioning

confidence: 99%

RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning

Deng¹,

Wang²,

Hsieh³

et al. 2022

Preprint

View full text Add to dashboard Cite

Prompting has shown impressive success in enabling large pretrained language models (LMs) to perform diverse NLP tasks, especially when only few downstream data are available. Automatically finding the optimal prompt for each task, however, is challenging. Most existing work resorts to tuning soft prompt (e.g., embeddings) which falls short of interpretability, reusability across LMs, and applicability when gradients are not accessible. Discrete prompt, on the other hand, is difficult to optimize, and is often created by "enumeration (e.g., paraphrasing)-then-selection" heuristics that do not explore the prompt space systematically. This paper proposes RLPROMPT, an efficient discrete prompt optimization approach with reinforcement learning (RL). RL-PROMPT formulates a parameter-efficient policy network that generates the desired discrete prompt after training with reward. To overcome the complexity and stochasticity of reward signals by the large LM environment, we incorporate effective reward stabilization that substantially enhances the training efficiency. RLPROMPT is flexibly applicable to different types of LMs, such as masked (e.g., BERT) and left-to-right models (e.g., GPTs), for both classification and generation tasks. Experiments on few-shot classification and unsupervised text style transfer show superior performance over a wide range of existing finetuning or prompting methods. Interestingly, the resulting optimized prompts are often ungrammatical gibberish text; and surprisingly, those gibberish prompts are transferrable between different LMs to retain significant performance, indicating LM prompting may not follow human language patterns. 1

show abstract

Prompt Waywardness: The Curious Case of Discretized Interpretation of Continuous Prompts

Cited by 7 publications

References 10 publications

Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space

Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space

Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning

RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning

Contact Info

Product

Resources

About