Large language models (LMs) are able to in-context learn: they perform a new task via inference alone by conditioning on a few input-label pairs (demonstrations) and making predictions for new inputs. However, there has been little understanding of how the model learns and which aspects of the demonstrations contribute to end task performance. In this paper, we show that ground truth demonstrations are in fact not required: randomly replacing labels in the demonstrations barely hurts performance, consistently across 12 different models including GPT-3. Instead, we find that other aspects of the demonstrations are the key drivers of end task performance, including the fact that they provide a few examples of (1) the label space, (2) the distribution of the input text, and (3) the overall format of the sequence. Together, our analysis provides a new way of understanding how and why in-context learning works, while opening up new questions about how much can be learned from large language models through inference alone.
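The label-replacement ablation described above can be sketched in a few lines: build an in-context prompt from a handful of demonstrations, but pair each input with a label drawn at random from the label space instead of its gold label. This is a minimal illustration only; the demonstration texts, label space, and prompt template below are placeholders, not the paper's exact setup.

```python
import random

# Hypothetical sentiment demonstrations and label space (illustrative only).
demos = [
    ("the movie was a delight from start to finish", "positive"),
    ("a tedious, overlong mess", "negative"),
    ("an instant classic", "positive"),
    ("i want those two hours back", "negative"),
]
label_space = ["positive", "negative"]

def build_prompt(demos, label_space, test_input, use_random_labels=True, seed=0):
    """Format demonstrations as input/label pairs, optionally with random labels,
    followed by the test input whose label the LM must predict."""
    rng = random.Random(seed)
    lines = []
    for text, gold_label in demos:
        label = rng.choice(label_space) if use_random_labels else gold_label
        lines.append(f"Review: {text}\nSentiment: {label}")
    lines.append(f"Review: {test_input}\nSentiment:")
    return "\n\n".join(lines)

# The resulting string would be fed to an LM; comparing accuracy with
# use_random_labels=True vs. False is the spirit of the ablation.
print(build_prompt(demos, label_space, "a quiet, moving little film"))
```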
Fine-tuning continuous prompts for target tasks has recently emerged as a compact alternative to full model fine-tuning. Motivated by these promising results, we investigate the feasibility of extracting a discrete (textual) interpretation of continuous prompts that is faithful to the problem they solve. In practice, we observe a "wayward" behavior between the task solved by continuous prompts and their nearest-neighbor discrete projections: one can find continuous prompts that solve a task while being projected to an arbitrary text (e.g., the definition of a different or even a contradictory task), and simultaneously being within a very small (2%) margin of the best continuous prompt of the same size for the task. We provide intuitions behind this odd and surprising behavior, as well as extensive empirical analyses quantifying the effect of design choices. For instance, larger models exhibit higher waywardness, i.e., we can find prompts that more closely map to any arbitrary text with a smaller drop in accuracy. These findings have important implications relating to the difficulty of faithfully interpreting continuous prompts and their generalization across models and tasks, providing guidance for future progress in prompting language models.
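The nearest-neighbor discrete projection mentioned above maps each tuned prompt vector to the vocabulary token whose input embedding is closest to it. The sketch below is an assumption-laden illustration: the toy arrays, the `nearest_token_projection` helper, and the choice of cosine similarity are hypothetical and may differ from the paper's exact procedure.

```python
import numpy as np

def nearest_token_projection(prompt_vectors, embedding_matrix, vocab):
    """For each continuous prompt vector (k x d), return the vocabulary token
    whose embedding row (|V| x d) has the highest cosine similarity."""
    p = prompt_vectors / np.linalg.norm(prompt_vectors, axis=1, keepdims=True)
    e = embedding_matrix / np.linalg.norm(embedding_matrix, axis=1, keepdims=True)
    sims = p @ e.T                      # (k, |V|) cosine similarities
    nearest_ids = sims.argmax(axis=1)   # closest token index per prompt position
    return [vocab[i] for i in nearest_ids]

# Toy usage: a 3-token vocabulary with 2-dimensional embeddings and 2 prompt positions.
vocab = ["good", "bad", "movie"]
embedding_matrix = np.array([[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]])
prompt_vectors = np.array([[0.9, 0.1], [0.1, 0.8]])
print(nearest_token_projection(prompt_vectors, embedding_matrix, vocab))  # ['good', 'bad']
```

Waywardness, in these terms, is the observation that a prompt can perform well on a task even when this projection yields text describing an unrelated or contradictory task.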