2021 · Preprint
DOI: 10.48550/arxiv.2102.07350

Prompt Programming for Large Language Models: Beyond the Few-Shot Paradigm

Abstract: Prevailing methods for mapping large generative language models to supervised tasks may fail to sufficiently probe models' novel capabilities. Using GPT-3 as a case study, we show that 0-shot prompts can significantly outperform few-shot prompts. We suggest that the function of few-shot examples in these cases is better described as locating an already learned task rather than meta-learning. This analysis motivates rethinking the role of prompts in controlling and evaluating powerful language models. In this w…

Cited by 22 publications (23 citation statements)
References 14 publications

“…For instance placing "TL;DR" (internet slang for Too Long; Didn't Read) at the end of an article causes the model to generate a summary. Efficiently discovering the right prompt is difficult and has become an active area of research (Reynolds and McDonell, 2021; Shin et al., 2020; Jiang et al., 2020). Brown et al. (2020) demonstrated that few-shot learning without fine-tuning is possible with very large language models.…”
Section: Background and Related Work
confidence: 99%
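
As a concrete illustration of the zero-shot pattern described in the quoted passage, the sketch below appends a "TL;DR:" cue to an article so that the model's continuation reads as a summary. This is only a sketch: the `generate` callable is a hypothetical stand-in for any language-model completion call, not a specific library API.

```python
def build_tldr_prompt(article: str) -> str:
    # The trailing "TL;DR:" cue signals the model to continue with a summary.
    return f"{article.strip()}\n\nTL;DR:"


def summarize(article: str, generate) -> str:
    # `generate` is assumed to map a prompt string to the model's continuation text.
    prompt = build_tldr_prompt(article)
    return generate(prompt).strip()
```
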
“…Prompt-based approaches involve constructing optimal prompts for language models to best elicit knowledge and maximize prediction performance (Radford et al., 2019; Schick and Schütze, 2020). As the scale of language models grows, the potential of replacing the full fine-tuning paradigm with the prompt-based approach has been reported (Reynolds and McDonell, 2021; Li and Liang, 2021), as learning via prompts is efficient regarding time and space complexity. However, language models are highly sensitive to the prompt design, motivating methodologies for optimizing prompts.…”
Section: Prompt Optimization
confidence: 99%
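
To make the sensitivity point concrete, the sketch below builds two prompt phrasings for the same sentiment-classification input; wording differences like these are exactly what automated prompt-optimization methods search over. The templates are illustrative assumptions, not drawn from the cited works.

```python
# Two prompt phrasings for the same input. A model's output can differ
# noticeably between them, which motivates automated prompt optimization.
TEMPLATES = [
    "Review: {text}\nSentiment (positive or negative):",
    "{text}\nAll in all, the movie was",
]


def build_prompts(text: str) -> list[str]:
    return [template.format(text=text) for template in TEMPLATES]


for prompt in build_prompts("The plot was thin, but the acting saved it."):
    print(repr(prompt))
```
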
“…It has been shown that a well-selected prefix, or 'prompt', can dramatically increase the performance of a language model on a specific task [41]. A resulting line of research has been automatically creating either natural language prompts or continuous vector prompts, to perform well on tasks [19,34].…”
Section: Natural Language Generation
confidence: 99%
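
The quoted passage distinguishes natural language prompts from continuous vector prompts. Below is a minimal sketch of the latter, assuming a PyTorch-style setup in which trainable virtual-token embeddings are prepended to the input embeddings (in the spirit of prefix/prompt tuning); the class name and dimensions are illustrative, not any cited system's implementation.

```python
import torch
import torch.nn as nn


class SoftPrompt(nn.Module):
    """Learnable 'continuous vector prompt': a block of virtual-token embeddings
    prepended to the input embeddings. Only these parameters are trained; the
    language model itself stays frozen."""

    def __init__(self, n_virtual_tokens: int, embed_dim: int):
        super().__init__()
        self.prompt = nn.Parameter(0.02 * torch.randn(n_virtual_tokens, embed_dim))

    def forward(self, input_embeds: torch.Tensor) -> torch.Tensor:
        # input_embeds: (batch, seq_len, embed_dim)
        batch_size = input_embeds.size(0)
        prefix = self.prompt.unsqueeze(0).expand(batch_size, -1, -1)
        return torch.cat([prefix, input_embeds], dim=1)
```
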
“…First we craft a 'prefix' prompt to prepend to any prompt used by a writer. Prefix prompts have been shown to greatly improve performance by providing the language model with appropriate context [41]. We found early on in development that simply providing the model with a technical topic was not enough; also providing a context area was necessary for it to appropriately interpret technical terms.…”
Section: Design Goals
confidence: 99%
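
A rough sketch of the prefix idea in the quoted passage: a fixed preamble supplying both a context area and a technical topic is prepended to whatever the writer types. The field names and wording here are assumptions for illustration, not the cited system's actual prefix.

```python
def build_prefixed_prompt(context_area: str, topic: str, writer_prompt: str) -> str:
    # The prefix supplies context so the model interprets technical terms in the
    # intended field rather than in a colloquial sense.
    prefix = (
        f"Context area: {context_area}\n"
        f"Topic: {topic}\n\n"
    )
    return prefix + writer_prompt


# Example: without "machine learning" as the context area, a term like
# "attention" could be read in its everyday sense.
print(build_prefixed_prompt(
    "machine learning",
    "transformer attention",
    "Explain why self-attention cost grows quadratically with sequence length.",
))
```
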