Qiming Yuan scite author profile

We introduce Codex, a GPT language model finetuned on publicly available code from GitHub, and study its Python code-writing capabilities. A distinct production version of Codex powers GitHub Copilot. On HumanEval, a new evaluation set we release to measure functional correctness for synthesizing programs from docstrings, our model solves 28.8% of the problems, while GPT-3 solves 0% and GPT-J solves 11.4%. Furthermore, we find that repeated sampling from the model is a surprisingly effective strategy for producing working solutions to difficult prompts. Using this method, we solve 70.2% of our problems with 100 samples per problem. Careful investigation of our model reveals its limitations, including difficulty with docstrings describing long chains of operations and with binding operations to variables. Finally, we discuss the potential broader impacts of deploying powerful code generation technologies, covering safety, security, and economics.

show abstract

Solving Rubik's Cube with a Robot Hand

OpenAI¹,

Akkaya²,

Andrychowicz³

et al. 2019

Preprint

268

296

View full text Add to dashboard Cite

The sequential organisation of gift offering and acceptance in Chinese

Hua

Wei

Yuan

2000

Journal of Pragmatics

View full text Add to dashboard Cite

Language context tunes brain network for language control in bilingual language production

Zhang

Chen

et al. 2020

Neuropsychologia

View full text Add to dashboard Cite

The top 100 most cited articles on rhabdomyolysis: A bibliometric analysis

Liu

Yuan

Mao

et al. 2020

The American Journal of Emergency Medicine

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Qiming Yuan

Evaluating Large Language Models Trained on Code

Solving Rubik's Cube with a Robot Hand

The sequential organisation of gift offering and acceptance in Chinese

Language context tunes brain network for language control in bilingual language production

The top 100 most cited articles on rhabdomyolysis: A bibliometric analysis

Contact Info

Product

Resources

About