Stanislas Polu scite author profile

Stanislas Polu

5Publications

85Citation Statements Received

145Citation Statements Given

How they've been cited

How they cite others

145

Affiliations

OpenAI (United States), École Polytechnique

Publications

Order By: Most citations

Generative Language Modeling for Automated Theorem Proving

Polu¹,

Sutskever²

2020

Preprint

View full text Add to dashboard Cite

We explore the application of transformer-based language models to automated theorem proving. This work is motivated by the possibility that a major limitation of automated theorem provers compared to humans -the generation of original mathematical terms -might be addressable via generation from language models. We present an automated prover and proof assistant, GPT-f, for the Metamath formalization language, and analyze its performance. GPT-f found new short proofs that were accepted into the main Metamath library, which is to our knowledge, the first time a deep learning based system has contributed proofs that were adopted by a formal mathematics community.Preprint. Under review.

show abstract

Proof Artifact Co-training for Theorem Proving with Language Models

Han¹,

Rute²,

Wu³

et al. 2021

Preprint

View full text Add to dashboard Cite

Labeled data for imitation learning of theorem proving in large libraries of formalized mathematics is scarce as such libraries require years of concentrated effort by human specialists to be built. This is particularly challenging when applying large Transformer language models to tactic prediction, because the scaling of performance with respect to model size is quickly disrupted in the data-scarce, easily-overfitted regime. We propose PACT (Proof Artifact Co-Training), a general methodology for extracting abundant selfsupervised data from kernel-level proof terms for co-training alongside the usual tactic prediction objective. We apply this methodology to Lean, an interactive proof assistant which hosts some of the most sophisticated formalized mathematics to date. We instrument Lean with a neural theorem prover driven by a Transformer language model and show that PACT improves theorem proving success rate on a held-out suite of test theorems from 32% to 48%.

show abstract

MiniF2F: a cross-system benchmark for formal Olympiad-level mathematics

Zheng

Han

Polu

2021

Preprint

View full text Add to dashboard Cite

We present miniF2F, a dataset of formal Olympiad-level mathematics problems statements intended to provide a unified cross-system benchmark for neural theorem proving. The miniF2F benchmark currently targets Metamath, Lean, and Isabelle and consists of 488 problem statements drawn from the AIME, AMC, and the International Mathematical Olympiad (IMO), as well as material from high-school and undergraduate mathematics courses. We report baseline results using GPT-f [12], a neural theorem prover based on and provide an analysis of its performance. We intend for miniF2F to be a community-driven effort and hope that our benchmark will help spur advances in neural theorem proving.

show abstract

Formal Mathematics Statement Curriculum Learning

Polu¹,

Han²,

Zheng³

et al. 2022

Preprint

View full text Add to dashboard Cite

Discrepancy-Sensitive Dynamic Fractional Cascading, Dominated Maxima Searching, and 2-d Nearest Neighbors in Any Minkowski Metric

Atallah

Blanton

Goodrich

et al.

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Stanislas Polu

Generative Language Modeling for Automated Theorem Proving

Proof Artifact Co-training for Theorem Proving with Language Models

MiniF2F: a cross-system benchmark for formal Olympiad-level mathematics

Formal Mathematics Statement Curriculum Learning

Discrepancy-Sensitive Dynamic Fractional Cascading, Dominated Maxima Searching, and 2-d Nearest Neighbors in Any Minkowski Metric

Contact Info

Product

Resources

About