2022
DOI: 10.48550/arxiv.2205.11491
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

HyperTree Proof Search for Neural Theorem Proving

Abstract: We propose an online training procedure for a transformer-based automated theorem prover. Our approach leverages a new search algorithm, HyperTree Proof Search (HTPS), inspired by the recent success of AlphaZero. Our model learns from previous proof searches through online training, allowing it to generalize to domains far from the training distribution. We report detailed ablations of our pipeline's main components by studying performance on three environments of increasing complexity. In particular, we show … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 24 publications
0
3
0
Order By: Relevance
“…Our work proposes to augment methods that alternate between learning and search to solve mathematical reasoning problems. Several methods of this flavor have been introduced recently, such as Expert Iteration [1] and AlphaZero [13] for game playing, HTPS [6] and GPT-f [11] for neural theorem proving, and ConPoLe [10] for mathematical problems from the Common Core environments.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Our work proposes to augment methods that alternate between learning and search to solve mathematical reasoning problems. Several methods of this flavor have been introduced recently, such as Expert Iteration [1] and AlphaZero [13] for game playing, HTPS [6] and GPT-f [11] for neural theorem proving, and ConPoLe [10] for mathematical problems from the Common Core environments.…”
Section: Related Workmentioning
confidence: 99%
“…The Common Core environments are a simplified setting compared to industrial theorem proving languages, like Lean [8] or Isabelle/HOL [9]. A range of significant work has been developed in RL agents to find proofs in these languages [11,6,3], typically using a language model fine-tuned on human proofs as the action generator.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation