2020
DOI: 10.1007/978-3-030-51054-1_6

Deep Generation of Coq Lemma Names Using Elaborated Terms

Abstract: Coding conventions for naming, spacing, and other essentially stylistic properties are necessary for developers to effectively understand, review, and modify source code in large software projects. Consistent conventions in verification projects based on proof assistants, such as Coq, increase in importance as projects grow in size and scope. While conventions can be documented and enforced manually at high cost, emerging approaches automatically learn and suggest idiomatic names in Java-like languages by appl…

Cited by 9 publications (11 citation statements)
References: 51 publications

“…We evaluated an earlier version of ROOSTERIZE using a corpus derived from the MathComp family of Coq projects, finding that the toolchain significantly outperforms strong baselines on automatic metrics [7]. Moreover, we found encouraging results in a qualitative case study where the maintainer of a medium-sized Coq project manually evaluated over 150 name suggestions generated by ROOSTERIZE.…”
Section: Introduction (mentioning)
confidence: 88%
“…Users should first obtain a model, e.g., by downloading a pre-trained model. The following command downloads the model we pre-trained on our MathComp corpus [7]:…”
Section: A Command Line (mentioning)
confidence: 99%
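The command referenced in the quoted snippet is not reproduced here. Purely as an illustration of the "obtain a pre-trained model" step, the following Python sketch downloads a model archive from a placeholder URL; the URL, file name, and destination directory are assumptions made for this example and do not reflect ROOSTERIZE's actual command-line interface.

    import urllib.request
    from pathlib import Path

    # Placeholder URL and paths for illustration only; consult the tool's
    # documentation for the actual location of the pre-trained MathComp model.
    MODEL_URL = "https://example.org/roosterize/mathcomp-model.tgz"
    DEST = Path.home() / ".roosterize" / "mathcomp-model.tgz"

    DEST.parent.mkdir(parents=True, exist_ok=True)
    urllib.request.urlretrieve(MODEL_URL, str(DEST))
    print(f"Downloaded pre-trained model to {DEST}")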
“…Despite the high accuracy achieved by our preliminary implementation even when using the baseline n-gram model, we believe our spacing prediction (based only on raw token streams) needs significant tuning for practical use. For example, newlines before Qed sentences often get mispredicted, and unlike for name suggestions [3], it is usually inconvenient to inspect more than the top-1 suggestion for spacing. Moreover, for MathComp, we were able to construct, with help from maintainers, a sufficiently large corpus with strict adherence to conventions; for other projects, it may be more challenging, e.g., due to project size or lack of consensus on conventions.…”
Section: Challenges and Future Directions (mentioning)
confidence: 99%
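To make the spacing-prediction setting concrete, here is a minimal sketch of an n-gram-style predictor over raw token streams, in the spirit of the baseline mentioned in the statement above; the class name, training-data format, and backoff scheme are assumptions made for illustration and are not the authors' implementation.

    from collections import Counter, defaultdict

    class SpacingNGram:
        """Illustrative sketch: predict the whitespace string ("", " ", or a
        newline) that precedes each token, conditioned on the last n tokens."""

        def __init__(self, n=3):
            self.n = n
            self.counts = defaultdict(Counter)  # token context -> Counter over spacings

        def train(self, examples):
            # examples: iterable of (tokens, spacings); spacings[i] is the
            # whitespace that appears immediately before tokens[i].
            for tokens, spacings in examples:
                for i, spacing in enumerate(spacings):
                    for k in range(1, self.n + 1):
                        context = tuple(tokens[max(0, i - k + 1):i + 1])
                        self.counts[context][spacing] += 1

        def predict(self, tokens, i):
            # Back off to shorter contexts when the full one was never seen;
            # fall back to a single space if no context matches.
            for k in range(self.n, 0, -1):
                context = tuple(tokens[max(0, i - k + 1):i + 1])
                if context in self.counts:
                    return self.counts[context].most_common(1)[0][0]
            return " "

    # Tiny usage example with a made-up Coq-like token stream.
    model = SpacingNGram(n=2)
    model.train([(["Lemma", "addnC", ":", "commutative", "addn", "."],
                  ["\n", " ", " ", " ", " ", ""])])
    print(repr(model.predict(["Lemma", "addnC"], 1)))  # -> ' '

A top-1 prediction like this is the relevant setting, since (as the statement notes) inspecting multiple alternative spacing suggestions is usually impractical.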
“…As a first step, we here outline initial models to learn and suggest space formatting in Coq files, with a preliminary implementation for Coq 8.10, evaluated using a corpus based on MathComp 1.9.0 which comprises 164k lines of Coq code from four core projects [3].…”
(mentioning)
confidence: 99%