Code completion with statistical language models

Raychev, Veselin; Vechev, Martin; Yahav, Eran

doi:10.1145/2594291.2594321

Cited by 361 publications

(161 citation statements)

References 42 publications

Supporting

Mentioning

156

Contrasting

Order By: Relevance

“…A large number of intelligent code completion systems for both statically and dynamically typed languages have been proposed in the literature [4,5,2,6,7,8]. Best Matching Neighbor (BMN) and statistical language models such as n-grams, as well as recurrent neural network (RNN) based approaches leveraging sequential nature of the source code have been particularly effective at creating such systems.…”

Section: Introductionmentioning

confidence: 99%

Pythia: AI-assisted Code Completion System

Svyatkovskiy

Zhao

et al. 2019

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery &Amp; Data Mining

129

View full text Add to dashboard Cite

In this paper, we propose a novel end-to-end approach for AI-assisted code completion called Pythia. It generates ranked lists of method and API recommendations which can be used by software developers at edit time. The system is currently deployed as part of Intellicode extension in Visual Studio Code IDE. Pythia exploits state-of-the-art large-scale deep learning models trained on code contexts extracted from abstract syntax trees. It is designed to work at a high throughput predicting the best matching code completions on the order of 100 ms. We describe the architecture of the system, perform comparisons to frequency-based approach and invocation-based Markov Chain language model, and discuss challenges serving Pythia models on lightweight client devices. The offline evaluation results obtained on 2700 Python open source software GitHub repositories show a top-5 accuracy of 92%, surpassing the baseline models by 20% averaged over classes, for both intra and cross-project settings.

show abstract

Section: Introductionmentioning

confidence: 99%

Pythia: AI-assisted Code Completion System

Svyatkovskiy

Zhao

et al. 2019

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery &Amp; Data Mining

129

View full text Add to dashboard Cite

show abstract

“…Future research could expand this work by considering further classes of smart contracts [35] and further details, such as different smart contract compiler versions [36]. Furthermore, future research that aims to support software developers could aim at not only detecting security vulnerabilities in a smart contract, but also at localizing this vulnerability (e.g., by expressing the information which portions of the bytecode cause the vulnerabilities [37]) or even by providing suggestions to fix it [60,61].…”

Section: ) Smart Contract Securitymentioning

confidence: 99%

On the Convergence of Artificial Intelligence and Distributed Ledger Technology: A Scoping Review and Future Research Agenda

et al. 2020

View full text Add to dashboard Cite

Developments in Artificial Intelligence (AI) and Distributed Ledger Technology (DLT) currently lead to lively debates in academia and practice. AI processes data to perform tasks that were previously thought possible only for humans. DLT has the potential to create consensus over data among a group of participants in uncertain environments. In recent research, both technologies are used in similar and even the same systems. Examples include the design of secure distributed ledgers or the creation of allied learning systems distributed across multiple nodes. This can lead to technological convergence, which in the past, has paved the way for major innovations in information technology. Previous work highlights several potential benefits of the convergence of AI and DLT but only provides a limited theoretical framework to describe upcoming real-world integration cases of both technologies. We aim to contribute by conducting a systematic literature review on previous work and providing rigorously derived future research opportunities. This work helps researchers active in AI or DLT to overcome current limitations in their field, and practitioners to develop systems along with the convergence of both technologies.

show abstract

“…Other methods, such as that of Omar [4] and Raychev et al [5] have also focused on techniques that require more information about the language such as its syntax, or its type system.…”

Section: Prior Workmentioning

confidence: 99%