Software defect prediction can assist developers in finding potential bugs and reducing maintenance costs. Traditional approaches usually utilize software metrics (e.g., Lines of Code and Cyclomatic Complexity) as features to build classifiers and identify defective software modules. However, software metrics often fail to capture the syntactic and semantic information of programs. In this paper, we propose Seml, a novel framework that combines word embedding and deep learning methods for defect prediction. Specifically, for each program source file, we first extract a token sequence from its abstract syntax tree. Then, we map each token in the sequence to a real-valued vector using a mapping table, which is trained with an unsupervised word embedding model. Finally, we use the vector sequences and their labels (defective or non-defective) to build a Long Short-Term Memory (LSTM) network. The LSTM model can automatically learn the semantic information of programs and perform defect prediction. The evaluation results on eight open-source projects show that Seml outperforms three state-of-the-art defect prediction approaches on most of the datasets for both within-project defect prediction and cross-project defect prediction.

INDEX TERMS Defect prediction, Long Short-Term Memory network, word embedding.
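The abstract above describes a three-step pipeline (AST token sequences, a learned token-to-vector mapping table, and an LSTM classifier) without an accompanying implementation. The following is a minimal sketch of that pipeline, assuming the token sequences have already been extracted from each source file's abstract syntax tree; gensim and Keras are used here only as illustrative stand-ins for whatever tooling the authors actually used, and all data and hyperparameters are placeholders.

# Minimal sketch of a Seml-style pipeline: AST token sequences ->
# word-embedding lookup -> LSTM defect classifier. Illustrative only.
import numpy as np
from gensim.models import Word2Vec
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Masking, LSTM, Dense

# Toy data: one token sequence per source file, with a defect label (1 = defective).
token_seqs = [
    ["MethodDeclaration", "IfStatement", "MethodInvocation", "ReturnStatement"],
    ["MethodDeclaration", "ForStatement", "MethodInvocation"],
]
labels = np.array([1, 0])

# Step 1: learn the token -> vector mapping table with an unsupervised
# word-embedding model (skip-gram here; dimensions are placeholders).
emb_dim = 50
w2v = Word2Vec(sentences=token_seqs, vector_size=emb_dim, window=5, min_count=1, sg=1)

# Step 2: map each token sequence to a zero-padded sequence of real-valued vectors.
max_len = max(len(seq) for seq in token_seqs)
X = np.zeros((len(token_seqs), max_len, emb_dim), dtype=np.float32)
for i, seq in enumerate(token_seqs):
    for j, tok in enumerate(seq):
        X[i, j] = w2v.wv[tok]

# Step 3: train an LSTM classifier on the vector sequences and their labels.
model = Sequential([
    Masking(mask_value=0.0, input_shape=(max_len, emb_dim)),
    LSTM(32),
    Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
model.fit(X, labels, epochs=5, verbose=0)

# A new (already tokenized and embedded) file would be scored with model.predict.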