Anthony Peruma scite author profile

Identifiers make up a majority of the text in code. They are one of the most basic mediums through which developers describe the code they create and understand the code that others create. Therefore, understanding the patterns latent in identifier naming practices and how accurately we are able to automatically model these patterns is vital if researchers are to support developers and automated analysis approaches in comprehending and creating identifiers correctly and optimally. This paper investigates identifiers by studying sequences of partof-speech annotations, referred to as grammar patterns. This work advances our understanding of these patterns and our ability to model them by 1) establishing common naming patterns in different types of identifiers, such as class and attribute names; 2) analyzing how different patterns influence comprehension; and 3) studying the accuracy of state-of-the-art techniques for part-of-speech annotations, which are vital in automatically modeling identifier naming patterns, in order to establish their limits and paths toward improvement. To do this, we manually annotate a dataset of 1,335 identifiers from 20 open-source systems and use this dataset to study naming patterns, semantics, and tagger

show abstract

tsDetect: an open source test smells detection tool

Peruma

Almalki

Newman

et al. 2020

View full text Add to dashboard Cite

The test code, just like production source code, is subject to bad design and programming practices, also known as smells. The presence of test smells in a software project may affect the quality, maintainability, and extendability of test suites making them less effective in finding potential faults and quality issues in the project's production code. In this paper, we introduce tsDetect, an automated test smell detection tool for Java software systems that uses a set of detection rules to locate existing test smells in test code. We evaluate the effectiveness of tsDetect on a benchmark of 65 unit test files containing instances of 19 test smell types. Results show that tsDetect achieves a high detection accuracy with an average precision score of 96% and an average recall score of 97%. tsDetect is publicly available, with a demo video, at: https://testsmells.github.io/ CCS CONCEPTS • Software and its engineering → Software testing and debugging; Software notations and tools; Software maintenance tools.

show abstract

How we refactor and how we document it? On the use of supervised machine learning algorithms to classify refactoring documentation

AlOmar

Peruma

Mkaouer

et al. 2021

Expert Systems with Applications

View full text Add to dashboard Cite

An empirical investigation of how and why developers rename identifiers

Peruma

Mkaouer

Decker

et al. 2018

View full text Add to dashboard Cite

Identifier names play a significant role in program comprehension activities, with high-quality names improving developer productivity and system quality. To correct poorquality names, developers rename identifiers to reflect their intended purpose better. However, renames do not always result in high-quality, long-lasting names; in many cases, developers perform multiple rename operations on the same identifier throughout the system's lifetime. In this paper, we report on a large-scale empirical study that examines the occurrence of identifiers undergoing multiple renames (i.e., rename chains). Our findings show the presence of rename chains in almost every project, with methods typically having more rename chains than other identifier types. Furthermore, it is usually the same developer responsible for creating all renames within a chain, with most names maintaining the same grammatical structure. Understanding rename chains can help us provide stronger advice, and targeted research, on how to craft high-quality, longlasting identifiers.

show abstract

Contextualizing Rename Decisions using Refactorings and Commit Messages

Peruma

Mkaouer

Decker

et al. 2019

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Anthony Peruma

On the generation, structure, and semantics of grammar patterns in source code identifiers

tsDetect: an open source test smells detection tool

How we refactor and how we document it? On the use of supervised machine learning algorithms to classify refactoring documentation

An empirical investigation of how and why developers rename identifiers

Contextualizing Rename Decisions using Refactorings and Commit Messages

Contact Info

Product

Resources

About