SummaryIn software maintenance the developers may add new feature to program, improve existing function, and remove bugs. In this case the developer should identify the location in the source code that corresponds to a specific functionality; this is known as feature location. This presents a new approach for enhancing the process of feature location using Information Retrieval (IR) and Natural Language Processing. The approach presented augments the source code with additional semantic information that was extracted and derived from the synonyms of source code terms. This approach works in a pipeline structure, starting by augmenting the source code corpus with synonyms of the original terms and ending by inferring the source code with a particular user query. More specifically, the WordNet platform is used for extracting the synonyms of terms. Moreover, the approach uses an advanced IR technique, namely the Latent Semantic Indexing, for searching and inferring the source code. The used approach was tested and evaluated on two open source systems, namely the Qt and Hippodraw. Four experiments were conducted on each system using 21 features and the results showed that enriching the source code with synonyms of terms clearly and significantly improved the process of feature location efficiently. The experimental results showed that the approach presented obtained higher levels of Recall and Precision. For that reason, it has been shown to improve the state‐of‐the‐art techniques for feature location process.
In information retrieval, the accuracy of the retrieval process is mainly dependent on query terms selection; therefore, the user must choose the needed terms carefully and selectively. Traditionally, the process of selecting query terms is done manually. However, in the last two decades, a lot of research has been directed towards automating the process of choosing and enhancing query terms. In this article, a new novel approach is presented, which relies on topic modeling in query building and expansion. Two open source systems were selected to perform the experiments, results show that adding the topic's term to the user's query clearly improves its quality and thus, improves the ranking results.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.