Deeksha M. Arya scite author profile

Most modern Issue Tracking Systems (ITSs) for open source software (OSS) projects allow users to add comments to issues. Over time, these comments accumulate into discussion threads embedded with rich information about the software project, which can potentially satisfy the diverse needs of OSS stakeholders. However, discovering and retrieving relevant information from the discussion threads is a challenging task, especially when the discussions are lengthy and the number of issues in ITSs is vast. In this paper, we address this challenge by identifying the information types presented in OSS issue discussions. Through qualitative content analysis of 15 complex issue threads across three projects hosted on GitHub, we uncovered 16 information types and created a labeled corpus containing 4656 sentences. Our investigation of supervised, automated classification techniques indicated that, when prior knowledge about the issue is available, Random Forest can effectively detect most sentence types using conversational features such as the sentence length and its position. When classifying sentences from new issues, Logistic Regression can yield satisfactory performance using textual features for certain information types, while falling short on others. Our work represents a nontrivial first step towards tools and techniques for identifying and obtaining the rich information recorded in the ITSs to support various software engineering activities and to satisfy the diverse needs of OSS stakeholders.

Index Terms: collaborative software engineering, issue tracking system, issue discussion analysis

ArguLens: Anatomy of Community Opinions On Usability Issues Using Argumentation Models

Wang, Arya, Novielli, et al. 2020

Traceability Network Analysis: A Case Study of Links in Issue Tracking Systems

Nicholson, Arya, Guo 2020

Analysis and Detection of Information Types of Open Source Software Issue Discussions

Arya, Wang, Guo, et al. 2019 (Preprint)

Information correspondence between types of documentation for APIs

2020
