2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE) 2019
DOI: 10.1109/icse.2019.00123
|View full text |Cite
|
Sign up to set email alerts
|

9.6 Million Links in Source Code Comments: Purpose, Evolution, and Decay

Abstract: Links are an essential feature of the World Wide Web, and source code repositories are no exception. However, despite their many undisputed benefits, links can suffer from decay, insufficient versioning, and lack of bidirectional traceability. In this paper, we investigate the role of links contained in source code comments from these perspectives. We conducted a large-scale study of around 9.6 million links to establish their prevalence, and we used a mixed-methods approach to identify the links' targets, pur… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
91
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
3
1

Relationship

4
5

Authors

Journals

citations
Cited by 77 publications
(91 citation statements)
references
References 41 publications
0
91
0
Order By: Relevance
“…In these empirical studies, a common factor for low rates of executability include missing configuration information, such as dependencies, and software decay due to reliance on older versions (e.g., the code only works with older versions of APIs). Our observation of documentation rot is closely related to other forms of software decay, such as unavailable urls in source code [9]. In summary, executability is a useful property for understanding the quality and replicability of software artifacts, including those found in software tutorials.…”
Section: Related Workmentioning
confidence: 60%
“…In these empirical studies, a common factor for low rates of executability include missing configuration information, such as dependencies, and software decay due to reliance on older versions (e.g., the code only works with older versions of APIs). Our observation of documentation rot is closely related to other forms of software decay, such as unavailable urls in source code [9]. In summary, executability is a useful property for understanding the quality and replicability of software artifacts, including those found in software tutorials.…”
Section: Related Workmentioning
confidence: 60%
“…Outside of Stack Overflow, Hata et al [51] studied the role of links contained in source code comments in terms of prevalence, link targets, purposes, decay, and evolutionary aspects. Similar to our findings, Hata et al also report that link can be fragile since they are vulnerable to link rot and link targets change frequently.…”
Section: Related Workmentioning
confidence: 99%
“…a) Repository preparation: To pursue the feasibility of meta-maintenance, we collect a large amount of software development repositories that have been actively developed. We follow the same procedure as in a previous study [46] to identify candidate repositories. We target software development repositories on GitHub written in seven common programming languages, that is, C, C++, Java, JavaScript, Python, PHP, and Ruby.…”
Section: B Data Collectionmentioning
confidence: 99%
“…These languages have been ranked consistently in the top 10 languages on GitHub from 2008 to 2018 (based on the number of repositories from 2008 to 2015 [47], the number of pull requests from 2014 to 2018 [48], and top languages from 2014 to 2018 in the official report [49]). Using the GHTorrent dataset 1 [50], we identify active repositories for the seven languages with the following criteria [46]: (i) having more than 500 commits in their entire history (the same threshold used in previous work [51]), and (ii) having at least 100 commits in the most active two years to remove long-term less active repositories and short-term projects that have not been maintained for long. We determine repositories' languages based on the GHTorrent information.…”
Section: B Data Collectionmentioning
confidence: 99%