2016
DOI: 10.15439/2016f524
|View full text |Cite
|
Sign up to set email alerts
|

Preliminary Report on Empirical Study of Repeated Fragments in Internal Documentation

Abstract: Abstract-In this paper we present preliminary results of an empirical study, in which we used copy/paste detection (PMD CPD implementation) to search for repeating documentation fragments. The study was performed on 5 open source projects, including Java 8 SDK sources. The study shows that there are many occurrences of copy-pasting documentation fragments in the internal documentation, e.g., copy-pasted method parameter description. Besides these, many of the copy-pasted fragments express some domain or design… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(6 citation statements)
references
References 11 publications
0
6
0
Order By: Relevance
“…In their further research [13], they examine exact duplicates in embedded documentation of several open-source projects, but do not consider near duplicates.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In their further research [13], they examine exact duplicates in embedded documentation of several open-source projects, but do not consider near duplicates.…”
Section: Related Workmentioning
confidence: 99%
“…Duplicates in software documentation have been extensively studied during the last decade [6,11,12,13,14,15,16,17]. At the same time, there are no specialized tools for duplicate detection.…”
Section: Introductionmentioning
confidence: 99%
“…In [3] Nosál and Porubän present the results of a case study in which they searched for exact duplicates in internal documentation (source code comments) of an open source project set. They used a modified copy/paste detection tool, which was originally developed for code analysis and found considerable number of text duplicates.…”
Section: Related Workmentioning
confidence: 99%
“…The initial interval tree for is constructed using the () function (line 2). The core part of the algorithm is a loop in which new near duplicate groups are constructed (lines [3][4][5][6][7][8][9][10][11][12][13][14][15][16][17][18]…”
Section: Algorithm Descriptionmentioning
confidence: 99%
See 1 more Smart Citation