2022
DOI: 10.1016/j.infsof.2021.106783

How far are we from reproducible research on code smell detection? A systematic literature review

Cited by 30 publications (14 citation statements)
References 35 publications
“…In our previous work [10], we tackled the problem of developing a machine learning (ML)-based code smell detector. Like other researchers, we, unfortunately, found that the publicly available code smell datasets may be hard to reproduce [11], [12] and contain noisy labels due to annotators' inconsistent understanding of the code smells [12]. As the performance of ML models highly depends on the used dataset, the field of ML-based code smell detection would greatly benefit from a systematic approach to labeling code smells.…”
Section: Introduction (mentioning)
confidence: 91%
“…To be qualified as a reproducible scientific study, the reported experimental results of a study should be obtained by other researchers using authors' artifacts (i.e., source code and datasets) with the same experimental setup. Some researchers pointed out the reproducibility issues in SE (Lewowski & Madeyski, 2022). Recently analyzed some studies on the use of DL models in solving a SE problem, like defect prediction or code clone detection.…”
Section: Reproducibility Package (mentioning)
confidence: 99%
“…Thus, we examined whether the authors of primary studies on SDP using DL publish reproduction packages for their studies. We used the categories used by Lewowski & Madeyski (2022) during data extraction. Figure 12 shows the results on the presence of a reproducibility package in the primary studies in our paper pool.…”
Section: Reproducibility Package (mentioning)
confidence: 99%
“…There are two classical coefficients to measure the correlation between indicators: Spearman rank correlation coefficient and Pearson correlation coefficient [28]. The Pearson correlation coefficient has two restrictions: (1) the data obey the normal distribution; (2) the data units are consistent, and the zeros are relative, not absolute. If the measured metrics do not meet the Pearson conditions, it is necessary to consider the Spearman rank correlation coefficient.…”
Section: Spearman Rank Correlation Coefficient [27] (mentioning)
confidence: 99%
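The contrast drawn in the statement above between the Pearson and Spearman coefficients can be sketched in a few lines. This is only an illustrative sketch, not code from the citing paper: the function names `pearson`, `average_ranks`, and `spearman` and the sample data are assumptions made here. Spearman's coefficient is computed as the Pearson coefficient of the ranks, with tied values sharing their average rank.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant data")
    return cov / sqrt(var_x * var_y)


def average_ranks(values):
    """1-based ranks; tied values receive the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman(xs, ys):
    """Spearman rank correlation: Pearson applied to the ranks."""
    return pearson(average_ranks(xs), average_ranks(ys))


# Hypothetical data: a monotonic but non-linear relationship.
xs = [1, 2, 3, 4, 5]
ys = [x ** 3 for x in xs]
print("Pearson: ", pearson(xs, ys))
print("Spearman:", spearman(xs, ys))
```

Because the sample relationship is monotonic but not linear, the rank-based coefficient reports a perfect association while the Pearson coefficient stays below 1, which is one reason to prefer Spearman when Pearson's conditions are not met.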
“…The security and quality assurance of Android apps become crucial and vital to keep appealing and adapting to new devices. As a metric indicating the sub-optimal design, smells are the main culprit [1,2].…”
Section: Introduction (mentioning)
confidence: 99%