2013 35th International Conference on Software Engineering (ICSE) 2013
DOI: 10.1109/icse.2013.6606589
|View full text |Cite
|
Sign up to set email alerts
|

How, and why, process metrics are better

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

10
235
3
1

Year Published

2014
2014
2024
2024

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 258 publications
(249 citation statements)
references
References 26 publications
10
235
3
1
Order By: Relevance
“…Specifically, we evaluate whether ordering files by entropy will better guide us to identifying buggy files than traditional logistic regression and random forest based DP. DP is typically used at release-time to predict post-release bugs [35,57,42,36,11]; so, for this comparison we use the post release bug data collected in Phase-II. DP is implemented using two classifiers: logistic regression (LR) [43,42] and Random Forest (RF), where the response is a binary variable indicating whether a file is buggy or not.…”
Section: Rq2 Are Buggy Lines Less "Natural" Than Bug-fix Lines?mentioning
confidence: 99%
See 2 more Smart Citations
“…Specifically, we evaluate whether ordering files by entropy will better guide us to identifying buggy files than traditional logistic regression and random forest based DP. DP is typically used at release-time to predict post-release bugs [35,57,42,36,11]; so, for this comparison we use the post release bug data collected in Phase-II. DP is implemented using two classifiers: logistic regression (LR) [43,42] and Random Forest (RF), where the response is a binary variable indicating whether a file is buggy or not.…”
Section: Rq2 Are Buggy Lines Less "Natural" Than Bug-fix Lines?mentioning
confidence: 99%
“…DP is typically used at release-time to predict post-release bugs [35,57,42,36,11]; so, for this comparison we use the post release bug data collected in Phase-II. DP is implemented using two classifiers: logistic regression (LR) [43,42] and Random Forest (RF), where the response is a binary variable indicating whether a file is buggy or not. The predictor variables are the process metrics from [42,11], such as #developers, #file-commit, code churn, and previous bug history; prior research shows that process metrics are better predictors of file level defects [42].…”
Section: Rq2 Are Buggy Lines Less "Natural" Than Bug-fix Lines?mentioning
confidence: 99%
See 1 more Smart Citation
“…The change history of source codes provides information that can help predict fault-prone files [39]. For example, a source code file that was fixed very recently is more likely to still contain bugs than a file that was last fixed long time in the past, or never fixed.…”
Section: Bug-fixing Recencymentioning
confidence: 99%
“…Correspondingly, the source code is syntactically parsed into methods and the features are designed to exploit method-level measures of relevance for a bug report. It has been previously observed that software process metrics (e.g., change history) are more important than code metrics (e.g., size of codes) in detecting defects [39]. Consequently, we use the change history of source code as a strong signal for linking fault-prone files with bug reports.…”
Section: Introductionmentioning
confidence: 99%