Proceedings of the 2008 International Working Conference on Mining Software Repositories 2008
DOI: 10.1145/1370750.1370777
|View full text |Cite
|
Sign up to set email alerts
|

Correctness of data mined from CVS

Abstract: Source code repositories managed by the popular CVS are frequently mined by researchers to validate various hypotheses about how the development of a software product progressed. This paper presents a study where the development process of 17 student teams was followed. It will show that in an environment where the teams were permitted to manage their own CVS repositories that several errors emerged that could lead to the misinterpretation of a hypothesis. These were classified into three types: type one error… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2009
2009
2011
2011

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 5 publications
0
2
0
Order By: Relevance
“…Each transaction contains a number of commits, an author, a log message, and a date. Note that we mine CVS repositories with the obvious issues, e. g., file naming, commit transaction, described in [7,17]. Since CVS does not keep track of transactions, we run a commit algorithm on the CVS history as described in [7].…”
Section: Mining Software Repositoriesmentioning
confidence: 99%
“…Each transaction contains a number of commits, an author, a log message, and a date. Note that we mine CVS repositories with the obvious issues, e. g., file naming, commit transaction, described in [7,17]. Since CVS does not keep track of transactions, we run a commit algorithm on the CVS history as described in [7].…”
Section: Mining Software Repositoriesmentioning
confidence: 99%
“…Subversion does provide the original name of a file when the file was renamed with Subversion's rename facility, but only then. CVS does not record file name changes at all, which leads to misinterpretations [5]. For example, Subversion's blame command is useful when origins of code are needed.…”
Section: Crude Change Informationmentioning
confidence: 99%