2019
DOI: 10.1007/978-3-030-26574-8_1
|View full text |Cite
|
Sign up to set email alerts
|

Creating Evolving Project Data Sets in Software Engineering

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 28 publications
0
3
0
Order By: Relevance
“…GitHub and Open source software as sources for mining software development data Data mining on GitHub or Open source repositories has been used to create evolving project data sets for intelligent/empirical software engineering [108,109], for example, for Eclipse Modelling Framework metamodels formation [110,111] anomaly detection [112] or identification of migration reasons [113].…”
Section: Covid-19 Influence On Software Engineeringmentioning
confidence: 99%
“…GitHub and Open source software as sources for mining software development data Data mining on GitHub or Open source repositories has been used to create evolving project data sets for intelligent/empirical software engineering [108,109], for example, for Eclipse Modelling Framework metamodels formation [110,111] anomaly detection [112] or identification of migration reasons [113].…”
Section: Covid-19 Influence On Software Engineeringmentioning
confidence: 99%
“…Since constant changes in development practices and technologies over time can produce different outcomes in ESE studies, a dataset of software projects constructed several years ago can rarely be representative of recently developed projects. Lewowski and Madeyski (2020) partially tackled this with their tool to create evolving datasets, but did not precise how to deal with deprecated projects.…”
Section: Related Workmentioning
confidence: 99%
“…Irreproducibility-which may either be planned or accidental-is a serious problem in many fields of science like medicine [3], biomedical studies [4], and software engineering [5] and, in our opinion, should definitely be included as a principle to be followed. On a practical level, this would include detailed research protocols, data collection methods and datasheets or even automated scripts ran to gather data [6]. Only by including all relevant details we can verify research claims and avoid problems like comparing results obtained from two data sets that were sampled from different populations-a problem especially visible during COVID-19 pandemic, where infection Bull.…”
Section: 2mentioning
confidence: 99%