Proceedings of the 15th International Conference on Mining Software Repositories 2018
DOI: 10.1145/3196398.3196460
|View full text |Cite
|
Sign up to set email alerts
|

A graph-based dataset of commit history of real-world Android apps

Abstract: Obtaining a good dataset to conduct empirical studies on the engineering of Android apps is an open challenge. To start tackling this challenge, we present AndroidTimeMachine, the first, self-contained, publicly available dataset weaving spread-out data sources about real-world, open-source Android apps. Encoded as a graph-based database, AndroidTimeMachine concerns 8,431 real open-source Android apps and contains: (i) metadata about the apps' GitHub projects, (ii) Git repositories with full commit history and… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
31
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
5
5

Relationship

3
7

Authors

Journals

citations
Cited by 50 publications
(31 citation statements)
references
References 16 publications
(28 reference statements)
0
31
0
Order By: Relevance
“…This will require an in-depth analysis of the commit logs in order to trace, for each issue, its inducing commits, its changes of location within the repository, and its issue-resolution commit. Moreover, we are planning to investigate on how the maintainability issue evolution trends compare between different static code analysis tools and on a larger dataset of GitHub repositories such as the one in [46]. Finally, we will select a subset of representative apps, build and fine tune prediction models for their maintainability issues (e.g., by using ARIMA), and assess the accuracy of those models in predicting how maintainability issues evolve in the future.…”
Section: Discussionmentioning
confidence: 99%
“…This will require an in-depth analysis of the commit logs in order to trace, for each issue, its inducing commits, its changes of location within the repository, and its issue-resolution commit. Moreover, we are planning to investigate on how the maintainability issue evolution trends compare between different static code analysis tools and on a larger dataset of GitHub repositories such as the one in [46]. Finally, we will select a subset of representative apps, build and fine tune prediction models for their maintainability issues (e.g., by using ARIMA), and assess the accuracy of those models in predicting how maintainability issues evolve in the future.…”
Section: Discussionmentioning
confidence: 99%
“…However, we did not have access to any proprietary software that can serve this study. We also encourage future studies to consider other datasets of open-source apps to extend this study [16], [32]. Another possible threat is that our study only concerns 8 Android-specific code smells.…”
Section: B Threats To Validitymentioning
confidence: 98%
“…However, we did not have access to any proprietary software that can serve this study. We also encourage future studies to consider other datasets of open-source apps to extend this study [16], [33]. Another possible threat is that our study only concerns 8 Android-specific code smells.…”
Section: B Threats To Validitymentioning
confidence: 98%