2011 27th IEEE International Conference on Software Maintenance (ICSM) 2011
DOI: 10.1109/icsm.2011.6080801
|View full text |Cite
|
Sign up to set email alerts
|

Categorizing software applications for maintenance

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
34
0

Year Published

2012
2012
2021
2021

Publication Types

Select...
5
3
1

Relationship

3
6

Authors

Journals

citations
Cited by 49 publications
(34 citation statements)
references
References 24 publications
0
34
0
Order By: Relevance
“…These actionable guidelines are also pertinent to studies/approaches on software categorization [11,13,16], in which the lexical information in bytecode or source code is used to categorize the apps; given the widespread use of third-party libraries, such as Google Ads or Facebook for Android using the identifiers extracted from those libraries can reduce the variance and consequently impact the categorization process. In addition, studies aimed at identifying similar apps [15], which use non-textual based detection, should also consider the impact of third-party libraries and obfuscation practices.…”
Section: Discussionmentioning
confidence: 99%
“…These actionable guidelines are also pertinent to studies/approaches on software categorization [11,13,16], in which the lexical information in bytecode or source code is used to categorize the apps; given the widespread use of third-party libraries, such as Google Ads or Facebook for Android using the identifiers extracted from those libraries can reduce the variance and consequently impact the categorization process. In addition, studies aimed at identifying similar apps [15], which use non-textual based detection, should also consider the impact of third-party libraries and obfuscation practices.…”
Section: Discussionmentioning
confidence: 99%
“…[9]- [11] focus on analysing identifiers and comment terms in the source code to do categorization. McMillan et.al [12], [13] propose a brand-new approach which leverages the third-party API calls in the program as semantic anchors to categorize software. In these works, most of them only experiment on relatively small collections of projects with flat and coarse-grained categories like "Internet" and "Games/Entertainment".…”
Section: Introductionmentioning
confidence: 99%
“…They treat every software system as a document consisted of a collection of words including code identifiers and comments parsed from source code, cluster topics based on topic similarities and categorize software by software-topic matrices. McMillan et al [14] use API calls from third-party libraries as attributes for automatic categorization of software applications. They chose decision trees, naïve Bayes and support vector machines (SVM) to categorize applications, and find that SVM is most-effective.…”
Section: Related Workmentioning
confidence: 99%
“…The software engineering community has conducted plenty of efforts on discovering or retrieval of related software by mining source code identifiers [8,11,20], word frequencies [10], source code comments [19], API calls [14] and hybrid artifacts [1,3]. These works mainly concentrated on a few project repositories while little attention has been paid to the project profiles in global communities and made no use of software labels.…”
Section: Introductionmentioning
confidence: 99%