Proceedings of the First International Workshop on Software Mining 2012
DOI: 10.1145/2384416.2384419
|View full text |Cite
|
Sign up to set email alerts
|

Labeled topic detection of open source software from mining mass textual project profiles

Abstract: Nowadays open source software has become an indispensable basis for both individual and industrial software engineering. Various kinds of labeling mechanisms like categories, keywords and tags are used in open source communities to annotate projects and facilitate the discovery of certain software. However, as large amounts of software are attached with no/few labels or the existing labels are from different ontology space, it is still hard to retrieve potentially topic-relevant software. This paper highlights… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0

Year Published

2013
2013
2022
2022

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 16 publications
(8 citation statements)
references
References 19 publications
0
8
0
Order By: Relevance
“…More recently, Wang et al . proposed an approach to assign tags to software projects using mining of existing projects tags and descriptions . These approaches can be used in our context to create groups of equivalent libraries but without any guarantee on the fact that a library of a group can be replaced by any other equivalent library of the group.…”
Section: Related Workmentioning
confidence: 99%
“…More recently, Wang et al . proposed an approach to assign tags to software projects using mining of existing projects tags and descriptions . These approaches can be used in our context to create groups of equivalent libraries but without any guarantee on the fact that a library of a group can be replaced by any other equivalent library of the group.…”
Section: Related Workmentioning
confidence: 99%
“…Topic models are widely-used in software engineering [16,19]. Unfortunately, the main weakness to topic modeling approaches is that there is no agreed-upon technique for labeling the topics so that those topics can be understood in context [28,16]. In other words, currently a programmer can read topics to determine what features are implemented in software, but it is difficult for the programmer to determine how those features interact, or which features are the most important.…”
Section: The Problemmentioning
confidence: 99%
“…Issue metadata, such as tags and labels, are designed to partially address the problem of information overload by allowing OSS stakeholders to quickly grasp the most important information such as the origin and the status of the issue. In practice, however, the metadata are often outdated, incomplete or even misleading [27]. Even when managed diligently, it usually only represents the interests of the project management team or the core developers and thus is inadequate in fulfilling the needs of the diverse OSS project participants.…”
Section: Introductionmentioning
confidence: 99%