Proceedings of the 22nd International Conference on Program Comprehension 2014
DOI: 10.1145/2597008.2597793

Improving topic model source code summarization

Abstract: In this paper, we present an emerging source code summarization technique that uses topic modeling to select keywords and topics as summaries for source code. Our approach organizes the topics in source code into a hierarchy, with more general topics near the top of the hierarchy. In this way, we present the software's highest-level functionality first, before lower-level details. This is an advantage over previous approaches based on topic models, which present only groups of related keywords without a hierarc…
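The idea of presenting general topics before specific ones can be illustrated with a small sketch. This is not the paper's algorithm: the `Topic` class, the `summarize` function, and the example keywords below are hypothetical and hand-written, and a real hierarchy would come from a topic model fitted to source code. The sketch only shows a breadth-first listing that prints the most general level first and stops at a chosen depth.

```python
from __future__ import annotations

from collections import deque
from dataclasses import dataclass, field


@dataclass
class Topic:
    """Hypothetical hierarchy node: a few keywords plus more specific subtopics."""
    keywords: list[str]
    children: list[Topic] = field(default_factory=list)


def summarize(root: Topic, max_depth: int = 1) -> list[str]:
    """List topics level by level, most general first, down to max_depth."""
    lines = []
    queue = deque([(root, 0)])
    while queue:
        topic, depth = queue.popleft()
        if depth > max_depth:
            continue  # deeper, lower-level topics are left out of the summary
        lines.append("  " * depth + ", ".join(topic.keywords))
        queue.extend((child, depth + 1) for child in topic.children)
    return lines


# Invented example data, for illustration only (not output of any model).
example = Topic(
    ["draw", "figure", "editor"],
    [
        Topic(["tool", "handle"], [Topic(["mouse", "drag"])]),
        Topic(["storage", "file"]),
    ],
)

for line in summarize(example, max_depth=1):
    print(line)
```

Raising `max_depth` reveals lower-level detail; keeping it small gives the reader only the highest-level functionality.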

Cited by 39 publications (20 citation statements)
References: 25 publications
“…Few researches are found which have used NLP based techniques to create the extractive summaries. Paul W. McBurney et al [22] have used Topic Models to create the source code summaries focusing on the presentation of summaries. They used HDTM algorithm extract the hierarchy of topics from the source code.…”
Section: Extractive Summarization: An Overview In the Context Of Soft…
Citation type: mentioning
confidence: 99%
“…Even though it can be used for generating abstractive summaries as well but most of the researches are for extractive summaries only. [22] Source Code Topic Models(HDTM) JHotdraw, jajuk, jEdit, jTopas, nanoXML, siena Fowkes et al [24] Source Code Topic Models (extension of TopicSum)…”
Section: Extractive Summarization: An Overview In the Context Of Soft…
Citation type: mentioning
confidence: 99%
“…22 * is "equal to" this StatusLine */ 23 public boolean equals(Object obj){ 24 try { 25 StatusLine r = (StatusLine) obj; 26 if (r.getCode() == getCode()&& 27 r.getReason().equals(getReason())) 28 return true; Previous work in code summarization has considered summarization using: (a) program slicing (i.e. hiding irrelevant lines of code for a chosen program path) [10], [11]; (b) natural language paraphrases [12], [13]; (c) short lists of keywords [14]-[17]; or (d) (potentially discontiguous) lines of code that match a user's query [18]. In contrast, our work is based on the idea that an effective summary can be obtained by carefully folding the original file, summarizing code with code.…”
Section: Introduction
Citation type: mentioning
confidence: 99%