2012 IEEE 12th International Working Conference on Source Code Analysis and Manipulation 2012
DOI: 10.1109/scam.2012.29
|View full text |Cite
|
Sign up to set email alerts
|

On the Use of Stemming for Concern Location and Bug Localization in Java

Abstract: Abstract-As the popularity of text-based source code search and analysis grows, the use of stemmers to strip suffixes has increased. Although widely investigated in the information retrieval community, the comparative effectiveness of stemmers in the domain of software is relatively unknown. In this paper, we investigate which of the well-known stemmers perform best in the domain of Java software for concern location and bug localization. For these two problems, we evaluate the use of stemming on over 500 sear… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
20
0

Year Published

2013
2013
2023
2023

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 31 publications
(21 citation statements)
references
References 33 publications
1
20
0
Order By: Relevance
“…They also claim that the accuracy of TR techniques is highly dependent on the configuration of the tool. Their claim is supported by recent research (e.g., [Biggers, Bocovich, Capshaw, Eddy, Etzkorn, and Kraft, 2012;Dit, Guerrouj, Poshyvanyk, and Antoniol, 2011;Hill, Rao, and Kak, 2012]). …”
Section: Overviewmentioning
confidence: 63%
“…They also claim that the accuracy of TR techniques is highly dependent on the configuration of the tool. Their claim is supported by recent research (e.g., [Biggers, Bocovich, Capshaw, Eddy, Etzkorn, and Kraft, 2012;Dit, Guerrouj, Poshyvanyk, and Antoniol, 2011;Hill, Rao, and Kak, 2012]). …”
Section: Overviewmentioning
confidence: 63%
“…We observe no significant difference among the three methods. In prior work, Hill et al [13] also observed that no single stemmer is better for all kinds of queries. While we choose Krovetz somewhat arbitrarily, as the more conservative of the two stemming algorithms, closer analysis here appears to be warranted to provide a fuller explanation.…”
Section: System Tuningmentioning
confidence: 94%
“…Emily Hill et al [78] have conducted qualitative study of stemmers on software domain, source code, specifically on java. To check the impact of stemming and retrieval effectiveness, they used Mean Average Precision (MAP) and Rank measure and conducted quantitative study of query-byquery.…”
Section: Content Analysismentioning
confidence: 99%