2012
DOI: 10.1002/smr.1564
|View full text |Cite
|
Sign up to set email alerts
|

Improving IR‐based traceability recovery via noun‐based indexing of software artifacts

Abstract: One of the most successful applications of textual analysis in software engineering is the use of information retrieval (IR) methods to reconstruct traceability links between software artifacts. Unfortunately, because of the limitations of both the humans developing artifacts and the IR techniques any IR-based traceability recovery method fails to retrieve some of the correct links, while on the other hand it also retrieves links that are not correct. This limitation has posed challenges for researchers that h… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
50
2
1

Year Published

2012
2012
2019
2019

Publication Types

Select...
5
2
2

Relationship

1
8

Authors

Journals

citations
Cited by 61 publications
(54 citation statements)
references
References 60 publications
(117 reference statements)
1
50
2
1
Order By: Relevance
“…Most proposed approaches for automatic bug assignment recommendations remove noise by only using general preprocessing steps of the natural language processing (NLP), such as removing stop words and non-alphabetic tokens. However, Capobianco et al [18] showed that using only unigram noun terms significantly improves the accuracy of IR-based traceability recovery method. Unlike other parts of speech, such as verbs and adjectives, nouns are usually used in a specific context.…”
Section: Extracted Entitiesmentioning
confidence: 99%
“…Most proposed approaches for automatic bug assignment recommendations remove noise by only using general preprocessing steps of the natural language processing (NLP), such as removing stop words and non-alphabetic tokens. However, Capobianco et al [18] showed that using only unigram noun terms significantly improves the accuracy of IR-based traceability recovery method. Unlike other parts of speech, such as verbs and adjectives, nouns are usually used in a specific context.…”
Section: Extracted Entitiesmentioning
confidence: 99%
“…The IR-based framework is widely used and the POS tagging technique has demonstrated to be effective for improving the performance [5,24]. Tian et al have investigated the effectiveness of seven POS taggers on sampled bug reports; the Stanford POS tagger and TreeTagger achieved the highest accuracy up to 90.5% [26].…”
Section: Part-of-speech Taggingmentioning
confidence: 99%
“…Dalam pencarian literatur yang telah ditemukan, kami menemukan banyak variasi-variasi pendekatan information retrieval. Misalnya penggunaan smoothing filtering pada IR [9], penggunaan noun-based indexing pada IR [10], Incremental LSI pada IR [11].…”
Section: Diskusiunclassified