2010 14th European Conference on Software Maintenance and Reengineering 2010
DOI: 10.1109/csmr.2010.31
|View full text |Cite
|
Sign up to set email alerts
|

Recognizing Words from Source Code Identifiers Using Speech Recognition Techniques

Abstract: Abstract-The existing software engineering literature has empirically shown that a proper choice of identifiers influences software understandability and maintainability. Researchers have noticed that identifiers are one of the most important source of information about program entities and that the semantic of identifiers guide the cognitive process.Recognizing the words forming identifiers is not an easy task when naming conventions (e.g., Camel Case) are not used or strictly followed and-or when these words… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
45
0

Year Published

2011
2011
2015
2015

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 58 publications
(48 citation statements)
references
References 24 publications
3
45
0
Order By: Relevance
“…Good quality identifiers must be understand-able, complete, and unambiguous. To obtain these properties, expert may use splitting/expansion [Madani et al(2010)Madani, Guerrouj, Di Penta, Guéhéneuc, and Antoniol] approach to split identifiers such as cmdpntr into cmd pntr and then expand the resulting words into command pointer. The results of the splitting/expansion approach have all the above-mentioned properties of good identifiers.…”
Section: Identification Of Preventive Measuresmentioning
confidence: 99%
“…Good quality identifiers must be understand-able, complete, and unambiguous. To obtain these properties, expert may use splitting/expansion [Madani et al(2010)Madani, Guerrouj, Di Penta, Guéhéneuc, and Antoniol] approach to split identifiers such as cmdpntr into cmd pntr and then expand the resulting words into command pointer. The results of the splitting/expansion approach have all the above-mentioned properties of good identifiers.…”
Section: Identification Of Preventive Measuresmentioning
confidence: 99%
“…The issue submitter (and patch creator as well) stated that this inconsistency was propagated to another plug-in (ApacheDS plug-ins for Studio) via a resource file (server.xml) and in the documentation as well. Even if a writer distinguishes the two words based on their definition consistently throughout the project [16], program readers may not precisely catch the slight difference between the words and this can eventually result in a misunderstanding. Similar issues are observed in a wide range of programs, such as in a pair of Real and Scala described in Issue 7 .…”
Section: Three Types Of Inconsistent Identifiersmentioning
confidence: 99%
“…For example, the word free can be used as a verb, adjective, and adverb in natural language while only the use as a verb of the word was observed in a specific software project. One of the real cases for this inconsistency is shown in an issue report 16 that indicates that the word return in a variable name returnString can be confusing since return is often used in a method identifier as a verb. Consequently, the corresponding patch 17 changes returnString to resultString.…”
Section: Three Types Of Inconsistent Identifiersmentioning
confidence: 99%
See 1 more Smart Citation
“…Current approaches to identifier name tokenisation [7,8,15] report accuracies of around 96% for the tokenisation of unique identifier names. However, some approaches ignore identifier names containing digits [8,15], or treat digits as discrete tokens [7].…”
Section: Introductionmentioning
confidence: 99%