2015
DOI: 10.1016/j.jss.2014.10.013
|View full text |Cite
|
Sign up to set email alerts
|

From source code identifiers to natural language terms

Abstract: a b s t r a c tProgram comprehension techniques often explore program identifiers, to infer knowledge about programs. The relevance of source code identifiers as one relevant source of information about programs is already established in the literature, as well as their direct impact on future comprehension tasks.Most programming languages enforce some constrains on identifiers strings (e.g., white spaces or commas are not allowed). Also, programmers often use word combinations and abbreviations, to devise str… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
10
0
3

Year Published

2015
2015
2022
2022

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 30 publications
(13 citation statements)
references
References 28 publications
0
10
0
3
Order By: Relevance
“…This is an expected result, since the identifiers are the basic element of each language. There are a lot of other works that build source code analysis methods based only on the source code identifiers . The next four elements are also present in most vectors.…”
Section: Experimental Evaluation Of Clustering Efficiencymentioning
confidence: 99%
“…This is an expected result, since the identifiers are the basic element of each language. There are a lot of other works that build source code analysis methods based only on the source code identifiers . The next four elements are also present in most vectors.…”
Section: Experimental Evaluation Of Clustering Efficiencymentioning
confidence: 99%
“…First, the identifier is retrieved by using the method retrieveIdentifier(). Once the identifier is available, the split algorithm is applied; in this case samurai() splitter is the used algorithm [11], [22], [12]. Samurai returns a set of words (that compound the identifier under analysis) which are searched in the domain specific dictionary (findInDictionary()) with the goal to verify if it is a valid word or not.…”
Section: Wsdludmentioning
confidence: 99%
“…Hard split dilakukan untuk memisahkan identifier berdasarkan karakter tertentu seperti tanda garis bawah (underscore) atau berdasarkan aturan penulisan camel case. Sementara itu, soft split akan memisahkan identifier yang tidak terlalu terlihat (tanda) pemisahnya [9]. Ilustrasi proses pemisahan dan ekspansi ini dapat dilihat pada Gambar 2.…”
Section: Normalisasi Dengan Lingua::idsplitterunclassified
“…Mekanisme pemisahan dan ekspansi identifier [9] [9] Otomata pada Gambar 3 digunakan untuk menghitung skor kandidat hasil pemisahan. Dengan melihat skor kandidat kata pada Tabel 1, didapatkan bahwa kandidat term time dan sort adalah term yang paling baik untuk menjadi hasil pemisahan atas identifier timesort.…”
Section: Normalisasi Dengan Lingua::idsplitterunclassified
See 1 more Smart Citation