2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE) 2019
DOI: 10.1109/ase.2019.00099
|View full text |Cite
|
Sign up to set email alerts
|

CLCDSA: Cross Language Code Clone Detection using Syntactical Features and API Documentation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
32
0
2

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 78 publications
(34 citation statements)
references
References 41 publications
0
32
0
2
Order By: Relevance
“…However, this way is limited for two reasons: (1) the library functions are OS dependent; (2) it fails to recognize the library calls that have different names yet with similar functionality (e.g.,memcpy and memmove) [39]. To address the above problems, inspired by CLCDSA [81], the similarity of cross-os library calls can be learned with the help of the documentation and Mikolov's Word2Vec [82] model.…”
Section: A Impacts Caused By Internal Reasonsmentioning
confidence: 99%
“…However, this way is limited for two reasons: (1) the library functions are OS dependent; (2) it fails to recognize the library calls that have different names yet with similar functionality (e.g.,memcpy and memmove) [39]. To address the above problems, inspired by CLCDSA [81], the similarity of cross-os library calls can be learned with the help of the documentation and Mikolov's Word2Vec [82] model.…”
Section: A Impacts Caused By Internal Reasonsmentioning
confidence: 99%
“…The approach is a semi-supervised machine learning model which is capable of detecting cross-language clones by employing a token level vector generation algorithm and tree-based skip-gram algorithm. This approach does not support more granular clone type classification (type 1, 2, 3, and 4) [25], which use action filters to filter out non-probable clones and make the model more scalable. This method has the limitation with respect to more granular clone classifications.…”
Section: Related Workmentioning
confidence: 99%
“…en, the tree structures are converted into token sequences or vectors to improve the efficiency of similarity measure. In addition, Nafi et al [30] combine the approaches of AST and attribute counting to detect the similarity of cross-language source code. However, the intermediate representation based on trees cannot represent the logical structure of the source code completely, such as the loop structure.…”
Section: Cross-language Source Code Similarity Detection Through Tree-based Intermediate Representationmentioning
confidence: 99%
“…Nafi et al [29] propose CLCDSA, which selects nine measurement attributes and obtain feature measurement values by traversing the AST (abstract syntax tree). Flores et al [30] propose DeSoCoRe to extract code features by tri-gram model and weights word frequency based on normalized term frequency. e similarity between codes is calculated by cosine similarity.…”
Section: Code Similarity Detection Effectiveness Comparisonmentioning
confidence: 99%
See 1 more Smart Citation