2020
DOI: 10.7494/csci.2020.21.1.3389
|View full text |Cite
|
Sign up to set email alerts
|

TF-IDF Inspired Detection for Cross-Language Source Code Plagiarism and Collusion

Abstract: Several computing courses allow students to choose which programming language they want to use for completing a programming task. This can lead to cross-language code plagiarism and collusion, in which the copied code file is rewritten in another programming language. In response to that, this paper proposes a detection technique which is able to accurately compare code files written in various programming languages, but with limited effort in accommodating such languages at development stage. The only languag… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(1 citation statement)
references
References 56 publications
0
1
0
Order By: Relevance
“…The proposed approach has three limitations A. finds only semantic similarities; B. does not classify cross-language clone types (Type 1,2, 3, and 4);C. does not support long and complex types of python; D. dead code elimination 3. A recent study on cross-language source code by (Karnalim, 2020) [23] does not handle identifier renaming; and since it works by converting the original code to intermediate code, the proposed approach is computationally complex. 4.…”
Section: Related Workmentioning
confidence: 99%
“…The proposed approach has three limitations A. finds only semantic similarities; B. does not classify cross-language clone types (Type 1,2, 3, and 4);C. does not support long and complex types of python; D. dead code elimination 3. A recent study on cross-language source code by (Karnalim, 2020) [23] does not handle identifier renaming; and since it works by converting the original code to intermediate code, the proposed approach is computationally complex. 4.…”
Section: Related Workmentioning
confidence: 99%