Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020 2020
DOI: 10.1145/3383583.3398594
|View full text |Cite
|
Sign up to set email alerts
|

Cross-Language Source Code Plagiarism Detection using Explicit Semantic Analysis and Scored Greedy String Tilling

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 3 publications
0
4
0
Order By: Relevance
“…To extract the semantic similarity between the codes and mutants, the proposed technique uses the well‐known greedy string tiling (GST) 27 approach that traverses the string pairs to identify the semantic relationship between them. Hence, to regulate the process, the strings in the program code are generated into a pattern, and then the pattern is traversed element‐by‐element to mark the matching strings.…”
Section: Proposed Methodologymentioning
confidence: 99%
“…To extract the semantic similarity between the codes and mutants, the proposed technique uses the well‐known greedy string tiling (GST) 27 approach that traverses the string pairs to identify the semantic relationship between them. Hence, to regulate the process, the strings in the program code are generated into a pattern, and then the pattern is traversed element‐by‐element to mark the matching strings.…”
Section: Proposed Methodologymentioning
confidence: 99%
“…In such a manner, program similarity can be calculated in linear time. Cosine correlation is employed by Flores et al [38] and Foltynek et al [39]. Latent semantic analysis is employed by Ullah et al [40] and Cosma and Joy et al [41].…”
Section: Literature Reviewmentioning
confidence: 99%
“…On the other hand, the algorithm is known to be robust and precise 9 and still attracts attention within the research community. For example, a modified variant, the scored GST, is used in the cross‐lingual source code plagiarism detection system EsaGST, 20 based on explicit semantic analysis.…”
Section: Greedy String Tiling and Karp–rabin Algorithmsmentioning
confidence: 99%