2014
DOI: 10.1007/978-3-642-54798-0_12
|View full text |Cite
|
Sign up to set email alerts
|

Knowledge Graphs as Context Models: Improving the Detection of Cross-Language Plagiarism with Paraphrasing

Abstract: Abstract. Cross-language plagiarism detection attempts to identify and extract automatically plagiarism among documents in different languages. Plagiarized fragments can be translated verbatim copies or may alter their structure to hide the copying, which is known as paraphrasing and is more difficult to detect. In order to improve the paraphrasing detection, we use a knowledge graph-based approach to obtain and compare context models of document fragments in different languages. Experimental results in German… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
11
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
4
1
1

Relationship

2
4

Authors

Journals

citations
Cited by 11 publications
(11 citation statements)
references
References 13 publications
0
11
0
Order By: Relevance
“…The Cross-Language Knowledge Graph Analysis (CL-KGA) model (Franco-Salvador et al, 2013) uses a multilingual semantic network to create knowledge graphs that model the context of documents. The model achieved interesting results for CL plagiarism detection, also in cases of paraphrasing (Franco-Salvador et al, 2014a). However, it left unanswered questions -relationship with CL-ESA, contributions of WSD, vocabulary expansion, etc.…”
Section: Cross-language Plagiarism Detectionmentioning
confidence: 95%
“…The Cross-Language Knowledge Graph Analysis (CL-KGA) model (Franco-Salvador et al, 2013) uses a multilingual semantic network to create knowledge graphs that model the context of documents. The model achieved interesting results for CL plagiarism detection, also in cases of paraphrasing (Franco-Salvador et al, 2014a). However, it left unanswered questions -relationship with CL-ESA, contributions of WSD, vocabulary expansion, etc.…”
Section: Cross-language Plagiarism Detectionmentioning
confidence: 95%
“…Evaluation Corpora In the author profiling task at PAN 2013 [58] participants approached the task of identifying age and gender in a large corpus collected from social media, and age was annotated with three classes: 10s (13)(14)(15)(16)(17), 20s (23)(24)(25)(26)(27), and 30s (33)(34)(35)(36)(37)(38)(39)(40)(41)(42)(43)(44)(45)(46)(47). At PAN 2014, we continued to study the gender and age aspects of the author profiling problem, however, four corpora of different genres were considered-social media, blogs, Twitter, and hotel reviews-both in English and Spanish.…”
Section: Author Profiling: How Writing Style Is Sharedmentioning
confidence: 99%
“…The granularity for CL-KGA is the closest to 1.0, the best possible value. A more detailed analysis can be found in [16].…”
Section: Evaluation Corpus and Measures In Our Evaluation We Use The mentioning
confidence: 99%
See 1 more Smart Citation
“…The Cross-Language Knowledge Graph Analysis (CL-KGA) model (FrancoSalvador et al, 2013a,b) uses a multilingual semantic network to create knowledge graphs that model the context of documents. The model achieved interesting results for CL plagiarism detection, also in cases of paraphrasing (Franco-Salvador et al, 2014a). However, it left unanswered questions -relationship with CL-ESA, contributions of WSD, vocabulary expansion, etc.…”
Section: Cross-language Plagiarism Detectionmentioning
confidence: 94%