2013
DOI: 10.1016/j.knosys.2013.06.018
|View full text |Cite
|
Sign up to set email alerts
|

Methods for cross-language plagiarism detection

Abstract: Three reasons make plagiarism across languages to be on the rise: (i) speakers of under-resourced languages often consult documentation in a foreign language, (ii) people immersed in a foreign country can still consult material written in their native language, and (iii) people are often interested in writing in a language different to their native one. Most efforts for automatically detecting cross-language plagiarism depend on a preliminary translation, which is not always available.In this paper we propose … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
53
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
5
3
2

Relationship

1
9

Authors

Journals

citations
Cited by 66 publications
(54 citation statements)
references
References 8 publications
1
53
0
Order By: Relevance
“…The main idea of the machine translation-based models consists in using MT tools to translate textual units into the same language (pivot language) in order to apply a monolingual comparison between them [5]. For this purpose, Kent and Salim [18] have used Google Translate API to translate texts, while Muhr et al [29] replace each word of the original text by its most likely translations in the target language.…”
Section: Cross-language Semantic Textual Similarity Detectionmentioning
confidence: 99%
“…The main idea of the machine translation-based models consists in using MT tools to translate textual units into the same language (pivot language) in order to apply a monolingual comparison between them [5]. For this purpose, Kent and Salim [18] have used Google Translate API to translate texts, while Muhr et al [29] replace each word of the original text by its most likely translations in the target language.…”
Section: Cross-language Semantic Textual Similarity Detectionmentioning
confidence: 99%
“…In recent years there have been a few approaches to cross-language similarity analysis that can be used for CL plagiarism detection [5]. A simple, yet effective approach is the cross-language character n-gram (CL-CNG) model [34].…”
Section: Obfuscation Via Translationmentioning
confidence: 99%
“…This is partly explained by the increased popularity of tools for collaboratively editing through contributors across the world, which eases the production of different language-written documents, leading to a new phenomenon of multilingual information overload. Analyzing multilingual document collections is getting increased attention as it can support a variety of tasks, such as building translation resources [20,14], detection of plagiarism in patent collections [1], cross-lingual document similarity and multilingual document classification [18,16,6,2,5].…”
Section: Introductionmentioning
confidence: 99%