2017 IEEE International Conference on Data Mining (ICDM) 2017
DOI: 10.1109/icdm.2017.52

Accurate Detection of Automatically Spun Content via Stylometric Analysis

citations
Cited by 11 publications
(14 citation statements)
references
References 19 publications
“…Word matching software such as Turnitin® (n.d.) has proven valuable in identifying replication of text from other sources. However, the very purpose of paraphrasing tools is to deceive software developed to detect plagiarism, and it is apparent that to date this strategy has been successful (Lancaster and Clarke 2009; Rogerson and McCarthy 2017; Shahid et al 2017). Consequently, the burden of detection remains with the human reader who has to become increasing adept at spotting stylistic variations and any other flags relating to mechanisms that have been used to avoid detection (Gillam et al 2010).…”
Section: Discussion (mentioning)
confidence: 99%
“…To create the spintax, a bank of potentially alternative terms is held in a synonym dictionary, which may be local to the paraphrasing tool, or held in cloud storage (Shahid et al 2017; Zhang et al 2014). In their study, Zhang et al (2014) were able to access this dictionary and reverse engineer two paraphrasing tools (Plagiarisma and The Best Spinner) to establish which words are subject to synonym substitution, referred to as 'mutables', and which words do not appear in the synonym dictionary and thus would not be included in the spintax, referred to as 'immutables'.…”
Section: Paraphrasing Tools (mentioning)
confidence: 99%
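
The mutable/immutable distinction described in the statement above can be illustrated with a minimal sketch. The synonym dictionary, the function names `split_mutables` and `to_spintax`, and the `{a|b|c}` notation used here are assumptions chosen for illustration; this is not the method of Shahid et al or Zhang et al, nor the behavior of any specific paraphrasing tool.

```python
import re

# Hypothetical synonym dictionary (an assumption for this sketch);
# a real tool would hold a much larger one locally or in cloud storage.
SYNONYMS = {
    "quick": ["fast", "rapid"],
    "happy": ["glad", "cheerful"],
}

WORD = re.compile(r"[A-Za-z']+")


def split_mutables(text, synonyms):
    """Partition the words of `text` into mutables (present in the
    synonym dictionary) and immutables (absent from it)."""
    words = [w.lower() for w in WORD.findall(text)]
    mutables = [w for w in words if w in synonyms]
    immutables = [w for w in words if w not in synonyms]
    return mutables, immutables


def to_spintax(text, synonyms):
    """Wrap each mutable word in {original|synonym|...} notation;
    immutable words pass through unchanged."""
    def repl(match):
        word = match.group(0)
        options = synonyms.get(word.lower())
        if not options:
            return word  # immutable: not in the dictionary
        return "{" + "|".join([word] + options) + "}"

    return WORD.sub(repl, text)
```

Under these assumptions, the immutable words are the part of a text that survives spinning unchanged, which is why they are useful as a signal when comparing a suspected spun text with a candidate source.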
“…AlSallal et al [9] used common words and content words with SVM to detect plagiarism. Shahid et al [42] used syntactic and lexical features with SVM to detect "spun" content.…”
Section: Machine Learning Methods For Plagiarism Detection (mentioning)
confidence: 99%
“…In deception detection, author verification has been used to identify fake reviews by providing evidence that reviews published under different aliases are actually written by the same author (Layton, Watters, & Ureche, ). This technology can also be used to enhance recommender systems (Vaz, Martins de Matos, & Martins, ), opinion mining (Panicheva, Cardiff, & Rosso, ), personalized spam e‐mail detection (Shams & Mercer, ), and spun content detection (Shahid et al, ).…”
Section: Introductionmentioning
confidence: 99%