2012
DOI: 10.1007/s11192-012-0781-y
|View full text |Cite
|
Sign up to set email alerts
|

Duplicate and fake publications in the scientific literature: how many SCIgen papers in computer science?

Abstract: International audienceTwo kinds of bibliographic tools are used to retrieve scientific publications and make them available online. For one kind, access is free as they store information made publicly available online. For the other kind, access fees are required as they are compiled on information provided by the major publishers of scientific literature. The former can easily be interfered with, but it is generally assumed that the latter guarantee the integrity of the data they sell. Unfortunately, duplicat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
70
0
7

Year Published

2014
2014
2023
2023

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 93 publications
(77 citation statements)
references
References 15 publications
0
70
0
7
Order By: Relevance
“…The title of one paper generated by the software was: "Simulating Flip-flop Gates Using Peer-to-peer Methodologies." At least 120 such computer-generated papers were published in peer-reviewed scientific journals (Labbé & Labbé, 2013;Lott, 2014). Juster (1972, p. 23) states, "Few people would accept the naïve nochange model even if it were clearly shown to be more accurate."…”
Section: Why Simplicity Repels and Complexity Luresmentioning
confidence: 99%
“…The title of one paper generated by the software was: "Simulating Flip-flop Gates Using Peer-to-peer Methodologies." At least 120 such computer-generated papers were published in peer-reviewed scientific journals (Labbé & Labbé, 2013;Lott, 2014). Juster (1972, p. 23) states, "Few people would accept the naïve nochange model even if it were clearly shown to be more accurate."…”
Section: Why Simplicity Repels and Complexity Luresmentioning
confidence: 99%
“…Many other machine-generated texts support for malicious purposes such as paper generation and fake review. Labbé and Labbé (2013) prove that artificial papers are produced by using abundant duplicated words and phrases. Therefore, they suggested an inter-textual distance to estimate the similarity between two word distributions and used the distance to recognize the machinegenerated text.…”
Section: Other Machine-generated Text Detectionmentioning
confidence: 94%
“…Other researchers prove that the coherence of the humanwritten text is better than the machine-translated one (Nguyen-Son et al, 2018, 2019. Beyond detecting machine-translated text, artificial fake reviews and papers are also recognized by readability (Juuti et al, 2018) and duplicate patterns (Labbé and Labbé, 2013), respectively. The limitation in all existing methods above is that they only analyze the intrinsic contents of machinegenerated texts but ignore the original processes which are used to produce the texts.…”
Section: Introductionmentioning
confidence: 99%
“…Here is a sentence from one abstract (since removed from IEEE Xplore): "Furthermore, it explored a pervasive tool for enabling pasteurization, which is used to show that context-free grammar and B-trees are largely compatible." Chatterbot output can now be detected automatically (12) and publishers find themselves, regrettably, forced to use such software, as well as anti-copying utilities.…”
Section: Consequencesmentioning
confidence: 99%