Proceedings of the Workshop on Stylistic Variation 2017
DOI: 10.18653/v1/w17-4914
|View full text |Cite
|
Sign up to set email alerts
|

Assessing the Stylistic Properties of Neurally Generated Text in Authorship Attribution

Abstract: Recent applications of neural language models have led to an increased interest in the automatic generation of natural language. However impressive, the evaluation of neurally generated text has so far remained rather informal and anecdotal. Here, we present an attempt at the systematic assessment of one aspect of the quality of neurally generated text. We focus on a specific aspect of neural language generation: its ability to reproduce authorial writing styles. Using established models for authorship attribu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
6
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 20 publications
1
6
0
Order By: Relevance
“…However, previous AA work largely focuses on authorship attribution among humans, while only a few papers (Manjavacas et al, 2017;Uchendu et al, 2020;Munir et al, 2021) study neural generated text. Our work aims to provide the first benchmark for Authorship Attribution in the form of the Turing Test by including humans and neural language models.…”
Section: Related Workmentioning
confidence: 99%
“…However, previous AA work largely focuses on authorship attribution among humans, while only a few papers (Manjavacas et al, 2017;Uchendu et al, 2020;Munir et al, 2021) study neural generated text. Our work aims to provide the first benchmark for Authorship Attribution in the form of the Turing Test by including humans and neural language models.…”
Section: Related Workmentioning
confidence: 99%
“…AA can also be applied to predicting author(s) of source code (Simko et al, 2018), chatbot detec-tion , and even detecting authors intentionally trying to mask their writing style (Juola, 2012;Sánchez-Junquera et al, 2020). Finally, our work bears similarity to (Manjavacas et al, 2017), which investigates the stylistic properties of different neural text generation techniques (i.e., Ngram-based and RNN-based).…”
Section: Applications Of Authorship Attributionmentioning
confidence: 99%
“…Alsulami et al [19] used Long shortterm memory (LSTM) for source code authorship classification with 200 source files from 10 programmers and gained 85.00% accuracy. Enrique et al [13] assessed the quality of neurally generated English text using an established authorship identification. Koppel et al [20] used a naive similaritybased techniques for authorship classification in English texts.…”
Section: A Non-bengali Language-based Authorship Classificationmentioning
confidence: 99%
“…To build this corpus, we crawled texts from four online sources namely NLTR society for natural language technology research [36], Ebanglalibrary [37], Git repository [38] and Blogs [39]- [41]. The maximum number of texts (13,308) are collected from NLTR source whereas minimum number of texts (240) are crawled from Blogs. A self-built automatic web crawler 3 is used to scrapping the data from four sources.…”
Section: ) Bengali Authorship Classification Corpus (Bacc-18)mentioning
confidence: 99%
See 1 more Smart Citation