Proceedings of the Fourth Workshop on Computational Linguistics for Literature 2015
DOI: 10.3115/v1/w15-0709
|View full text |Cite
|
Sign up to set email alerts
|

Towards a better understanding of Burrows's Delta in literary authorship attribution

Abstract: Burrows's Delta is the most established measure for stylometric difference in literary authorship attribution. Several improvements on the original Delta have been proposed. However, a recent empirical study showed that none of the proposed variants constitute a major improvement in terms of authorship attribution performance. With this paper, we try to improve our understanding of how and why these text distance measures work for authorship attribution. We evaluate the effects of standardization and vector no… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0
1

Year Published

2017
2017
2023
2023

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 12 publications
(8 citation statements)
references
References 16 publications
0
6
0
1
Order By: Relevance
“…8 Viz. the success story of the metric of Burrows's Delta (Burrows 2002), e. g., applied to German literary history by Jannidis and Lauer (2014), and scrutinized methodologically by Evert et al (2015). 9 See e. g., Underwood 2015, but also earlier work such as Biber and Finegan 1989.…”
Section: Preliminariesmentioning
confidence: 99%
“…8 Viz. the success story of the metric of Burrows's Delta (Burrows 2002), e. g., applied to German literary history by Jannidis and Lauer (2014), and scrutinized methodologically by Evert et al (2015). 9 See e. g., Underwood 2015, but also earlier work such as Biber and Finegan 1989.…”
Section: Preliminariesmentioning
confidence: 99%
“…While contemporary research varies in the type of features and algorithms, all have roots in observation that grammatical words are strong predictors of style and author [8] especially those occupying the top of the frequency list (following Zipf's law). Another method that revolutionized the field was Burrows's Delta [11] (later perfected by Evert et al [12,13]), which allowed for calculating differences between profiles of feature frequencies in a more balanced way than the one provided by Euclidean distances.…”
Section: Introductionmentioning
confidence: 99%
“…In addition to training classifiers, we also briefly explored using distance measures from stylistics and authorship attribution research such as Burrow's Delta (Burrows, 2002) and other related measures (Evert et al, 2015) using 100-1000 most frequent word, character and POS n-grams in the corpus. We did not find them particularly useful for this task, with highest accuracies of less than 60% on the development set.…”
Section: Mixed Word-pos Representationsmentioning
confidence: 99%