2012
DOI: 10.1080/09296174.2012.659003
|View full text |Cite
|
Sign up to set email alerts
|

Authorship Attribution: A Comparative Study of Three Text Corpora and Three Languages

Abstract: The first objective of this paper is carry out three experiments intended to evaluate authorship attribution methods based on three test-collections available in three different languages (English, French, and German). In the first we represent and categorize 52 text excerpts written by nine authors and taken from 19th century English novels. In the second we work with 44 segments from French novels written by eleven authors, mostly from the 19th century. In the third we extract 59 German text excerpts from no… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2013
2013
2019
2019

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 15 publications
(11 citation statements)
references
References 42 publications
0
11
0
Order By: Relevance
“…On the other hand, we can apply an automatic part-ofspeech tagger. However, such an approach is not error-free, and some recent studies have compared the relative merits of these two text representation schemes for authorship attribution (Savoy, 2012;Miranda García and Calle Martín, 2012). Finally, we must mention that some natural languages may have other linguistic construction than those used in the English language.…”
Section: Practical Considerationsmentioning
confidence: 99%
“…On the other hand, we can apply an automatic part-ofspeech tagger. However, such an approach is not error-free, and some recent studies have compared the relative merits of these two text representation schemes for authorship attribution (Savoy, 2012;Miranda García and Calle Martín, 2012). Finally, we must mention that some natural languages may have other linguistic construction than those used in the English language.…”
Section: Practical Considerationsmentioning
confidence: 99%
“…New calibration and applications of this index are having particularly fruitful results (see e.g. [20], [54]). …”
Section: Related Workmentioning
confidence: 98%
“…type-token ratio), the words" length, the repeated segments of words, the position and recursion of specific keywords (see [38], [32], [64], [35], [61] ) for a recent literature review on the topic; see [54] for a presentation and discussion of the different methods as well as the related references. ).…”
Section: Quantitative Linguisticmentioning
confidence: 99%
See 1 more Smart Citation
“…Choosing an appropriate similarity measure is crucial for text clustering, but hundreds of different measures are available (Rudman, 1998), whereas no measure can be considered best suited for all applications. A pairwise measure of similarity should mirror the proximity of two texts but this numeric value depends on the lexical features of texts and on the measure itself (Tuzzi, 2010;Savoy, 2012).…”
Section: Introductionmentioning
confidence: 99%