2015
DOI: 10.3390/socsci4030758
|View full text |Cite
|
Sign up to set email alerts
|

Hierarchical and Non-Hierarchical Linear and Non-Linear Clustering Methods to “Shakespeare Authorship Question”

Abstract: A few literary scholars have long claimed that Shakespeare did not write some of his best plays (history plays and tragedies) and proposed at one time or another various suspect authorship candidates. Most modern-day scholars of Shakespeare have rejected this claim, arguing that strong evidence that Shakespeare wrote the plays and poems being his name appears on them as the author. This has caused and led to an ongoing scholarly academic debate for quite some long time. Stylometry is a fast-growing field often… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
6
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 14 publications
(6 citation statements)
references
References 22 publications
0
6
0
Order By: Relevance
“…The results for NBM and SVM were taken from [17]. NBM and SVM were tested with the different feature representation types: ultimate style markers, documentlevel character n-grams (with n= [2,7]), function words, token n-grams (with n= [1,3]), token lemmas (with n= [1,3]), part-ofspeech tag n-grams (with n= [1,3]), and n-grams of concatenated lexical and morphological features. There is no single the best feature representation type: it depends on the classification method and the dataset size (for the best types see Table I).…”
Section: B Evaluationmentioning
confidence: 99%
See 1 more Smart Citation
“…The results for NBM and SVM were taken from [17]. NBM and SVM were tested with the different feature representation types: ultimate style markers, documentlevel character n-grams (with n= [2,7]), function words, token n-grams (with n= [1,3]), token lemmas (with n= [1,3]), part-ofspeech tag n-grams (with n= [1,3]), and n-grams of concatenated lexical and morphological features. There is no single the best feature representation type: it depends on the classification method and the dataset size (for the best types see Table I).…”
Section: B Evaluationmentioning
confidence: 99%
“…The majority of AP tasks are solved with the traditional machine learning methods and the weight vectors of features [3], [4]. The most influential examples of this field refer to Support Vector Machines (SVMs) [5], Multi-Class Real Winnow [6], Mean Proximity Clustering [7] and Holomorphic Transforms [8]. While a range of explored feature types usually covers stylistic (e.g., average sentence length, standardized type/token ratio), lexical (e.g., bag-of-words, function words), character (e.g., document or word-level character ngrams), morphological (e.g., part-of-speech tags) levels of feature representation types.…”
Section: Introduction and Related Workmentioning
confidence: 99%
“…Author identification in the context of the literary genre attracted attention beyond the NLP research circles, e.g., due to the work by Aljumily (2015), who addressed the allegations that Shakespeare did not write some of his best plays using clustering techniques with function word frequency, word n-grams and character n-grams.…”
Section: Related Workmentioning
confidence: 99%
“…For both, feature engineering is crucial and for both the tendency is to use word/character n-grams and/or function and stop word frequencies (Mosteller and Wallace, 1963;Aljumily, 2015;Gamon, 2004;Argamon et al, 2009), PoS tags (Koppel et al, 2002;Mukherjee and Liu, 2010), or patterns captured by context-free-grammar-derived linguistic patterns; see e.g. (Raghavan et al, 2010;Sarawgi et al, 2011;Gamon, 2004).…”
Section: Related Workmentioning
confidence: 99%
“…This assumption is very well-known and well-understood and explained, and there is an extremely long line of research on this subject (e.g. Aljumily, 2015;2017;Holmes, 1994Holmes, , 1998Holmes and Kardos, 2003). More details about the linguistic features for authorship attribution that have been proposed in the literature is discussed in, for example, Stamatatos (2009).…”
Section: Introductionmentioning
confidence: 97%