2017
DOI: 10.11649/cs.1430
|View full text |Cite
|
Sign up to set email alerts
|

An open stylometric system based on multilevel text analysis

Abstract: An open stylometric system based on multilevel text analysisStylometric techniques are usually applied to a limited number of typical tasks, such as authorship attribution, genre analysis, or gender studies. However, they could be applied to several tasks beyond this canonical set, if only stylometric tools were more accessible to users from different areas of the humanities and social sciences. This paper presents a general idea, followed by a fully functional prototype of an open stylometric system that faci… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0
3

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 16 publications
(17 citation statements)
references
References 31 publications
0
14
0
3
Order By: Relevance
“…In stylometric authorship studies, researchers have generally used various statistical multivariate analysis techniques that range from frequency distribution (i.e. listing frequently used words) to discriminant analysis examining linguistic and stylistic features within texts that can be detectors of potential authors (Daelemans, 2013;Eder, Piasecki & Walkowiak, 2017;Gómez-Adorno, Posadas-Duran, Ríos-Toledo, Sidorov & Sierra, 2018;Hoover, 2003;Lagutina, Boychuk, Vorontsova & Paramonov, 2019). These statistical techniques have been used with different linguistic variables, including morphological, lexical, and syntactic variables (Burrows, 2007;Strome, 2013).…”
Section: Literature Reviewmentioning
confidence: 99%
“…In stylometric authorship studies, researchers have generally used various statistical multivariate analysis techniques that range from frequency distribution (i.e. listing frequently used words) to discriminant analysis examining linguistic and stylistic features within texts that can be detectors of potential authors (Daelemans, 2013;Eder, Piasecki & Walkowiak, 2017;Gómez-Adorno, Posadas-Duran, Ríos-Toledo, Sidorov & Sierra, 2018;Hoover, 2003;Lagutina, Boychuk, Vorontsova & Paramonov, 2019). These statistical techniques have been used with different linguistic variables, including morphological, lexical, and syntactic variables (Burrows, 2007;Strome, 2013).…”
Section: Literature Reviewmentioning
confidence: 99%
“…Stilometrija, statistična ali kvantitativna analiza stila, raziskuje po dobnosti in razlike med besedili na različnih jezikovnih ravninah (Eder et al 2017: 1), v prvi vrsti z namenom določanja avtorstva. Izhodišče stilometrije je, da je avtorjev stil nekaj nezavednega, česar ni mogoče zavestno usmerjati, obenem pa ima kvantitativne razlikovalne lastnosti.…”
Section: (Računalniško Podprta) Stilometrijaunclassified
“…Ko govorimo o stilometriji v okvirih digitalne humanistike, seveda govorimo o računalniško podprti stilometriji; kot taka je definirana kot del računalniških analiz besedil v okviru oddaljenega branja ali makroanalize, in sicer z upoštevanjem velikih (večjih) zbirk besedil, v katerih skuša najti razmerja in vzorce podobnosti in razlik, ki so »očesu človeškega bralca« skriti (Eder et al 2016: 108). Običajno se stilometrija opira na enostavne jezikovne značilnosti, ki jih lahko v besedilnem dokumentu določamo avtomatsko, kot na primer relativna frekventnost (najpogostejših) besed, raba in razporeditev ločil, povprečna dolžina povedi ali besed (Eder et al 2017: 1−2). Še vedno močno prednjači analiza na ravni leksike, vedno večja zmogljivost računalnikov in dostopnost besedil v elektronski obliki pa omo goča ta tudi skladenjske in pomenske stilometrične analize (Holmes, Kardos 2003: 6).…”
Section: (Računalniško Podprta) Stilometrijaunclassified
See 1 more Smart Citation
“…That opens possibilities for automatic categorisation of text documents in terms of the subject areas in any digital collection of documents. It is also an important problem for researchers from different areas of the humanities and social science (Eder et al, 2017).…”
Section: Introductionmentioning
confidence: 99%