2021
DOI: 10.1007/978-3-030-74251-5_24
|View full text |Cite
|
Sign up to set email alerts
|

A Framework for Authorial Clustering of Shorter Texts in Latent Semantic Spaces

Abstract: Authorial clustering involves the grouping of documents written by the same author or team of authors without any prior positive examples of an author's writing style or thematic preferences. For authorial clustering on shorter texts (paragraph-length texts that are typically shorter than conventional documents), the document representation is particularly important. We propose a high-level framework which utilizes a compact data representation in a latent feature space derived with nonparametric topic modelin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 15 publications
0
1
0
Order By: Relevance
“…To the convenience of researchers, LAC can seamlessly work on a set of corpora, and it integrates the results of all datasets in one final output to enable more reproducible and reliable reporting of performance levels. Till now, LAC has enabled the work done by Trad and Spiliopoulou [10]. A next step would be to devise a customised similarity metric by which we can assess how similar two documents are in terms of writing style.…”
Section: Impact On Researchmentioning
confidence: 99%
“…To the convenience of researchers, LAC can seamlessly work on a set of corpora, and it integrates the results of all datasets in one final output to enable more reproducible and reliable reporting of performance levels. Till now, LAC has enabled the work done by Trad and Spiliopoulou [10]. A next step would be to devise a customised similarity metric by which we can assess how similar two documents are in terms of writing style.…”
Section: Impact On Researchmentioning
confidence: 99%