Proceedings of the 5th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities An 2021
DOI: 10.18653/v1/2021.latechclfl-1.17
|View full text |Cite
|
Sign up to set email alerts
|

‘Tecnologica cosa’: Modeling Storyteller Personalities in Boccaccio’s ‘Decameron’

Abstract: We explore Boccaccio's Decameron to see how digital humanities tools can be used for tasks that have limited data in a language no longer in contemporary use: medieval Italian. We focus our analysis on the question: Do the different storytellers in the text exhibit distinct personalities? To answer this question, we curate and release a dataset based on the authoritative edition of the text. We use supervised classification methods to predict storytellers based on the stories they tell, confirming the difficul… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 24 publications
0
4
0
Order By: Relevance
“…We therefore looked to prior successful small-text topic analyses to inform our training procedure. Following recently published work [20], we use a Python wrapper of the MALLET library to train our model [5,59]. MALLET, unlike the more-popular Python-based gensim library, uses Gibbs sampling [32] for the LDA algorithm's underlying sampling method.…”
Section: A Analyzing Themes Via Topic Modelingmentioning
confidence: 99%
See 3 more Smart Citations
“…We therefore looked to prior successful small-text topic analyses to inform our training procedure. Following recently published work [20], we use a Python wrapper of the MALLET library to train our model [5,59]. MALLET, unlike the more-popular Python-based gensim library, uses Gibbs sampling [32] for the LDA algorithm's underlying sampling method.…”
Section: A Analyzing Themes Via Topic Modelingmentioning
confidence: 99%
“…MALLET, unlike the more-popular Python-based gensim library, uses Gibbs sampling [32] for the LDA algorithm's underlying sampling method. Gibbs sampling is an exact Markov chain Monte Carlo technique [15], which Cooper et al [20] notes has better performance for small-text corpora than inexact, variational-inference LDA implementations [45]. For the documents submitted to LDA in training, we chunk the FAccT papers into contiguous segments of 200 words, following the intuition concerning hyperparameter tuning for Gibbs-sampling-based LDA described in the well-cited paper by Griffiths and Steyvers [35].…”
Section: A Analyzing Themes Via Topic Modelingmentioning
confidence: 99%
See 2 more Smart Citations