Fill the silence! Basics for modeling hesitation

Betz, Simon; Kosmala, Loulou

doi:10.21862/diss-09-004-betz-kosm

Cited by 12 publications

(14 citation statements)

References 7 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Literature on these distributional issues is much more scarce, some few exceptions to be mentioned include e.g. [Crible et al 2017], [Betz et al 2015], [Betz and Kosmala 2019], [Bóna 2019]. Our paper is in line with this distributional approach, which we advocate and check against Russian data.…”

Section: Introductionsupporting

confidence: 73%

“…In examples, we provide the session ID (04, 22, or 23) and IDs of the EDUs involved (in these IDs, N stands for Narrator, C for Commentator, and R for Reteller). Since we were primarily interested in how fluent and disfluent stretches are distributed across relatively long speech intervals, we concentrated on monologic fragments and didn't consider dialogical parts 3 . Overall, we annotated 32 mins of audio that contained 4,780 words.…”

Section: Datamentioning

confidence: 99%

See 1 more Smart Citation

Disfluencies in Russian Spoken Monologues: A Distributional Analysis

Korotaev¹,

Podlesskaya²,

Смирнова³

et al. 2020

Computational Linguistics and Intellectual Technologies

View full text Add to dashboard Cite

The paper addresses the overall distribution of speech disfluencies in Russian spoken monologic discourse: basing on corpus data, we investigate qualitatively and quantitatively how disfluencies of different types group (or do not group) with each other and how isolated disfluencies and their sequences are sandwiched with periods of fluent speech in the course of speech production. Self-repairs, filled and silent pauses, and instances of hesitation lengthening were annotated in a subcorpus of the “Russian Pears Chats and Stories” (RUPEX). A distribution-oriented typology of disfluencies was proposed that distinguishes between isolated disfluencies, disfluency clusters, and quasiclusters. We claim that disfluency tokens tend to cluster, as isolated occurrences are significantly less frequent in our data than it could have been expected basing on the relative frequency of tokens. This finding contradicts previous studies that treated disfluency clusters as a more marginal phenomenon, and emphasizes the importance of a distributional, rather than merely structural, approach to annotating disfluencies. Furthermore, individual types of disfluency tokens demonstrate significantly different distributional patterns. Compared to other types, self-repairs occur more often in isolation, while words with hesitation lengthening appear predominantly in clusters, and filled pauses most often group with silent pauses to form quasi-clusters.

show abstract

Section: Introductionsupporting

confidence: 73%

Section: Datamentioning

confidence: 99%

Disfluencies in Russian Spoken Monologues: A Distributional Analysis

Korotaev¹,

Podlesskaya²,

Смирнова³

et al. 2020

Computational Linguistics and Intellectual Technologies

View full text Add to dashboard Cite

show abstract

“…Let us now illustrate this model with an utterance taken from the SITAF Corpus (Horgues & Scheuer, 2015) which has been analyzed in detail in previous work on disfluency (Betz & Kosmala, 2019;Kosmala, 2021bKosmala, , 2021aKosmala et al, 2019).…”

Section: Production-oriented Models Of Disfluencymentioning

confidence: 99%

Rethinking (Dis)fluency Within the Scope of Interactional Linguistics and Gesture Studies

Kosmala¹

2022

Studia UBB Philosophia

Self Cite

View full text Add to dashboard Cite

"The study of so-called ‘disfluency’ phenomena (uh and um, filled and unfilled pauses, self-repairs and the like) has gained a lot of attention in various fields in linguistics in the past few decades, but a majority of studies tend to be production-oriented and often disregard fundamental aspects of face-to-face communication such as interactional dynamics and gesture. This paper presents a multimodal and multilevel model of “inter-fluency”, considering different levels of analysis, mainly, talk, gesture, and interaction, by combining different theoretical frameworks and methodologies in gesture studies and interactional linguistics in order to bridge this gap and go beyond previous cognitive-oriented models. Keywords: Interaction, fluency, gesture, multimodality, interactive model"

show abstract

“…Recent analyses have shown cross-linguistic differences in the duration of silences preceding and following fillers in an utterance. [18,17] found that there is an interplay between fillers and associated silences in English and German as silences following fillers are significantly longer compared with the preceding ones. In French, on the other hand, the silences preceding fillers have been found to be significantly longer [17].…”

Section: Introductionmentioning

confidence: 99%

Hesitations in Urdu/Hindi: Distribution and Properties of Fillers & Silences

Jabeen¹,

Betz²

2022

Interspeech 2022

View full text Add to dashboard Cite

This research presents an analysis of hesitations in Urdu/Hindi semi-spontaneous dialogues. We annotated and analyzed twenty-five minutes of speech to investigate the frequency of hesitations and the properties of fillers as well as the formants in fillers' vocalic intervals to determine their vowel quality. We found that our participants used fillers, silences, and prolongations with varying frequency. Moreover, Urdu/Hindi speakers used the fillers with only vocalic intervals (uh) more frequently than the ones with vocalic intervals followed by nasals (um). The regression analysis showed that the um type fillers were significantly longer and followed by longer silences as compared with the uh type fillers. Furthermore, the um types were placed more frequently at the turn medial position, whereas the uh type fillers occurred at turn initial or medial position with similar frequency. The analysis of their formants showed that the vocalic intervals used in the fillers differed from other vowels in the inventory of Urdu/Hindi. Our data confirms the existing claim that uh and um are two distinct types of fillers. Our results are relevant for developing speech synthesis systems for Urdu/Hindi as well as improving the existing models seeking to incorporate hesitations and fillers in a realistic manner.

show abstract

Fill the silence! Basics for modeling hesitation

Cited by 12 publications

References 7 publications

Disfluencies in Russian Spoken Monologues: A Distributional Analysis

Disfluencies in Russian Spoken Monologues: A Distributional Analysis

Rethinking (Dis)fluency Within the Scope of Interactional Linguistics and Gesture Studies

Hesitations in Urdu/Hindi: Distribution and Properties of Fillers & Silences

Contact Info

Product

Resources

About