2020
DOI: 10.48550/arxiv.2009.11340
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

The importance of fillers for text representations of speech transcripts

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
2
1

Relationship

3
0

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 0 publications
0
3
0
Order By: Relevance
“…• Emotion classification (Dinkar et al, 2020;Garcia et al, 2019;Colombo et al, 2021a;Witon et al, 2018) where score are continuous.…”
Section: B5 Possible Extensionsmentioning
confidence: 99%
“…• Emotion classification (Dinkar et al, 2020;Garcia et al, 2019;Colombo et al, 2021a;Witon et al, 2018) where score are continuous.…”
Section: B5 Possible Extensionsmentioning
confidence: 99%
“…In order to handle the general low agreement between annotators, the consensus between the annotators is then built by computing the root mean square (RMS), as made by Dinkar, Colombo, Labeau, and Clavel (2020).…”
Section: Scoresmentioning
confidence: 99%
“…Although achieving state-of-the-art (SOTA) results on written benchmarks (Wang et al, 2018), they are not tailored to spoken dialog (SD). Indeed, Tran et al (2019) have suggested that training a parser on conversational speech data can improve results, due to the discrepancy between spoken and written language (e.g., disfluencies (Stolcke and Shriberg, 1996), fillers (Shriberg, 1999;Dinkar et al, 2020), different data distribution). Furthermore, capturing discourse-level features, which distinguish dialog from other types of text (Thornbury and Slade, 2006), e.g., capturing multi-utterance dependencies, is key to embed dialog that is not explicitly present in pre-training objectives (Devlin et al, 2018;Yang et al, 2019;Liu et al, 2019), as they often treat sentences as a simple stream of tokens.…”
Section: Introductionmentioning
confidence: 99%