Interspeech 2019 2019
DOI: 10.21437/interspeech.2019-1413
|View full text |Cite
|
Sign up to set email alerts
|

VESUS: A Crowd-Annotated Database to Study Emotion Production and Perception in Spoken English

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
16
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 16 publications
(16 citation statements)
references
References 18 publications
0
16
0
Order By: Relevance
“…In the Toronto Emotional Speech Set (TESS) [143], 2 speakers speak 200 words with 7 different emotions in the carrier phrase ("Say the word ..."). A recent VESUS database [144] is designed and released with over 250 distinct phrases, each read by ten actors in five emotional states. These databases mark valuable practice to understand the emotion variance in the word or phrase level, but may not be suitable to build a state-of-the-art emotional voice conversion framework that is usually data-driven.…”
Section: Lexical Variabilitymentioning
confidence: 99%
“…In the Toronto Emotional Speech Set (TESS) [143], 2 speakers speak 200 words with 7 different emotions in the carrier phrase ("Say the word ..."). A recent VESUS database [144] is designed and released with over 250 distinct phrases, each read by ten actors in five emotional states. These databases mark valuable practice to understand the emotion variance in the word or phrase level, but may not be suitable to build a state-of-the-art emotional voice conversion framework that is usually data-driven.…”
Section: Lexical Variabilitymentioning
confidence: 99%
“…The ability of the regressor to differentiate between emotions resp. place the emotions in the AV space was tested on ten publicly available databases: EmoDB [19], EMOVO [20], RAVDESS [21], CREMA-D [22], SAVEE [23], VESUS [24], eNTERFACE [25], JL Corpus [26], TESS [27], and GEES [28]. These databases are categorically annotated and do not include information on AV values.…”
Section: Testing Databasesmentioning
confidence: 99%
“…The content of the corpus is semantically constant to allow the tone of the delivery to play a greater role in predicting the emotion of an instance. The same methodology is also used in [42] while designing a database for the English language. Other dataset for English language include MSP-IPROV [7], RAVDESS [31], SAVEE [23] and VESUS [42].…”
Section: Related Workmentioning
confidence: 99%