2018
DOI: 10.1007/s10579-018-9427-x
|View full text |Cite
|
Sign up to set email alerts
|

ShEMO: a large-scale validated database for Persian speech emotion detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
4
2

Relationship

0
10

Authors

Journals

citations
Cited by 53 publications
(9 citation statements)
references
References 43 publications
0
9
0
Order By: Relevance
“…Samples of aggressive English speech ( n = 39) were taken from a corpus of recordings of British drama students, who were instructed to imagine that they were about to attack someone in a fight and to yell ‘That's enough, I'm coming for you!’ [25]. Recordings of aggressive Persian speech ( n = 43) were obtained from ShEMO—an open corpus of emotional speech compiled from radio plays [28]. In contrast with the lexically identical English recordings, the Persian utterances were taken from different contexts and were not repetitions of the same phrase.…”
Section: Methodsmentioning
confidence: 99%
“…Samples of aggressive English speech ( n = 39) were taken from a corpus of recordings of British drama students, who were instructed to imagine that they were about to attack someone in a fight and to yell ‘That's enough, I'm coming for you!’ [25]. Recordings of aggressive Persian speech ( n = 43) were obtained from ShEMO—an open corpus of emotional speech compiled from radio plays [28]. In contrast with the lexically identical English recordings, the Persian utterances were taken from different contexts and were not repetitions of the same phrase.…”
Section: Methodsmentioning
confidence: 99%
“…In this work, we used four popular English datasets (TESS [13], RAVEDESS [14], SAVEE [15], IEMOCAP [16]) and one German dataset (EMODB [17]) as source for pretraining. We selected three low-resource language datasets for adaption -Italian (EMOVO [18]), Persian (SHEMO [19]), and Urdu (URDU [20]). Table 1 lists down the corpus statistics for source and target datasets.…”
Section: Studied Languages and Datamentioning
confidence: 99%
“…6) ShEMO: Sharif Emotional Speech Database [27] is a Persian emotional speech dataset that contains 3000 seminatural utterances extracted from online radio plays and labeled considering the emotions anger, fear, happiness, sadness, surprise, and neutral state, by a group of 12 annotators of both sexes.…”
Section: A Speech Databasesmentioning
confidence: 99%