2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2017
DOI: 10.1109/icassp.2017.7952655
|View full text |Cite
|
Sign up to set email alerts
|

Using regional saliency for speech emotion recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
98
1

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 106 publications
(100 citation statements)
references
References 16 publications
1
98
1
Order By: Relevance
“…In recent times, several neural network approaches have been successfully applied for emotion classification. Researchers have propsoed convolutional neural network (CNN)-based models that are trained on speech utterances for performing identification [14,15,16]. There have been some successful approaches on using attention mechanisms as well [17,18].…”
Section: Recent Workmentioning
confidence: 99%
“…In recent times, several neural network approaches have been successfully applied for emotion classification. Researchers have propsoed convolutional neural network (CNN)-based models that are trained on speech utterances for performing identification [14,15,16]. There have been some successful approaches on using attention mechanisms as well [17,18].…”
Section: Recent Workmentioning
confidence: 99%
“…Following previous works [10,11], we choose 4 emotion types for our experiments (namely neutral, happy, angry and sad) from the improvised speech for study, since scripted data may contain undesired contextual information. Adopting the methodology of previous works [10,11,23,24], we perform a 10-fold cross-validation using a leave-oneout strategy. In each training process, 9 speakers are used as training data and the remained one is used for testing data.…”
Section: Experiments Settingmentioning
confidence: 99%
“…Furthermore, simply average-pooling or max-pooling may be insufficient to derive effective representations for complex emotional expressions that require analysis of higher order statistics. Some recent works show the benefit of introducing an attention mechanism for representation learning [12,13,10]. However, they generally derive salient regions from the features in a bottomup manner.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations