2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2017
DOI: 10.1109/icassp.2017.7953109
|View full text |Cite
|
Sign up to set email alerts
|

Prediction-based learning for continuous emotion recognition in speech

Abstract: In this paper, a prediction-based learning framework is proposed for a continuous prediction task of emotion recognition from speech, which is one of the key components of affective computing in multimedia. The main goal of this framework is to utmost exploit the individual advantages of different regression models cooperatively. To this end, we take two widely used regression models for example, i. e., support vector regression and bidirectional long short-term memory recurrent neural network. We concatenate … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

3
27
0
1

Year Published

2018
2018
2020
2020

Publication Types

Select...
3
3
2

Relationship

2
6

Authors

Journals

citations
Cited by 44 publications
(31 citation statements)
references
References 22 publications
3
27
0
1
Order By: Relevance
“…These positive PCCs suggest that the difficulty information can help improve the model performance in the learning process. This conclusion confirms our previous findings in [39]. Note that, in [39], the selected database has subjects that are different from the one in this article.…”
Section: Emotion Prediction With Dynamic Difficulty Awareness Traisupporting
confidence: 92%
See 1 more Smart Citation
“…These positive PCCs suggest that the difficulty information can help improve the model performance in the learning process. This conclusion confirms our previous findings in [39]. Note that, in [39], the selected database has subjects that are different from the one in this article.…”
Section: Emotion Prediction With Dynamic Difficulty Awareness Traisupporting
confidence: 92%
“…This conclusion confirms our previous findings in [39]. Note that, in [39], the selected database has subjects that are different from the one in this article. Similar observations can be found when calculating the PCCs between the PU and the performance improvement [i. e., PCC(µ, ∆ c ), as shown in the second three rows].…”
Section: Emotion Prediction With Dynamic Difficulty Awareness Traisupporting
confidence: 92%
“…Apart from these works that intend to explore innovative neural network architectures for SER, various advanced training strategies have been investigated as well. Furthermore, prediction-based learning was introduced in [7] to incorporate the strength of different models, and the reconstruction-error based learning was exam-ined in [8] to compensate the weakness of a neural network itself. In [9], CNNs and LSTM-RNNs were sophisticatedly concatenated into a joint framework, and trained in an end-to-end manner, i. e., by directly learning a suitable representation of the raw signal.…”
Section: Introductionmentioning
confidence: 99%
“…Whereas the presented conditional adversarial training framework utilises a cascaded structure, as used in our previous work, i. e., prediction-based learning [7] and reconstruction-error-based learning [8], it includes some specific advantages in comparison to those two approaches. The main idea of prediction-based learning is to take advantage of different models where predictions made by a first model are combined with the original features to learn a second model.…”
Section: Introductionmentioning
confidence: 99%
“…An extrovert is easy to share and express his emotion and be more social. Introverted people are difficult to express and disguise their sentiments making them socially reserved [2] [3].…”
Section: Introductionmentioning
confidence: 99%