Proceedings of the 20th ACM International Conference on Multimodal Interaction 2018
DOI: 10.1145/3242969.3242972
|View full text |Cite
|
Sign up to set email alerts
|

Joint Discrete and Continuous Emotion Prediction Using Ensemble and End-to-End Approaches

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(6 citation statements)
references
References 34 publications
0
6
0
Order By: Relevance
“…These performances were obtained using an optimizable ensemble regressor. Our performances are better than those from the literature [11][12][13][14][15] (see Section 2), who performed more complex processing and feature extraction. In Table 1, the validation performances were evaluated by performing 5-fold cross validation across the training data.…”
Section: Discussion Of Resultsmentioning
confidence: 60%
See 1 more Smart Citation
“…These performances were obtained using an optimizable ensemble regressor. Our performances are better than those from the literature [11][12][13][14][15] (see Section 2), who performed more complex processing and feature extraction. In Table 1, the validation performances were evaluated by performing 5-fold cross validation across the training data.…”
Section: Discussion Of Resultsmentioning
confidence: 60%
“…They achieved a CCC of 0.413 and 0.527 on arousal and valence predictions, respectively. Albadawy et al [15] used the visual features provided by AVEC 2015, which included appearance (LGBP-TOP) and geometric (Euclidean distances between 49 facial landmarks) features. For arousal and valence predictions, they proposed a joint modelling strategy using a deep BiLSTM for ensemble and end-to-end models.…”
Section: Literature Overviewmentioning
confidence: 99%
“…log Mel filterbank LSTM [8] LSTM [28] 0.527(1) 0.504(2) BottleNeck LSTM [29] 0.533 0.466 eGeMAPS + BoAW RVM [10] 0.494 0.507 eGeMAPS ARX 0.540 0.502 It can be observed that ARX achieves comparable performances to the much more complex non-linear LSTM back-ends on all three databases. It is also worth noting that it outperforms the best LSTM system on the SEWA database, with 1.3% relative improvement for arousal.…”
Section: Resultsmentioning
confidence: 98%
“…While a significant number of regression modelling techniques have been proposed for emotion predictions, the most widely adopted models are Support Vector Regression [6] owing to its robustness, and Long-short term memory recurrent neural network (LSTM-RNN) [7][8][9] which captures the temporal dynamics of emotion. In addition, relevance vector machines (RVM) [10] and Gaussian mixture regression (GMR) [11] have also shown good performance in emotion prediction tasks.…”
Section: Introductionmentioning
confidence: 99%
“…Khorram et al [66] proposed two architectures for continuous emotions recognition-dilated CNN with a varying dilation factor for different layers and downsampling/upsampling CNN-with different ways of modelling long-term dependencies. AlBadawy and Kim [68] further improved the accuracy of valence with joint modelling of the discrete and continuous emotion labels. Table 4 summarises the top performances of the continuous SER systems tested on the RECOLA dataset.…”
Section: Temporal Variations Modellingmentioning
confidence: 99%