Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge 2015
DOI: 10.1145/2808196.2811639
|View full text |Cite
|
Sign up to set email alerts
|

ETS System for AV+EC 2015 Challenge

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
16
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
7
2

Relationship

1
8

Authors

Journals

citations
Cited by 16 publications
(16 citation statements)
references
References 8 publications
0
16
0
Order By: Relevance
“…Even though the problem of emotion recognition has been extensively studied in the literature, we only focus on works that predicted dimensional values, continuously in time. Successful attempts to solving the continuous emotion recognition problem relied on DNNs [4], BLSTMs [5], and more commonly, support vector regression (SVR) classifiers [6]. With the exception of BLSTMs, such approaches do not incorporate longterm dependencies unless coupled with feature engineering.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Even though the problem of emotion recognition has been extensively studied in the literature, we only focus on works that predicted dimensional values, continuously in time. Successful attempts to solving the continuous emotion recognition problem relied on DNNs [4], BLSTMs [5], and more commonly, support vector regression (SVR) classifiers [6]. With the exception of BLSTMs, such approaches do not incorporate longterm dependencies unless coupled with feature engineering.…”
Section: Related Workmentioning
confidence: 99%
“…For instance, Valstar et al [2] showed that it was necessary to consider larger windows when making frame-level emotion predictions (four seconds for arousal and six seconds for valence). Le et al [3] and Cardinal et al [4] found that increasing the number of contextual frames when training a deep neural network (DNN) for making frame-level emotion predictions is helpful but only to a certain point. Bidirectional long short-term memory networks (BLSTMs) can naturally incorporate longterm temporal dependencies between features; explaining their success in continuous emotion recognition tasks (e.g., [5]).…”
Section: Introductionmentioning
confidence: 99%
“…In the last years, several works attempt to predict valence and arousal using machine learning algorithms [5], [10], [19], [1]. Eyben et al [5] proposed a fully automatic audiovisual recognition approach based on Long Short-Term Memory (LSTM) modeling of word-level audio and video features.…”
Section: Introductionmentioning
confidence: 99%
“…Automatic facial expression recognition has been investigated in the last years due to the great number of applications, ranging from human-computer interaction, emotion analysis [2] to detection of driver fatigue [24]. Several approaches based on handcrafted features [3], [14], [24], [25] such as textural, eigenfaces, etc.…”
Section: Introductionmentioning
confidence: 99%