Facial expression recognition, as part of an affective computing system, is usually judged to be successful based on solid performance metrics. These metrics, however, depend significantly on the affective context in which the system is evaluated: a facial expression recognition model that performs excellently on the dataset it was trained on might fail drastically when assessed in a different scenario. Such performance drops occur because most facial perception models rely on an extreme notion of generalization, aiming at a universal emotion perception system. Given recent findings on the non-universality of emotional perception, generalizing facial encoders seems not to be the optimal path to take. Exploiting transfer learning to adapt specific facial features to specific scenarios could therefore address this problem. This paper proposes and investigates a Spatial Transformer Plugin (STN) that rearranges different facial encoders towards particular affective representations from different scenarios. We evaluate our model on eight facial expression recognition datasets (AffectNet and the derived MaskedAffectNet, OMG-Emotion, FERPlus, ElderReact, EmoReact, FABO, and JAFFE) and obtain competitive performance with much less training effort than state-of-the-art models. Beyond performance alone, we introduce the STN as a mechanism towards a non-universal emotional perception system and discuss how it rearranges learned perception features to address specific characteristics of each investigated dataset.
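At its core, a spatial transformer rearranges features by sampling them through a learned affine transform. As a minimal sketch of that sampling step (pure Python, not the paper's implementation; the function name, the fixed transform matrices, and the tiny feature map are illustrative assumptions), the grid-sampling that lets an STN warp an encoder's feature map could look like:

```python
import math

def affine_grid_sample(fmap, theta):
    """Warp a 2D feature map through a 2x3 affine matrix `theta`
    (STN-style): inverse-map each output cell through `theta` in
    normalized [-1, 1] coordinates, then bilinearly sample the input."""
    h, w = len(fmap), len(fmap[0])
    out = [[0.0] * w for _ in range(h)]

    def clamp(v, hi):
        # border-clamp indices so samples outside the map reuse edge values
        return max(0, min(hi, v))

    for i in range(h):
        for j in range(w):
            # normalized output coordinates in [-1, 1]
            y = -1.0 + 2.0 * i / (h - 1)
            x = -1.0 + 2.0 * j / (w - 1)
            # map through theta to source coordinates
            xs = theta[0][0] * x + theta[0][1] * y + theta[0][2]
            ys = theta[1][0] * x + theta[1][1] * y + theta[1][2]
            # back to pixel space
            fx = (xs + 1.0) * (w - 1) / 2.0
            fy = (ys + 1.0) * (h - 1) / 2.0
            x0, y0 = math.floor(fx), math.floor(fy)
            wx, wy = fx - x0, fy - y0
            # bilinear interpolation over the four neighboring cells
            v = 0.0
            for dy, ay in ((0, 1.0 - wy), (1, wy)):
                for dx, ax in ((0, 1.0 - wx), (1, wx)):
                    v += ay * ax * fmap[clamp(y0 + dy, h - 1)][clamp(x0 + dx, w - 1)]
            out[i][j] = v
    return out
```

In an actual STN, `theta` would be predicted per input by a small localization network rather than fixed, which is what allows the plugin to adapt a frozen encoder's features to a new scenario without retraining the encoder itself.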