Facial expression recognition based on improved residual network

Zhang, Weiguang; Zhang, Xuguang; Tang, Yinggan

doi:10.1049/ipr2.12743

Cited by 17 publications

(6 citation statements)

References 41 publications

“…The improved residual neural network (RNN) is introduced for AFER in [13]. The degradation in network performance is effectively prevented by the designed RNN, which derives deep features and preserves the shallow ones.…”

Section: Related Work (mentioning)

confidence: 99%

“…The end-to-end training of DML-Net reduces multiple metric losses and suppresses overfitting, thereby enhancing the recognition rate. The improved residual neural network (RNN) is introduced for AFER in [13]. The degradation in network performance is effectively prevented by the designed RNN, which derives deep features and preserves the shallow ones.…”

Section: Related Work (mentioning)

confidence: 99%

Modelling appearance variations in expressive and neutral face image for automatic facial expression recognition

Kumar H N, M S, Asif Shah, et al. 2024

IET Image Processing

In automatic facial expression recognition (AFER) systems, modelling the spatio‐temporal feature information in a specific manner, coalescing, and its effective utilization is challenging. The state‐of‐the‐art studies have examined integrating multiple features to enhance the recognition rate of AFER systems. However, the feature variations between expressive and neutral face images are not fully explored to identify the expression class. The proposed research presents an innovative approach to AFER by modelling appearance variations in both expressive and neutral face images. The prominent contributions of the work are developing a novel and hybrid feature space by integrating the discriminative feature distribution derived from expressive and neutral face images; preserving the highly discriminative latent feature distribution using autoencoders. Local binary pattern (LBP) and histogram of oriented gradients (HOG) are the feature descriptors employed to derive the discriminative texture and shape information, respectively. The component‐based approach is employed, wherein the features are derived from the salient facial regions instead of the whole face. The three‐stage stacked deep convolutional autoencoder (SDCA) and multi‐class support vector machine (MSVM) are employed to address dimensionality reduction and classification, respectively. The efficacy of the proposed model is substantiated by empirical findings, which establish its superiority in terms of accuracy in AFER tasks on widely recognized benchmark datasets.

“…Guo et al proposed a double-code LBP-layer spatial attention network (DLSANet) that improved traditional recognition models by incorporating a double-code LBP (DLBP) layer and spatial attention network into the emotion network (ENet). This approach achieved superior results compared to contemporary facial emotion recognition methods on commonly used facial expression recognition datasets (Guo et al, 2023). Nan et al incorporated the MobileNetV1 model with an attention mechanism to enhance the model's ability to extract local features from facial expression samples.…”

Section: Related Work (mentioning)

confidence: 99%

A multi‐scale feature fusion convolutional neural network for facial expression recognition

Zhang, Fu, et al. 2023

Expert Systems

This paper designs a new facial Expression recognition network called a multi‐scale feature Fusion Convolutional neural Network (EFCN). This network is proposed to solve two problems in the facial expression recognition task. First, there are many commonalities between faces of different expression categories, and the recognition task cannot be precisely performed when the commonality is greater than the individuality. Secondly, facial detail features have a significant impact on the final results of expression recognition, while the image detail features extracted by traditional convolutional neural networks are not sufficient. In order to address the above issues, the feature enhancement network (FEN) and the detail information enhancement module (DEM) are designed. The FEN fuses deep and shallow features. Accordingly, the feature map contains richer information, making it easy to identify the samples. The DEM extracts and fuses the features passed by the backbone network with multi‐scale features to enhance the network's ability to extract features from small regions of the face. We validated the proposed method on three datasets, RAF‐DB, CK+, and JAFFE, and achieved 84.50%, 97.86%, and 91.05% accuracy, respectively, and the experimental results showed the effectiveness of the proposed method in this paper. For example, on the JAFFE dataset, the recognition accuracy of this method surpasses the MLT method by 1.87%.

“…Facial landmark detection, also known as face alignment, is an essential topic in computer vision and is widely used in many fields, such as face recognition [1], face reconstruction [2], and facial expression recognition [3]. Unlike the traditional features characterized by feature descriptors, facial landmarks include pupils, nose tips, and eye corners, which are visible to the naked eye and have human structural properties.…”

Section: Introduction (mentioning)

confidence: 99%

Robust facial landmark detection by probability‐guided hourglass network

Liang, Líu, et al. 2023

IET Image Processing

The absence of local features and global shape constraints severely limits the performance of the hourglass network for facial landmark detection in unconstrained environments. Moreover, diverse feature types and scales may result in low accuracy. This paper proposes a probability-guided hourglass network to enhance the shape constraints for robust facial landmark detection. Firstly, a multi-scale pre-processing module is designed to extract features at different scales. Secondly, based on the heatmaps generated by the stacked hourglass network, the coarse localizations are obtained, while the probability maps are generated with local features. Finally, a probability-based boundary regression method is proposed and the Hausdorff distance is modified as the loss function to constrain the feature shape. Adaptive weights are also added to the loss function, which can help relieve the data imbalance problem. Subjective and objective experimental results on the challenging datasets show that this method outperforms state-of-the-art methods under unconstrained conditions.
