2018
DOI: 10.1007/s11263-018-1095-1
|View full text |Cite
|
Sign up to set email alerts
|

RED-Net: A Recurrent Encoder–Decoder Network for Video-Based Face Alignment

Abstract: We propose a novel method for real-time face alignment in videos based on a recurrent encoder-decoder network model. Our proposed model predicts 2D facial point heat maps regularized by both detection and regression loss, while uniquely exploiting recurrent learning at both spatial and temporal dimensions. At the spatial level, we add a feedback loop connection between the combined output response map and the input, in order to enable iterative coarse-to-fine face alignment using a single network model, instea… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 19 publications
(8 citation statements)
references
References 54 publications
0
8
0
Order By: Relevance
“…To evaluate our method comprehensively and verify its effectiveness, we conducted extensive experiments and compared it with DIP and eight other state-of-the-art image denoising methods, including BM3D [2], NCSR [7], WNNM [5], DnCNN [11], FFDNet [12], TWSC [23], RED-Net [24], and CsNet [13]. We conducted denoising experiments on four datasets.…”
Section: Datasets and Experimental Setupmentioning
confidence: 99%
“…To evaluate our method comprehensively and verify its effectiveness, we conducted extensive experiments and compared it with DIP and eight other state-of-the-art image denoising methods, including BM3D [2], NCSR [7], WNNM [5], DnCNN [11], FFDNet [12], TWSC [23], RED-Net [24], and CsNet [13]. We conducted denoising experiments on four datasets.…”
Section: Datasets and Experimental Setupmentioning
confidence: 99%
“…A joint optimization of alignment and tracking over a cascade with DNN-based features was proposed in Khan et al [ 48 ]. Finally, end-to-end learning via DNNs was also applied to video, e.g., using recurrent neural networks [ 20 , 27 ] and two-stream transformer networks [ 24 ].…”
Section: Related Workmentioning
confidence: 99%
“…At first, the detection of pose change activates the alignment, preventing unnecessary processing when the head doesn’t move. Alignment is then performed by regression cascade of tree ensembles, exploiting their superior computational efficiency [ 9 , 10 ] with respect to (possibly more accurate) state-of-the-art alignment methods based on deep neural networks (DNNs) [ 5 , 11 , 12 , 13 , 14 , 15 , 16 , 17 , 18 , 19 , 20 , 21 , 22 , 23 , 24 , 25 , 26 , 27 , 28 , 29 , 30 , 31 ]. In a scenario where energy efficiency is a matter of the utmost importance, a DNN-based alignment pre-processor might eclipse the energy efficiency advantage of ECs.…”
Section: Introductionmentioning
confidence: 99%
“…Sun et al [27] presented an approach for detecting the positions of face keypoints with three-level carefully designed convolution networks. In addition to convolutional neural networks(CNN), recurrent neural networks(RNN) have also been developed on feature point detection, especially on facial feature point detection [25], [28], [29].…”
Section: Related Workmentioning
confidence: 99%