2019
DOI: 10.1007/s11263-019-01275-0
|View full text |Cite
|
Sign up to set email alerts
|

Rectified Wing Loss for Efficient and Robust Facial Landmark Localisation with Convolutional Neural Networks

Abstract: Efficient and robust facial landmark localisation is crucial for the deployment of real-time face analysis systems. This paper presents a new loss function, namely Rectified Wing (RWing) loss, for regression-based facial landmark localisation with Convolutional Neural Networks (CNNs). We first systemically analyse different loss functions, including L2, L1 and smooth L1. The analysis suggests that the training of a network should pay more attention to small-medium errors. Motivated by this finding, we design a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
19
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 28 publications
(19 citation statements)
references
References 99 publications
0
19
0
Order By: Relevance
“…Optimization algorithm and hyper-parameters were the same to those used by Bulat and Tzimiropoulos to train the FAN model [35]. Our training algorithm used a recently introduced loss function, adaptive wing-loss, which improves model performance by penalizing small errors more than the traditional squared-loss [36], [52], [53] Twelve participants from each group were randomly selected and used to train the model. Data from the remaining participants were used to test the model performance by computing the accuracy in landmark localization.…”
Section: A Toronto Neuroface Datasetmentioning
confidence: 99%
See 1 more Smart Citation
“…Optimization algorithm and hyper-parameters were the same to those used by Bulat and Tzimiropoulos to train the FAN model [35]. Our training algorithm used a recently introduced loss function, adaptive wing-loss, which improves model performance by penalizing small errors more than the traditional squared-loss [36], [52], [53] Twelve participants from each group were randomly selected and used to train the model. Data from the remaining participants were used to test the model performance by computing the accuracy in landmark localization.…”
Section: A Toronto Neuroface Datasetmentioning
confidence: 99%
“…These databases often consist of thousands of photographs with a large variety of poses, expressions, illumination, backgrounds, and scales. Thus, pre-trained FA models are designed to provide accurate facial landmark localization under general conditions [31]- [36].…”
Section: Introductionmentioning
confidence: 99%
“…Different Learning Objectives Several variations of the ℓ 1 or ℓ 2 based loss functions designed for face alignment have exhibited excellent accuracy. Wing [35] and RWing [54] both focus on small range errors and switch the loss function from an ℓ 1 loss to a modified logarithm function. AWing [55] applies a similar idea to improve the quality of heat map regression results.…”
Section: Related Workmentioning
confidence: 99%
“…This alignment is carried out by first extracting facial landmarks and then performing an affine transformation based on the coordinates of five facial landmarks, i.e., eye centres, nose tip and mouth corners. There are many existing facial landmark detection algorithms, such as cascaded shape regression [41], [42] and CNN based methods [43]- [46]. However, standard facial landmark detection approaches such as MTCNN [43] do not work well for LR images, i.e., they fail to detect a large number of LR faces during training as well as testing.…”
Section: B Proposed Solutionmentioning
confidence: 99%