2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.00358
|View full text |Cite
|
Sign up to set email alerts
|

Semantic Alignment: Finding Semantically Consistent Ground-Truth for Facial Landmark Detection

Abstract: Recently, deep learning based facial landmark detection has achieved great success. Despite this, we notice that the semantic ambiguity greatly degrades the detection performance. Specifically, the semantic ambiguity means that some landmarks (e.g. those evenly distributed along the face contour) do not have clear and accurate definition, causing inconsistent annotations by annotators. Accordingly, these inconsistent annotations, which are usually provided by public databases, commonly work as the groundtruth … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
52
0
1

Year Published

2020
2020
2022
2022

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 66 publications
(53 citation statements)
references
References 31 publications
0
52
0
1
Order By: Relevance
“…This is because with the increase of , the quantization error is reduced and more detailed spatial and temporal patterns are captured. For example, as increases from 64 to 768, the NRMSE of our method decreases by 15 com, cha, and full. The light-weight 1D heatmap allows us to fully boost its resolution despite limited space, and the quantization error is well alleviated.…”
Section: Experimental Results and Analysis Undermentioning
confidence: 89%
See 2 more Smart Citations
“…This is because with the increase of , the quantization error is reduced and more detailed spatial and temporal patterns are captured. For example, as increases from 64 to 768, the NRMSE of our method decreases by 15 com, cha, and full. The light-weight 1D heatmap allows us to fully boost its resolution despite limited space, and the quantization error is well alleviated.…”
Section: Experimental Results and Analysis Undermentioning
confidence: 89%
“…Chen et al [5] proposed a Conditional Random Field (CRF) method to embed geometric relationships among landmarks based on their heatmaps. Liu et al [15] proposed a heatmap correction unit which uses global shape constraints to refine heatmaps.…”
Section: Heatmap Regression Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Implementation Details In our experiments, we first use the four stacked hourglass networks as our backbone [37]. All the training and testing images are In the next section, we firstly compare our algorithm with the state-of-theart methods, such as 3DDFA [44], RAR [29], Wing [36], LAB [8], DU-Net [48], Liu et al [11], ODN [9], HRNet [49], Chandran et al [13] and LUVLi [12].…”
Section: Dataset and Implementation Detailsmentioning
confidence: 99%
“…For instance, the coordinate regression facial landmark detection methods [6,8,9] learn features from the whole face images and then regress to the landmark coordinates, which drives the models to learn the whole facial features in a common/normal way that cannot accurately model the differences of local details and the relationships among local details. Also, the heatmap regression facial landmark detection methods [10,11,12,13] generate a landmark heatmap for each landmark and then predict landmarks by traversing the corresponding landmark heatmaps.…”
Section: Introductionmentioning
confidence: 99%