2020
DOI: 10.1016/j.patcog.2019.107114
|View full text |Cite
|
Sign up to set email alerts
|

MobileFAN: Transferring deep hidden representation for face alignment

Abstract: Facial landmark detection is a crucial prerequisite for many face analysis applications. Deep learning-based methods currently dominate the approach of addressing the facial landmark detection. However, such works generally introduce a large number of parameters, resulting in high memory cost. In this paper, we aim for a lightweight as well as effective solution to facial landmark detection. To this end, we propose an effective lightweight model, namely Mobile Face Alignment Network (MobileFAN), using a simple… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
20
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2
2

Relationship

3
6

Authors

Journals

citations
Cited by 40 publications
(20 citation statements)
references
References 31 publications
0
20
0
Order By: Relevance
“…To obtain part-level features from the images, additional bounding boxes are used to mark the desire regions and train the feature detection model. Many strategies [28] have been proposed to detect similar objects (features) in images including YOLOv5 [12] and Fast R-CNN [29]. The complete training process of the feature detection module consists of the following two steps.…”
Section: A Feature Detection Modulementioning
confidence: 99%
“…To obtain part-level features from the images, additional bounding boxes are used to mark the desire regions and train the feature detection model. Many strategies [28] have been proposed to detect similar objects (features) in images including YOLOv5 [12] and Fast R-CNN [29]. The complete training process of the feature detection module consists of the following two steps.…”
Section: A Feature Detection Modulementioning
confidence: 99%
“…Existing methods for this task include random forest [1], [26] and cascaded shape regression [27], [28]. Convolutional neural networks (CNN) based regression has become a recently popular approach for keypoint localization [18], [29]- [31]. Two CNN-based architecture designs have emerged: direct coordinate regression [30], [32] and heatmap regression [29], [33], where the latter usually outperforms the former, due to the advantage of preserving higher spatial resolution for accurate localization.…”
Section: Related Work a Anatomical Landmark Detectionmentioning
confidence: 99%
“…One feasible solution to mitigate the challenge is to utilize backbone CNN architectures (e.g. VGG16 [14] or ResNet50 [15]) trained on a large-scale and diverse image dataset, such as VGGFace2 [16], which can be fine-tuned or specialized with additional task-specific layers to promote the optimization of facial anatomical landmark detection [17], [18]. Finetuning, as a common paradigm in transfer learning [19], aims to benefit the target task by providing a good initialization, but it can require exhaustive tuning or a set of ad-hoc hyper-parameters to achieve good performance [20], [21].…”
Section: Introductionmentioning
confidence: 99%
“…In computer vision, fine-grained visual categorization (FGVC) aims to classify the objects with small inter-class variances in which a clear difference may exist for different species, and has been extensively studied and made considerable progress in the past years [28], [10], [27], [29], [5], [26]. Ultra-fine-grained visual categorization (ultra-FGVC), however, focuses on classifying objects with more similar patterns among categories under a same class, and has been understudied [22], [20].…”
Section: Introductionmentioning
confidence: 99%