2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.00118
|View full text |Cite
|
Sign up to set email alerts
|

FSA-Net: Learning Fine-Grained Structure Aggregation for Head Pose Estimation From a Single Image

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
244
0
1

Year Published

2019
2019
2021
2021

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 265 publications
(245 citation statements)
references
References 39 publications
0
244
0
1
Order By: Relevance
“…Facial information recovery aims to use high-level information to perform estimations and has gained great success in recent years. Yang et al [2] proposed a method for head estimation from a single image without landmark or depth information. On the basis of regression and feature aggregation, they learned a fine-grained structure mapping, and their results outperformed many state-of-the-art methods in terms of efficiency.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Facial information recovery aims to use high-level information to perform estimations and has gained great success in recent years. Yang et al [2] proposed a method for head estimation from a single image without landmark or depth information. On the basis of regression and feature aggregation, they learned a fine-grained structure mapping, and their results outperformed many state-of-the-art methods in terms of efficiency.…”
Section: Related Workmentioning
confidence: 99%
“…Multiview data were required in Refs. [2,4] as input, such as multiview images or cameras, which cannot be used for direct video tracking. The results in Refs.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…It showed that it can directly predict head rotation and highly outperform landmark-to-pose methods using state-of-the-art landmark detection methods. FSA-Net [47] provides attention for pose estimation and even proved to be a slight improvement over Hopenet [46].…”
Section: Gaze Estimationmentioning
confidence: 99%
“…The method to unite appearances was based on the kernel method of object tracking [46,47]. Consider that frame t i is the image frame at time t i .…”
Section: Uniting the Appearances Into A Sequencementioning
confidence: 99%