2017
DOI: 10.1016/j.patcog.2017.06.009
|View full text |Cite
|
Sign up to set email alerts
|

Head pose estimation in the wild using Convolutional Neural Networks and adaptive gradient methods

Abstract: Head pose estimation is an old problem that is recently receiving new attention because of possible applications in human-robot interaction, augmented reality and driving assistance. However, most of the existing work has been tested in controlled environments and is not robust enough for real-world applications. In order to handle these limitations we propose an approach based on Convolutional Neural Networks (CNNs) supplemented with the most recent techniques adopted from the deep learning community. We eval… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
111
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 196 publications
(111 citation statements)
references
References 33 publications
0
111
0
Order By: Relevance
“…In comparison, our ResNet18-64 has the lowest number of parameters while predicting more accurately than the LeNet-5 variant [25] and nearly as accurate as the ResNet50 [27]. Patacchiola and Cangelosi [25] also use low-resolution images with 64 x 64 pixels, while Ruiz et al [27] take larger images with 224 x 224 pixels. To improve the computational efficiency, we believe that low-resolution images are better suited for real-world applications.…”
Section: Methodsmentioning
confidence: 92%
See 4 more Smart Citations
“…In comparison, our ResNet18-64 has the lowest number of parameters while predicting more accurately than the LeNet-5 variant [25] and nearly as accurate as the ResNet50 [27]. Patacchiola and Cangelosi [25] also use low-resolution images with 64 x 64 pixels, while Ruiz et al [27] take larger images with 224 x 224 pixels. To improve the computational efficiency, we believe that low-resolution images are better suited for real-world applications.…”
Section: Methodsmentioning
confidence: 92%
“…The number of parameters of [27] is based on their provided open source implementation, which is executable on a GPU based system. 4 In order to compare the frame rate, we reimplemented the LeNet-5 variant of [25]. In comparison, our ResNet18-64 has the lowest number of parameters while predicting more accurately than the LeNet-5 variant [25] and nearly as accurate as the ResNet50 [27].…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations