2021
DOI: 10.1016/j.eng.2020.08.027
|View full text |Cite
|
Sign up to set email alerts
|

Gaze Estimation via a Differential Eyes’ Appearances Network with a Reference Grid

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
10
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 15 publications
(10 citation statements)
references
References 42 publications
0
10
0
Order By: Relevance
“…Literature [10] uses dilated convolution to expand the perception area of the model to acquire richer global information. Besides, dilated spatial convolution pooling pyramid [1] [11] is adopted to extract multi-scale global information. Ranftl et al [12] used Transformer to replace convolutional neural networks as feature extractors and applied it to dense prediction problems.…”
Section: Video Depth Estimationmentioning
confidence: 99%
See 1 more Smart Citation
“…Literature [10] uses dilated convolution to expand the perception area of the model to acquire richer global information. Besides, dilated spatial convolution pooling pyramid [1] [11] is adopted to extract multi-scale global information. Ranftl et al [12] used Transformer to replace convolutional neural networks as feature extractors and applied it to dense prediction problems.…”
Section: Video Depth Estimationmentioning
confidence: 99%
“…The development of video surveillance is manifested by the automatic identification and analysis of the action of humans in the scene [1]. However, surveillance videos are primarily captured with 2D monocular cameras, such that action recognition tasks can only be performed from 2D information, thus causing low accuracy or efficiency.…”
Section: Introductionmentioning
confidence: 99%
“…Appearance-based methods take face or eye images as input, and learn the mapping of face or eye images to gaze information by using a large number of training samples with ground-truth labels, thereby predicting the gaze information for new images using the trained model. Most methods are calibration-free, but some methods use a few calibration samples to optimize the model and reduce the impact of individual differences (Krafka et al, 2016 ; Gu et al, 2021 ; Liu G. et al, 2021 ; Wang et al, 2023 ). Liu G. et al ( 2021 ) trained a differential convolutional neural network to predict the gaze difference between two input eye images of a same subject, and the gaze direction of a new eye sample was predicted by inferring the gaze differences of a set of subject-specific calibration images.…”
Section: Introductionmentioning
confidence: 99%
“…Liu et al [ 25 ] propose a differential network (Diff-Nn) to address the gaze calibration problem by directly predicting the difference information between two images of the eyes. Gu et al [ 26 ] developed Diff-Nn for the gaze estimation using the left and right eye patch of one face simultaneously. Several other works mention that the performance based on the methods considering the difference information is directly affected by the number and the specific label of the inference image [ 25 , 26 ].…”
Section: Introductionmentioning
confidence: 99%
“…Gu et al [ 26 ] developed Diff-Nn for the gaze estimation using the left and right eye patch of one face simultaneously. Several other works mention that the performance based on the methods considering the difference information is directly affected by the number and the specific label of the inference image [ 25 , 26 ].…”
Section: Introductionmentioning
confidence: 99%