2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
DOI: 10.1109/cvpr52688.2022.01898
|View full text |Cite
|
Sign up to set email alerts
|

GaTector: A Unified Framework for Gaze Object Prediction

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 30 publications
(15 citation statements)
references
References 39 publications
0
15
0
Order By: Relevance
“…[37] dataset. Compared with GaTector [44], GTR achieves a 9.62 AP gain (+19.8%) for GO-D and 0.19 mAP gain (+43.5%) for joint GO-D and human head detection on GOO-Real subset. Moreover, GTR is designed as a single-stage and end-to-end pipeline while prior methods invariably require an additional object detector, which is a fine-tuned YOLO-V4 [1] by following standard protocols.…”
Section: Performance Comparison To State-of-the-art Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…[37] dataset. Compared with GaTector [44], GTR achieves a 9.62 AP gain (+19.8%) for GO-D and 0.19 mAP gain (+43.5%) for joint GO-D and human head detection on GOO-Real subset. Moreover, GTR is designed as a single-stage and end-to-end pipeline while prior methods invariably require an additional object detector, which is a fine-tuned YOLO-V4 [1] by following standard protocols.…”
Section: Performance Comparison To State-of-the-art Methodsmentioning
confidence: 99%
“…It also propose a new Gaze On Objects (GOO) dataset that is composed of a large set of synthetic images (GOO-Synth) supplemented by a smaller subset of real images (GOO-Real) of people looking at objects in a retail environment. On this basis, Wang et al [44] propose a new method GaTector for gaze object detection, which leveraged an additional object detector (YOLOV4 [1]) to recognize target objects.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In particular, we consider the visual selective attention (VSA) towards the scene, which is the process of directing the gaze to relevant visual stimuli while ignoring the irrelevant ones in the environment [9]. The task of detecting the target being looked at by a person in an image or video is known as attention target detection [2,10,18,22,24,26,37,45]. However, in the scenario presented in this work we have a special case of this task in which the interest targets are always outside the video frame.…”
Section: Gaze Analysismentioning
confidence: 99%
“…However, in the scenario presented in this work we have a special case of this task in which the interest targets are always outside the video frame. While in most cases the problem is tackled in the 2D image space [10,37,45], we decided to extend our approach to work in a fully 3D manner. The scenario consists of 3 shop windows, where a camera has been placed inside each one to record the outside.…”
Section: Gaze Analysismentioning
confidence: 99%