GOO: A Dataset for Gaze Object Prediction in Retail Environments

Tomas, Henri; Reyes, Marcus; Dionido, Raimarc S.; Ty, Mark; Mirando, Jonric; Casimiro, Joel; Atienza, Rowel; Guinto, Richard

doi:10.1109/cvprw53098.2021.00349

Cited by 27 publications

(20 citation statements)

References 17 publications

“…First, the hypothetical gaze distribution model, as defined in our previous work [12], represents the concept of the object channel. The model has surpassed the existing benchmark Area Under the Curve (AUC) and Angular error baselines on the GOO dataset [8], showing the importance of the object channel. Second, the Face3D model represents the depth channel and the novel concept of remote gaze estimation in 3D vector space.…”

Section: Introduction (mentioning)

confidence: 90%
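
The angular error named in the statement above is commonly computed in gaze-following benchmarks as the angle between the predicted and ground-truth gaze directions, both measured from the same origin (for example, the head position). The sketch below illustrates that common definition only; the function name and the 2D point format are assumptions, and the cited paper's exact evaluation code is not reproduced here.

```python
import math


def angular_error_deg(origin, predicted, target):
    """Angle in degrees between the rays origin->predicted and origin->target.

    All arguments are (x, y) pairs in the same image coordinate system.
    This is a hypothetical helper, not code from the cited work.
    """
    px, py = predicted[0] - origin[0], predicted[1] - origin[1]
    tx, ty = target[0] - origin[0], target[1] - origin[1]
    norm_p = math.hypot(px, py)
    norm_t = math.hypot(tx, ty)
    if norm_p == 0 or norm_t == 0:
        raise ValueError("gaze direction is undefined for a zero-length vector")
    # Clamp to [-1, 1] so rounding error cannot push acos out of its domain.
    cos_angle = max(-1.0, min(1.0, (px * tx + py * ty) / (norm_p * norm_t)))
    return math.degrees(math.acos(cos_angle))


# Illustrative values: the prediction points right, the true target points up.
error = angular_error_deg(origin=(0.0, 0.0), predicted=(1.0, 0.0), target=(0.0, 1.0))
```

A lower value means the predicted gaze direction is closer to the annotated one; identical directions give an error of zero regardless of how far along the ray the predicted point lies.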

“…
… | Dataset | GTE | GF | GOP | GE in Retail | Approach/Technique
Bermejo et al [1] | UcoHead, Own dataset | ✓ | - | - | ✓ | CNN (Coarse-to-Fine)
Recasens et al [11] | Gaze Follow | ✓ | ✓ | - | - | CNN with shifted grids
Tomas et al [8] | GOO | ✓ | ✓ | ✓ | ✓ | Existing CNN models
Kellnhofer et al [17] | Gaze360 | ✓ | - | - | ✓ | CNN-LSTM
Fang et al [13] | Gaze360, Gaze Follow, VideoAttentionTarget | ✓ | - | - | - | Attention-based CNN
Chong et al [23] | Gaze Follow, VideoCoAtt, VideoAttentionTarget | ✓ | ✓ | - | - | CNN-LSTM
Lian et al [24] | Gaze Follow | ✓ | ✓ | - | - | Static-CNN
Kodama et al [22] | Own dataset | ✓ | ✓ | - | - | Static-CNN
Khamis et al [25] | Own…”

Section: Study (mentioning)

confidence: 99%

“…Then, to create the dual attention map, we aggregated the FoV attention map and the depth attention map as given in (7). After that, to enhance the saliency estimation, we aggregated this dual attention map with the hypothetical gaze distribution G (object channel) to create the hypo dual attention map as in (8), where ⊗ denotes the element-wise product.…”

Section: Depth Range Selector (mentioning)

confidence: 99%
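
The aggregation described in the statement above can be sketched with plain nested lists. Equations (7) and (8) are not shown on this page, so this is only an illustration: it assumes that both aggregation steps use the element-wise product ⊗ (the statement confirms this only for the step that produces the hypo dual attention map), and every map value and name below is hypothetical.

```python
def elementwise_product(a, b):
    """Element-wise (Hadamard) product of two equally shaped 2D maps."""
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("maps must have the same shape")
    return [[x * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


# Hypothetical 2x3 maps standing in for full-resolution attention maps.
fov_attention = [[0.0, 0.5, 1.0],
                 [0.2, 0.8, 0.4]]
depth_attention = [[1.0, 1.0, 0.0],
                   [0.5, 0.5, 0.5]]
gaze_distribution = [[0.1, 0.6, 0.9],   # object channel, G in the statement
                     [0.0, 1.0, 0.3]]

# Assumption: the FoV and depth maps are also combined by element-wise product.
dual_attention = elementwise_product(fov_attention, depth_attention)

# Combine with the object channel to obtain the hypo dual attention map.
hypo_dual_attention = elementwise_product(dual_attention, gaze_distribution)
```

With a multiplicative combination, a location keeps a high score only if every channel assigns it a high score; a zero in any single map suppresses that location in the result.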

“…However, the existing solutions capture only coarse touch-points of a shopper's journey and are vulnerable to unconstrained environment settings. With the adoption of computer vision technologies in gaze estimation, there have been eye-tracking-based solutions for customer behaviour analysis in retail as well [1], [8]. Moreover, there are solutions based on virtual reality devices and head-mounted displays, wearable eye-tracker-based solutions [9], and non-intrusive 3D eye-tracking solutions [10].…”

Section: Introduction (mentioning)

confidence: 99%

“…The concept of Gaze Following, introduced by Recasens et al [11], refers to identifying the object a person is looking at, given the scene image. Tomas et al [8] extended this concept to Gaze Object Prediction in retail, the task of predicting the bounding box of the object a person is gazing at. Both concepts require only gaze estimation from the scene image, which avoids the need to wear special devices to capture eye gaze and removes the need for manual calibration.…”

Section: Introduction (mentioning)

confidence: 99%

Customer Gaze Estimation in Retail Using Deep Learning

et al. 2022

At present, intelligent computing applications are widely used in different domains, including retail stores. The analysis of customer behaviour has become crucial for the benefit of both customers and retailers. In this regard, the novel concept of remote gaze estimation using deep learning has shown promising results in analyzing customer behaviour in retail due to its scalability, robustness, low cost, and uninterrupted nature. This study presents a three-stage, three-attention-based deep convolutional neural network for remote gaze estimation in retail using only image data. In the first stage, we design a mechanism to estimate the 3D gaze of the subject using image data and monocular depth estimation. The second stage presents a novel three-attention mechanism to estimate the gaze in the wild from field-of-view, depth range, and object channel attentions. The third stage generates the gaze saliency heatmap from the output attention map of the second stage. We train and evaluate the proposed model on the benchmark GOO-Real dataset and compare the results with baseline models. Further, we adapt our model to real-retail environments by introducing a novel Retail Gaze dataset. Extensive experiments demonstrate that our approach significantly improves remote gaze target estimation performance on GOO-Real and Retail Gaze datasets.

Index Terms: Computer vision, deep learning, gaze estimation, retail customer behaviour

Section: Introduction (mentioning)

confidence: 90%