2008 15th IEEE International Conference on Image Processing 2008
DOI: 10.1109/icip.2008.4711852
|View full text |Cite
|
Sign up to set email alerts
|

A gaze prediction technique for open signed video content using a track before detect algorithm

Abstract: This paper proposes a gaze prediction model for open signed video content. A face detection algorithm is used to locate faces across each frame in both profile and frontal orientations. A grid-based likelihood ratio track before detect routine is used to predict the orientation of the signer's head, which allows the gaze location to be localised to either the signer or the inset. The face detections are then used to narrow down the gaze prediction further. The gaze predictor is able to predict the results of a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2009
2009
2014
2014

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 8 publications
0
2
0
Order By: Relevance
“…In one approach, the ROI is determined a priori based on saliency maps [11] obtained solely based on content analysis, typically using low-level video features such as spatial contrasts in luminance, temporal changes in motion, appearances of machine-recognized human faces, etc. It has been shown [12,13], however, that prior knowledge and context play important roles in affecting viewer's attention, and modeling these information when calculating saliency maps is a daunting task. In contrast, while we use video content to train HMM parameters during training phase, in operational phase we determine ROI based on real-time eye gaze tracking.…”
Section: Related Workmentioning
confidence: 99%
“…In one approach, the ROI is determined a priori based on saliency maps [11] obtained solely based on content analysis, typically using low-level video features such as spatial contrasts in luminance, temporal changes in motion, appearances of machine-recognized human faces, etc. It has been shown [12,13], however, that prior knowledge and context play important roles in affecting viewer's attention, and modeling these information when calculating saliency maps is a daunting task. In contrast, while we use video content to train HMM parameters during training phase, in operational phase we determine ROI based on real-time eye gaze tracking.…”
Section: Related Workmentioning
confidence: 99%
“…Nevertheless, it has been shown [39,40] that prior knowledge and context play important roles in affecting viewer's attention. Thus, video analysis can at best provide a rough estimate of where viewers may look, in the absence of real-time information.…”
Section: Roi-based Bit Allocation For Video Coding / Streamingmentioning
confidence: 99%