Proceedings of the 14th ACM International Conference on Multimodal Interaction 2012
DOI: 10.1145/2388676.2388720
|View full text |Cite
|
Sign up to set email alerts
|

Investigating the midline effect for visual focus of attention recognition

Abstract: This paper addresses the recognition of people's visual focus of attention (VFOA), the discrete version of gaze indicating who is looking at whom or what. In absence of high definition images, we rely on people's head pose to recognize the VFOA. To the contrary of most previous works that assumed a fixed mapping between head pose directions and gaze target directions, we investigate novel gaze models documented in psychovision that produce a dynamic (temporal) mapping between them. This mapping accounts for tw… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
19
0

Year Published

2013
2013
2017
2017

Publication Types

Select...
4
2

Relationship

3
3

Authors

Journals

citations
Cited by 9 publications
(19 citation statements)
references
References 11 publications
0
19
0
Order By: Relevance
“…Experimental results have revealed considerable differences between statistical methods and non-statistical measurements [23][24][25][26]. The former mainly focus on appearance-based measurements, whereas the latter usually consider geometric relationship cues, such as the deviation of the nose from the mid-line and the deviation between the new head pose and the original state.…”
Section: Non-statistical Approachesmentioning
confidence: 99%
“…Experimental results have revealed considerable differences between statistical methods and non-statistical measurements [23][24][25][26]. The former mainly focus on appearance-based measurements, whereas the latter usually consider geometric relationship cues, such as the deviation of the nose from the mid-line and the deviation between the new head pose and the original state.…”
Section: Non-statistical Approachesmentioning
confidence: 99%
“…Often, researchers assume a fixed setting, with people facing the camera at a given distance and set time-independent means manually or through learning [2,4]. In this work, we follow the approach of [9] that leverages on body-head-gaze behavioral studies and better accounts for natural gaze shifts. Accordingly, the means were set dynamically as a combination of the direction in which the person should gaze to look at a target, and of the body orientation Rt (estimated as a proxy through the average of the past head poses).…”
Section: Conversation Aware Vfoa Recognitionmentioning
confidence: 99%
“…Accordingly, the means were set dynamically as a combination of the direction in which the person should gaze to look at a target, and of the body orientation Rt (estimated as a proxy through the average of the past head poses). See [9] for more details. Contextual prior.…”
Section: Conversation Aware Vfoa Recognitionmentioning
confidence: 99%
See 2 more Smart Citations