Proceedings of the 16th International Conference on Multimodal Interaction 2014
DOI: 10.1145/2663204.2663230
Identification of the Driver's Interest Point using a Head Pose Trajectory for Situated Dialog Systems

Abstract: This paper addresses issues existing in situated language understanding in a moving car. Particularly, we propose a method for understanding user queries regarding specific target buildings in their surroundings based on the driver's head pose and speech information. To identify a meaningful head pose motion related to the user query that is among spontaneous motions while driving, we construct a model describing the relationship between sequences of a driver's head pose and the relative direction to an intere…

Cited by 10 publications (5 citation statements) · References 10 publications
“…Moreover, referencing objects outside the vehicle has been investigated using different approaches and modalities. Rümelin et al. [30] used free-hand pointing gestures, Fujimura et al. [6] used hand-constrained pointing gestures, Kang et al. [11] used eye gaze gestures, while Kim et al. [13] and Misu et al. [17] used speech-triggered head pose trajectories. However, these studies focused on single-modality approaches that were lacking in performance.…”
Section: Related Work
confidence: 99%
“…EyePointing [35] makes use of finger pointing to trigger the selection of objects on a screen using gaze direction. Misu et al. [22] and Kim et al. [17] make use of head pose, with speech as a trigger, for driver queries about outside-vehicle objects.…”
Section: Related Work
confidence: 99%
“…Therefore, researchers have attempted various approaches for controlling objects using symbolic hand gestures [22,38,45,52,54], deictic (pointing) hand gestures [23,25,34], eye gaze [13,24,27,31,36,37,50,51,53], and facial expressions [3,18,46]. Specifically for the automotive domain, in-vehicle interaction has been attempted using hand gestures [5,16,33,40], eye gaze [36], and facial expressions [44], while outside-the-vehicle interaction has been attempted using pointing gestures [17,41], eye gaze [24], and head pose [26,30]. Although most of the previous methods focus on single-modality approaches while using a button or voice commands as event triggers, more recent work focused on multimodal fusion approaches to enhance performance.…”
Section: Related Work
confidence: 99%