ConeSpeech: Exploring Directional Speech Interaction for Multi-Person Remote Communication in Virtual Reality

Yan, Yukang; Liu, Haohua; Shi, Yingtian; Guo, Ruici; Li, Zisu; Xu, Xuhai; Yu, Chun; Wang, Yuntao; Shi, Yuanchun

doi:10.1109/tvcg.2023.3247085

Cited by 5 publications

(2 citation statements)

References 46 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For example, largegroup collaboration with many-to-many participation and various roles may cater for multi-modal ways of joining and might create larger physical and/or digital spaces to house participants. While there are studies that examine many-to-many interactions in MR [33,59], multi-modal and mixed presence collaboration [28,52] or scaled spatial architecture [54], these evaluative studies either manage complexity by making participants co-located [33,54] or isolate a particular scale problem to evaluate [28,52].…”

Section: Large-scale Distributed Collaboration In Mixed Reality (Mr)mentioning

confidence: 99%

Practice-informed Patterns for Organising Large Groups in Distributed Mixed Reality Collaboration

Wong,

Sánchez Esquivel,

Leiva

et al. 2024

Proceedings of the CHI Conference on Human Factors in Computing Systems

View full text Add to dashboard Cite

show abstract

Section: Large-scale Distributed Collaboration In Mixed Reality (Mr)mentioning

confidence: 99%

Practice-informed Patterns for Organising Large Groups in Distributed Mixed Reality Collaboration

Wong,

Sánchez Esquivel,

Leiva

et al. 2024

Proceedings of the CHI Conference on Human Factors in Computing Systems

View full text Add to dashboard Cite

show abstract

“…As the cyber and physical spaces quickly merge, people exhibit a significant demand for information retrieval (IR) anywhere and anytime in their daily lives [10,14,15,22,56], no longer confined to a specific device or location. With advancements in the computational capabilities of wearable devices, the incorporation of a virtual assistant that can provide on-demand, in-situ answers to users' inquiries has the potential to greatly facilitate the interaction with surrounding targets [68,75] and enhance the naturalness of the user's information retrieval experience [3]. Specifically, smart glasses with gaze tracking open new possibilities for natural information retrieval techniques in daily scenarios by combining the voice and gaze modalities [35,62].…”

Section: Introductionmentioning

confidence: 99%

G-VOILA: Gaze-Facilitated Information Querying in Daily Scenarios

Wang,

Shi,

Wang

et al. 2024

Proc. ACM Interact. Mob. Wearable Ubiquitous Technol.

Self Cite

View full text Add to dashboard Cite

Modern information querying systems are progressively incorporating multimodal inputs like vision and audio. However, the integration of gaze --- a modality deeply linked to user intent and increasingly accessible via gaze-tracking wearables --- remains underexplored. This paper introduces a novel gaze-facilitated information querying paradigm, named G-VOILA, which synergizes users' gaze, visual field, and voice-based natural language queries to facilitate a more intuitive querying process. In a user-enactment study involving 21 participants in 3 daily scenarios (p = 21, scene = 3), we revealed the ambiguity in users' query language and a gaze-voice coordination pattern in users' natural query behaviors with G-VOILA. Based on the quantitative and qualitative findings, we developed a design framework for the G-VOILA paradigm, which effectively integrates the gaze data with the in-situ querying context. Then we implemented a G-VOILA proof-of-concept using cutting-edge deep learning techniques. A follow-up user study (p = 16, scene = 2) demonstrates its effectiveness by achieving both higher objective score and subjective score, compared to a baseline without gaze data. We further conducted interviews and provided insights for future gaze-facilitated information querying systems.

show abstract

Natural Language Processing for a Personalised Educational Experience in Virtual Reality

Alghamdi,

Cristea

2024

Communications in Computer and Information Science

View full text Add to dashboard Cite

ConeSpeech: Exploring Directional Speech Interaction for Multi-Person Remote Communication in Virtual Reality

Cited by 5 publications

References 46 publications

Practice-informed Patterns for Organising Large Groups in Distributed Mixed Reality Collaboration

Practice-informed Patterns for Organising Large Groups in Distributed Mixed Reality Collaboration

G-VOILA: Gaze-Facilitated Information Querying in Daily Scenarios

Natural Language Processing for a Personalised Educational Experience in Virtual Reality

Contact Info

Product

Resources

About