2009
DOI: 10.1007/978-3-642-00525-1_1
|View full text |Cite
|
Sign up to set email alerts
|

Multimodal Human Machine Interactions in Virtual and Augmented Reality

Abstract: Abstract. Virtual worlds are developing rapidly over the Internet. They are visited by avatars and staffed with Embodied Conversational Agents (ECAs). An avatar is a representation of a physical person. Each person controls one or several avatars and usually receives feedback from the virtual world on an audio-visual display. Ideally, all senses should be used to feel fully embedded in a virtual world. Sound, vision and sometimes touch are the available modalities. This paper reviews the technological developm… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2009
2009
2021
2021

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 9 publications
(2 citation statements)
references
References 66 publications
(75 reference statements)
0
2
0
Order By: Relevance
“…They further classified this method in identification, verification, detection, segmentation, clustering, and diarization and proposed issues regarding variability, insufficient data, background noise, and adversarial attacks. Chollet et al [129] reviewed technological developments in augmented reality worlds, emphasizing speech and gesture interfaces. They stressed that speaker verification could be used for authentication purposes before starting a dialogue, for example, regarding a bank transfer in a virtual world.…”
Section: Speech-basedmentioning
confidence: 99%
“…They further classified this method in identification, verification, detection, segmentation, clustering, and diarization and proposed issues regarding variability, insufficient data, background noise, and adversarial attacks. Chollet et al [129] reviewed technological developments in augmented reality worlds, emphasizing speech and gesture interfaces. They stressed that speaker verification could be used for authentication purposes before starting a dialogue, for example, regarding a bank transfer in a virtual world.…”
Section: Speech-basedmentioning
confidence: 99%
“…Given the complexity and the multimodal nature of the phenomenon, there has been a branching of engineering approaches toward the improvement and development of automatic video-audio processing, detection and synthesis techniques [23,61] with the goal of developing advanced mathematical models and algorithms for encoding/decoding emotional states from faces [27,65,102,130], speech [2,5,6,11,131] and/or body movements [16,24,77,79]. In order to succeed, the above research lines are seeking for deeper investigations and analyses of human interactional behaviors.…”
Section: Introductionmentioning
confidence: 99%