2005
DOI: 10.1007/s10055-005-0155-3
|View full text |Cite
|
Sign up to set email alerts
|

Untethered gesture acquisition and recognition for virtual world manipulation

Abstract: Humans use a combination of gesture and speech to interact with objects and usually do so more naturally without holding a device or pointer. We present a system that incorporates user body-pose estimation, gesture recognition and speech recognition for interaction in virtual reality environments. We describe a vision-based method for tracking the pose of a user in real time and introduce a technique that provides parameterized gesture recognition. More precisely, we train a support vector classifier to model … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2008
2008
2020
2020

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 22 publications
(13 citation statements)
references
References 35 publications
(42 reference statements)
0
13
0
Order By: Relevance
“…Table 1.1 illustrates how using multiple modalities, dialog management, context, and semantics improves recognition. Except for the research of Demirdjian et al [25], those using multiple HCI techniques have lower error rates than those using only a single HCI technique. Even the research of Demirdjian et al produced lower error rates when multiple techniques were used.…”
Section: Problem Statementmentioning
confidence: 88%
See 2 more Smart Citations
“…Table 1.1 illustrates how using multiple modalities, dialog management, context, and semantics improves recognition. Except for the research of Demirdjian et al [25], those using multiple HCI techniques have lower error rates than those using only a single HCI technique. Even the research of Demirdjian et al produced lower error rates when multiple techniques were used.…”
Section: Problem Statementmentioning
confidence: 88%
“…Error Rate Demirdjian et al [25] Vision & speech 0% Demirdjian et al [25] Speech 5% Demirdjian et al [25] Vision 8% Morency et al [61] Gesture & dialog context 8% Morency and Darrell [60] Gestures & dialog state 9% Quattoni et al [67] Vision & semantics 9% Wang and Demirdjian [86] Speech & gestures 12% Webb et al [87] Speech & dialog state 17% Metze et al [59] Speech, context, & gesture 17% Morency et al [61] Gesture 22% Saenko et al [72] Vision 34% Eisenstein and Davis [31] Linguistic context 34% Bugmann [13] Speech 40% However, these techniques have not been applied to the same extent in Human…”
Section: Hci Techniquesmentioning
confidence: 99%
See 1 more Smart Citation
“…From these works, it has been identified the following specified shortcomings [3,4,5,6,7,8,9,10,11,12,13,14,15]:…”
Section: Related Workmentioning
confidence: 99%
“…In [17], control theory is used to maintain a correspondence between model based feature points and depth points. In [18], Iterative Closest Point (ICP) is used to track a pose initialized using a hashing method. In [19], information in depth images and silhouettes is used in a learning framework to infer poses from a database.…”
Section: Related Workmentioning
confidence: 99%