2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021
DOI: 10.1109/iccv48922.2021.01216
|View full text |Cite
|
Sign up to set email alerts
|

EventHands: Real-Time Neural 3D Hand Pose Estimation from an Event Stream

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
54
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
1

Relationship

3
5

Authors

Journals

citations
Cited by 38 publications
(56 citation statements)
references
References 61 publications
1
54
0
Order By: Relevance
“…Novel View Synthesis from Event Streams. Unique properties of event cameras (i.e., low latency, ultra-low power consumption, high dynamic range and no motion blur) motivated their usage in Simultaneous Localisation and Mapping (SLAM) [48,19,35] and 3D reconstruction of hands [38,30] and human bodies [49,58]. Several event-based SLAM methods [19,35] rely on explicit feature matching in the event space and reconstruct sparse 3D models of environments from a single event stream.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Novel View Synthesis from Event Streams. Unique properties of event cameras (i.e., low latency, ultra-low power consumption, high dynamic range and no motion blur) motivated their usage in Simultaneous Localisation and Mapping (SLAM) [48,19,35] and 3D reconstruction of hands [38,30] and human bodies [49,58]. Several event-based SLAM methods [19,35] rely on explicit feature matching in the event space and reconstruct sparse 3D models of environments from a single event stream.…”
Section: Related Workmentioning
confidence: 99%
“…The analysis-by-synthesis method of Nevhi et al [30] operates on events only and supports arbitrary non-rigid objects but still requires a 3D mesh of the target object. EventHands [38] regresses sparse 3D hand keypoints and, thus, cannot be used for novel view synthesis of dense hand surfaces. In stark contrast to these works, we learn-for the first time-an implicit 3D scene representation from event streams that enables photo-realistic novel view synthesis in the RGB space.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…EventCap of Xu et al [29] is a hybrid approach that relies on deblurred greyscale images recorded at usual frame rates and an event stream from DAVIS240C to track a rigged 3D model of an actor. The recent work of Rudnev et al [22] proposes EventHands, i.e., a neural method for 3D hand reconstruction from a single event stream. In contrast to EventHands, our method does not require large corpora of training data, and works for general non-rigid objects with hands only being one example object.…”
Section: Related Workmentioning
confidence: 99%
“…Neural Hand Pose and Shape Estimation. Most approaches for 3D hand reconstruction and tracking from monocular RGB, depth and event cameras estimate hand poses only, i.e., a sparse set of 3D hand joints [1], [2], [3], [21], [22], [23]. The approaches which estimate hand shape and pose simultaneously are still in the minority.…”
Section: Related Workmentioning
confidence: 99%