2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2015
DOI: 10.1109/cvpr.2015.7299061
|View full text |Cite
|
Sign up to set email alerts
|

First-person pose recognition using egocentric workspaces

Abstract: We tackle the problem of estimating the 3D pose of an individual's upper limbs (arms+hands) from a chest mounted depth-camera. Importantly, we consider pose estimation during everyday interactions with objects. Past work shows that strong pose+viewpoint priors and depth-based features are crucial for robust performance. In egocentric views, hands and arms are observable within a well defined volume in front of the camera. We call this volume an egocentric workspace. A notable property is that hand appearance c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
80
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 93 publications
(80 citation statements)
references
References 34 publications
0
80
0
Order By: Relevance
“…Hand-object reconstruction. Joint reconstruction of hands and objects has been studied with multi-view RGB [2,42,74] and RGB-D input with either optimization [16,17,43,47,62,[69][70][71] or classification [51][52][53][54] approaches. These works use rigid objects, except for a few that use articulated [70] or deformable objects [69].…”
Section: Related Workmentioning
confidence: 99%
“…Hand-object reconstruction. Joint reconstruction of hands and objects has been studied with multi-view RGB [2,42,74] and RGB-D input with either optimization [16,17,43,47,62,[69][70][71] or classification [51][52][53][54] approaches. These works use rigid objects, except for a few that use articulated [70] or deformable objects [69].…”
Section: Related Workmentioning
confidence: 99%
“…Capturing full 3D body motion from head-mounted cameras is considerably more challenging. Some head-mounted capture systems are based on RGB-D input and reconstruct mostly hand, arm and torso motions [40,57]. Jiang and Grauman [20] reconstruct full body pose from footage taken from a camera worn on the chest by estimating egomotion from the observed scene, but their estimates lack accuracy and have high uncertainty.…”
Section: Related Workmentioning
confidence: 99%
“…An important difficulty in hand pose estimation lies in object occlusions and self-occlusions that make it hard to localize hidden joints/ parts of the hand. Some authors proposed the use of 3D cameras or depth sensors in conjunction with sensorbased techniques to train hand pose estimators more robust to self-occlusions [17], [97], [79], [69], [98]. However, as discussed above, the use of 3D imaging techniques might not be easily translated to FPV.…”
Section: Hand Pose Estimation and Fingertip Detectionmentioning
confidence: 99%
“…One of the advantages of using depth images for extracting the hand pose is the possibility to synthesize large training sets of realistic depth maps by using computer graphics [17], [97]. In [17], the authors tackled hand pose estimation as a multiclass classification problem by using a hierarchical cascade architecture.…”
Section: Hand Pose Estimation Using 3d/depth Sensorsmentioning
confidence: 99%
See 1 more Smart Citation