2023
DOI: 10.1109/tpami.2020.3029700
|View full text |Cite
|
Sign up to set email alerts
|

SelfPose: 3D Egocentric Pose Estimation From a Headset Mounted Camera

Abstract: We present a new solution to egocentric 3D body pose estimation from monocular images captured from a downward looking fish-eye camera installed on the rim of a head mounted virtual reality device. This unusual viewpoint leads to images with unique visual appearance, characterized by severe self-occlusions and strong perspective distortions that result in a drastic difference in resolution between lower and upper body. We propose a new encoder-decoder architecture with a novel multi-branch decoder designed spe… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

1
51
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 58 publications
(52 citation statements)
references
References 74 publications
(79 reference statements)
1
51
0
Order By: Relevance
“…Previous researchers used multiple body-mounted cameras for whole-body pose estimation [9] and used RGB-D cameras for upper-body (i.e., hands, arms, torso) motion estimation [10]. In recent years, methods for whole-body 3D pose estimation with more practical settings have been proposed, such as using a wearable camera with a wide viewing angle to capture more body parts [1][2][3][4]11]. Moreover, several studies have achieved 3D pose estimation under the severe condition in which the human body is completely hidden from the camera's line of sight [5,6,12].…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Previous researchers used multiple body-mounted cameras for whole-body pose estimation [9] and used RGB-D cameras for upper-body (i.e., hands, arms, torso) motion estimation [10]. In recent years, methods for whole-body 3D pose estimation with more practical settings have been proposed, such as using a wearable camera with a wide viewing angle to capture more body parts [1][2][3][4]11]. Moreover, several studies have achieved 3D pose estimation under the severe condition in which the human body is completely hidden from the camera's line of sight [5,6,12].…”
Section: Related Workmentioning
confidence: 99%
“…The majority of recent wearable camera-based 3D human pose estimation methods used fisheye cameras, expecting more body parts to be captured [1][2][3][4]. As the conventional datasets for 3D human pose estimation cannot be directly applied to these fisheye camera-based methods, these methods use synthetic datasets dedicated to each method to train their network.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Furthermore, there is a lack of real-world datasets with accurate ground truth data as the collection using a motion capture (MoCap) system is labor-intensive and limited to small laboratory settings. To partially address these issues, recent effort within the vision community has been directed to building public datasets using synthetic human models [13] [14]. However, training with synthetic datasets can affect the generalization capability of the model when subsequently applied to real-world scenarios.…”
Section: Introductionmentioning
confidence: 99%