2019 International Conference on 3D Vision (3DV) 2019
DOI: 10.1109/3dv.2019.00052
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Person 3D Human Pose Estimation from Monocular Images

Abstract: Multi-person 3D human pose estimation from a single image is a challenging problem, especially for in-the-wild settings due to the lack of 3D annotated data. We propose HG-RCNN, a Mask-RCNN based network that also leverages the benefits of the Hourglass architecture for multiperson 3D Human Pose Estimation. A two-staged approach is presented that first estimates the 2D keypoints in every Region of Interest (RoI) and then lifts the estimated keypoints to 3D. Finally, the estimated 3D poses are placed in camera-… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
37
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
5
3
2

Relationship

1
9

Authors

Journals

citations
Cited by 56 publications
(38 citation statements)
references
References 41 publications
0
37
0
Order By: Relevance
“…To that point, 3D models may prove more feature computation more robust to the angle of recording, and so it is possible that 3D reconstruction would improve the performance of this system. Recent studies have shown promise in estimating full 3D pose reconstruction based on data recorded using a single 2D camera [ 63 , 64 , 65 , 66 ], although challenges remain [ 21 ].…”
Section: Discussionmentioning
confidence: 99%
“…To that point, 3D models may prove more feature computation more robust to the angle of recording, and so it is possible that 3D reconstruction would improve the performance of this system. Recent studies have shown promise in estimating full 3D pose reconstruction based on data recorded using a single 2D camera [ 63 , 64 , 65 , 66 ], although challenges remain [ 21 ].…”
Section: Discussionmentioning
confidence: 99%
“…Several further algorithms use different variants of anatomical constraints for the human body (e.g., body symmetry) and show improved results in weakly-supervised [Dabral et al 2018;Wandt and Rosenhahn 2019] or even unsupervised 3D human pose estimation [Kovalenko et al 2019]. [Hassan et al 2019;] use geometric vicinity and collision avoidance constraints for the reconstruction of human-object interactions, and [Dabral et al 2019;Fabbri et al 2020;Mehta et al 2020;Rogez et al 2019;Zanfir et al 2018] can generalise to multiple persons in the scene.…”
Section: Related Workmentioning
confidence: 99%
“…Some preliminary works [11], [12] have proposed methods to recover a complete 3D pose from RGB images with promising results. However, even if these methods predict a good estimation of the pose, they fail to recover the correct positioning in the camera space as well as the real scale of the body [12]. Thus, the errors will affect the computation of the corresponding measures of body parts and limbs (e.g.…”
Section: Introductionmentioning
confidence: 99%