2021
DOI: 10.48550/arxiv.2105.06599
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

TriPose: A Weakly-Supervised 3D Human Pose Estimation via Triangulation from Video

Abstract: Estimating 3D human poses from video is a challenging problem. The lack of 3D human pose annotations is a major obstacle for supervised training and for generalization to unseen datasets. In this work, we address this problem by proposing a weakly-supervised training scheme that does not require 3D annotations or calibrated cameras. The proposed method relies on temporal information and triangulation. Using 2D poses from multiple views as the input, we first estimate the relative camera orientations and then g… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(4 citation statements)
references
References 36 publications
(118 reference statements)
0
4
0
Order By: Relevance
“…The accuracy of our single-frame model trained with 2D GT poses (Ours–MvP&P 🟉) matched that of the single-frame TriPose model. Like our framework, TriPose [ 38 ] is a monocular weakly-supervised training scheme that leverages multi-view 2D poses during training. Unlike our framework, TriPose estimates relative camera orientations, which are combined with input 2D poses from multiple views to triangulate a 3D pose.…”
Section: Experiments and Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…The accuracy of our single-frame model trained with 2D GT poses (Ours–MvP&P 🟉) matched that of the single-frame TriPose model. Like our framework, TriPose [ 38 ] is a monocular weakly-supervised training scheme that leverages multi-view 2D poses during training. Unlike our framework, TriPose estimates relative camera orientations, which are combined with input 2D poses from multiple views to triangulate a 3D pose.…”
Section: Experiments and Resultsmentioning
confidence: 99%
“…The following self-supervised works proposed different strategies for acquiring 3D pose annotations from multi-view 2D data. Gholami et al (TriPose) [ 38 ] triangulated a 3D pose given 2D poses from multiple views and estimated the relative orientation of poses. The triangulated 3D poses were then used as pseudo-annotations to train their 2D–3D pose lifting network.…”
Section: Related Workmentioning
confidence: 99%
“…Ref. [ 26 ] have developed a weakly supervised 3D pose estimation approach that combines temporal information and triangulation. They estimate the 3D pose by triangulating the location of body joints in each camera view.…”
Section: Related Workmentioning
confidence: 99%
“…Researchers have investigated multi-camera pose estimation techniques to overcome the limitations of single-camera and RGB-D camera pose estimation. These techniques, including triangulation and Kalman filtering, present potential solutions [ 24 , 25 , 26 , 27 , 28 ]. However, triangulation heavily relies on precise feature matching and assumes known camera parameters, making it vulnerable to occlusion and complex backgrounds.…”
Section: Introductionmentioning
confidence: 99%