2020
DOI: 10.1007/978-3-030-58574-7_23
|View full text |Cite
|
Sign up to set email alerts
|

DOPE: Distillation of Part Experts for Whole-Body 3D Pose Estimation in the Wild

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 43 publications
(21 citation statements)
references
References 59 publications
0
21
0
Order By: Relevance
“…Most of these works [39], [47]- [52] focus on how to design the pose estimation network to improve the model accuracy, while ignoring the overall process in the practice of multi-person pose estimation task. The generation stage of human candidate frame has not been widely concerned.…”
Section: Deep Learning Modelmentioning
confidence: 99%
“…Most of these works [39], [47]- [52] focus on how to design the pose estimation network to improve the model accuracy, while ignoring the overall process in the practice of multi-person pose estimation task. The generation stage of human candidate frame has not been widely concerned.…”
Section: Deep Learning Modelmentioning
confidence: 99%
“…Since the DJSLC dataset [5] used by [12] is not publicly available to facilitate a comparison, we turn to the publicly available BSLCORPUS with boundary annotations and compare to our re-implementation of their approach. In contrast to [12], which assumes 3D skeletal information given by motion capture, we estimate the 3D pose coordinates with the recently proposed monocular DOPE model [35]. While the performance is bounded by the quality of the pose estimation, this ensures that the method is applicable to unconstrained sign language videos.…”
Section: Comparison To Prior Workmentioning
confidence: 99%
“…Knowledge distillation methods have been widely used in many vision tasks, including object detection [30,6,13], line detection [20], semantic segmentation [62,18,34] and human pose estimation [66,40,56,58]. DOPE [58] proposes to distill the 2D and 3D poses from three independent body part expert models to the single whole-body pose detection model. Nie et al [40] distill the pose kernels via leveraging temporal cues from the previous frame in a one-shot feed-forward manner.…”
Section: Related Workmentioning
confidence: 99%