2019
DOI: 10.48550/arxiv.1901.00148
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Rethinking on Multi-Stage Networks for Human Pose Estimation

Abstract: Existing pose estimation approaches fall into two categories: single-stage and multi-stage methods. While multistage methods are seemingly more suited for the task, their performance in current practice is not as good as singlestage methods.This work studies this issue. We argue that the current multi-stage methods' unsatisfactory performance comes from the insufficiency in various design choices. We propose several improvements, including the single-stage module design, cross stage feature aggregation, and co… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
97
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 73 publications
(108 citation statements)
references
References 41 publications
0
97
0
Order By: Relevance
“…Heatmap-based pose estimation. Heatmap-based 2D pose estimation methods [2,3,6,7,14,21,25,27,36] estimate perpixel likelihoods for each keypoint location, and currently dominate in the field of 2D human pose estimation. A few works [2,25,27] attempt to design powerful backbone networks which can maintain high-resolution feature maps for heatmap supervision.…”
Section: Related Workmentioning
confidence: 99%
“…Heatmap-based pose estimation. Heatmap-based 2D pose estimation methods [2,3,6,7,14,21,25,27,36] estimate perpixel likelihoods for each keypoint location, and currently dominate in the field of 2D human pose estimation. A few works [2,25,27] attempt to design powerful backbone networks which can maintain high-resolution feature maps for heatmap supervision.…”
Section: Related Workmentioning
confidence: 99%
“…Recently, multi-stage networks [15,33,108,110] have attained promising results relative to the aforementioned single-stage models on the challenging deblurring and deraining tasks [22,33,108]. These multi-stage frameworks are generally inspired by their success on higher-level problems such as pose estimation [17,47], action segmentation [24,46], and image generation [113,114].…”
Section: Related Workmentioning
confidence: 99%
“…We deem full resolution processing [15,67,75] a better approach than a multi-patch hierarchy [81,108,110], since the latter would potentially induce boundary effects across patches. To impose stronger supervision, we apply a multi-scale approach [17,19,47] at each stage to help the network learn. We leverage the supervised attention module [108] to propagate attentive feature maps progressively along the stages.…”
Section: Multi-stage Multi-scale Frameworkmentioning
confidence: 99%
“…Most methods adopt deep convolutional neural network (CNN) as feature encoder owing to its great performance. In terms of the decoder part, existing approaches fall into two broad categories: heatmap-based [2,3,5,6,40,16,17,19,22,29,37,38] and regression-based [34,15,31,20,32,15] methods. The former is adopted in most cases.…”
Section: Introductionmentioning
confidence: 99%