2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.00582
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Person Pose Estimation With Enhanced Channel-Wise and Spatial Information

Abstract: Multi-person pose estimation is an important but challenging problem in computer vision. Although current approaches have achieved significant progress by fusing the multi-scale feature maps, they pay little attention to enhancing the channel-wise and spatial information of the feature maps. In this paper, we propose two novel modules to perform the enhancement of the information for the multi-person pose estimation. First, a Channel Shuffle Module (CSM) is proposed to adopt the channel shuffle operation on th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
72
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 156 publications
(72 citation statements)
references
References 24 publications
0
72
0
Order By: Relevance
“…Nowadays, researchers have made painstaking efforts [17,29,38,40] to accelerate its progress. For examples, CASNet [38] improves the feature representation via adopting the spatial and channel-wise attention. HRNet [40] builds a new strong baseline via elaborated network design.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Nowadays, researchers have made painstaking efforts [17,29,38,40] to accelerate its progress. For examples, CASNet [38] improves the feature representation via adopting the spatial and channel-wise attention. HRNet [40] builds a new strong baseline via elaborated network design.…”
Section: Related Workmentioning
confidence: 99%
“…Multi-person pose estimation devotes to locate body parts for multiple persons in 2D image, such as keypoints on the arms, torsos, and the face [38]. It's fundamental to deal with other high-level tasks, such as human action recognition [6] and human-computer interaction [32].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Nevertheless, the process of feature extraction at all scales will generate redundant information. As far as for the attention mechanism, Su et al [6] proposed a spatial, channel-wise attention residual bottleneck (SCARB) aiming to boost the original residual unit with the attention mechanism. However, SCARB makes spatial attention and channel attention separate, causing the feature maps cannot be maintained and transmitted completely.…”
Section: Hsinchuan Lin Jiaying Zhu and Haifeng Hu ✉mentioning
confidence: 99%
“…They can be used for recognition of human action [1][2][3], tracking [4,5] and re-identification [6] of human in online surveillance and human-object interaction [7]. Most of the latest algorithms on pose estimation tend to embed a human detector at the beginning of its data processing unit, such as [9][10][11] which ranked top three in COCO key-point challenge 2019. Those human detection-based algorithms are called top-down methods.…”
Section: Introductionmentioning
confidence: 99%