2020
DOI: 10.1007/978-3-030-58529-7_36
|View full text |Cite
|
Sign up to set email alerts
|

Adversarial Semantic Data Augmentation for Human Pose Estimation

Abstract: Human pose estimation is the task of localizing body keypoints from still images. The state-of-the-art methods suffer from insufficient examples of challenging cases such as symmetric appearance, heavy occlusion and nearby person. To enlarge the amounts of challenging cases, previous methods augmented images by cropping and pasting image patches with weak semantics, which leads to unrealistic appearance and limited diversity. We instead propose Semantic Data Augmentation (SDA), a method that augments images by… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
27
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 52 publications
(27 citation statements)
references
References 31 publications
0
27
0
Order By: Relevance
“…An issue surrounding the 2D human pose estimation literature is that it is often difficult to make fair comparisons of model performance due to the heavy use of model-agnostic improvements. Examples include the use of different learning rate schedules [24], [49], more data augmentation [49], [50], loss functions that target more challenging keypoints [23], specialized post-processing steps [51], [52], or more accurate person detectors [49], [52]. These discrepancies in training algorithms can potentially account for the reported differences in accuracy.…”
Section: B 2d Human Pose Estimation Using Deep Learningmentioning
confidence: 99%
See 1 more Smart Citation
“…An issue surrounding the 2D human pose estimation literature is that it is often difficult to make fair comparisons of model performance due to the heavy use of model-agnostic improvements. Examples include the use of different learning rate schedules [24], [49], more data augmentation [49], [50], loss functions that target more challenging keypoints [23], specialized post-processing steps [51], [52], or more accurate person detectors [49], [52]. These discrepancies in training algorithms can potentially account for the reported differences in accuracy.…”
Section: B 2d Human Pose Estimation Using Deep Learningmentioning
confidence: 99%
“…. , 0.90, 0.95), AP 50 (AP at OKS = 0.50), AP 75 , AP M (medium objects), AP L (large objects), and AR (mean AR at OKS = 0.50, 0.55, . .…”
Section: ) Microsoft Cocomentioning
confidence: 99%
“…Human pose estimation is a field of active interest in computer vision. The currently best scoring approaches on common benchmarks like COCO [4] or MPII Human Pose [5] are based on convolutional neural networks [6,7]. In contrast to multi-stage approaches like Mask R-CNN [8] which find person bounding boxes at first and detect one keypoint of each type in the second step, these recent approaches find all keypoints in a single step.…”
Section: Related Workmentioning
confidence: 99%
“…In contrast to multi-stage approaches like Mask R-CNN [8] which find person bounding boxes at first and detect one keypoint of each type in the second step, these recent approaches find all keypoints in a single step. The underlying backbone used in many recent models like [6,7] is the High Resolution Net (HRNet) [9] and its variants for human pose estimation, e.g. [10].…”
Section: Related Workmentioning
confidence: 99%
“…In addition to improving the resolution of the hot spot map in the classification sub-network in different ways, Yanrui Bin et al [58] proposed Semantic Data Augmentation (SDA). By pasting segmented body parts with various semantic granularity, it can enhance the effect of human body detection in the image when the target in the detected image is seriously occluded.…”
Section: Related Workmentioning
confidence: 99%