2018
DOI: 10.1007/978-3-030-01231-1_29
|View full text |Cite
|
Sign up to set email alerts
|

Simple Baselines for Human Pose Estimation and Tracking

Abstract: There has been significant progress on pose estimation and increasing interests on pose tracking in recent years. At the same time, the overall algorithm and system complexity increases as well, making the algorithm analysis and comparison more difficult. This work provides simple and effective baseline methods. They are helpful for inspiring and evaluating new ideas for the field. State-of-the-art results are achieved on challenging benchmarks. The code will be available at https://github. com/leoxiaobin/pose… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

8
1,715
1
7

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 1,635 publications
(1,731 citation statements)
references
References 25 publications
8
1,715
1
7
Order By: Relevance
“…We adopt the network proposed in [31] as our base network and use ResNet-152 as its backbone, which was pretrained on the ImageNet classification dataset. The input Table 1.…”
Section: Implementation Detailsmentioning
confidence: 99%
See 2 more Smart Citations
“…We adopt the network proposed in [31] as our base network and use ResNet-152 as its backbone, which was pretrained on the ImageNet classification dataset. The input Table 1.…”
Section: Implementation Detailsmentioning
confidence: 99%
“…Table 1 shows the results on the most important joints when we train, either only on the H36M dataset, or on a combination of the H36M and MPII datasets. It compares our approach with the baseline method [31], termed Single, which does not perform cross view feature fusion. We also compare with two baselines which compute sum or max values over the epipolar line using the camera parameters.…”
Section: Implementation Detailsmentioning
confidence: 99%
See 1 more Smart Citation
“…[22] utilises a Symmetric Spatial Transformer Network to handle inaccurate bounding boxes. [24] uses simple deconvolution layers to obtain high-resolution heatmaps for human pose estimation. On the side of bottom-up methods, [9] proposes a limb descriptor and an efficient bottom-up grouping approach to associate neighbouring joints.…”
Section: Data Annotationmentioning
confidence: 99%
“…A lot of practical problems, such as smart environments, human-computer interaction [1], augmented reality [2], virtual reality, human behaviour analysis and recognition [3], require the information of human body keypoints. Similar to many vision problems, deep learning based methods in the problem of human pose estimation had significant progress in the past few years [4]. While the problem of human pose estimation is in quick developing, we also notice that the network architecture has a tendency of becoming more deep and complex.…”
Section: Introductionmentioning
confidence: 99%