2018
DOI: 10.1007/978-3-030-01219-9_12
|View full text |Cite
|
Sign up to set email alerts
|

Deeply Learned Compositional Models for Human Pose Estimation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

4
132
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 224 publications
(136 citation statements)
references
References 43 publications
4
132
0
Order By: Relevance
“…High-to-low and low-to-high. The high-to-low process aims to generate low-resolution and high-level representations, and the low-to-high process aims to produce highresolution representations [4,11,23,72,40,62]. Both the two processes are possibly repeated several times for boosting the performance [77,40,14].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…High-to-low and low-to-high. The high-to-low process aims to generate low-resolution and high-level representations, and the low-to-high process aims to produce highresolution representations [4,11,23,72,40,62]. Both the two processes are possibly repeated several times for boosting the performance [77,40,14].…”
Section: Related Workmentioning
confidence: 99%
“…The testing procedure is almost the same to that in COCO except that we adopt the standard testing strategy to use the provided person boxes instead of detected person boxes. Following [14,77,62], a six-scale pyramid testing procedure is performed. Evaluation metric.…”
Section: Mpii Human Pose Estimationmentioning
confidence: 99%
“…The spatial model for articulated pose is either based on tree-structured graphical models [7], [8], [9], [10], [11], [12], [13], which parametrically encode the spatial relationship between adjacent parts following a kinematic chain, or non-tree models [14], [15], [16], [17], [18] that augment the tree structure with additional edges to capture occlusion, symmetry, and longrange relationships. To obtain reliable local observations of body parts, Convolutional Neural Networks (CNNs) have been widely used, and have significantly boosted the accuracy on body pose estimation [19], [20], [21], [22], [23], [24], [25], [26], [27], [28], [29], [30], [31], [32]. Tompson et al [23] used a deep architecture with a graphical model whose parameters are learned jointly with the network.…”
Section: Single Person Pose Estimationmentioning
confidence: 99%
“…An upsample process can be used to gradually recover the high-resolution representations from the low-resolution representations. The upsample subnetwork could be a symmetric version of the downsample process (e.g., VGGNet), with skipping connection over some mirrored layers to transform the pooling indices, e.g., SegNet [3] and DeconvNet [85], or copying the feature maps, e.g., U-Net [95] and Hourglass [6], [7], [21], [24], [51], [83], [109], [131], [132], encoder-decoder [90], and so on. An extension of U-Net, full-resolution residual network [92], introduces an extra full-resolution stream that carries information at the full image resolution, to replace the skip connections, and each unit in the downsample and upsample subnetworks receives information from and sends information to the full-resolution stream.…”
Section: Related Workmentioning
confidence: 99%