2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW) 2021
DOI: 10.1109/iccvw54120.2021.00259
|View full text |Cite
|
Sign up to set email alerts
|

Learning Decoupled Representations for Human Pose Forecasting

Abstract: Human pose forecasting involves complex spatiotemporal interactions between body parts (e.g., arms, legs, spine). State-of-the-art approaches use Long Short-Term Memories (LSTMs) or Variational AutoEncoders (VAEs) to solve the problem. Yet, they do not effectively predict human motions when both global trajectory and local pose movements exist. We propose to learn decoupled representations for the global and local pose forecasting tasks. We also show that it is better to stop the prediction when the uncertaint… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
2

Relationship

2
6

Authors

Journals

citations
Cited by 16 publications
(12 citation statements)
references
References 43 publications
0
12
0
Order By: Relevance
“…Results of SPADE baseline [16] are obtained by re-training their model with their hyper-parameters publicly available. 2 c) Qualitative results: Qualitative results are shown in Figure 3 for Cistyscapes and in Figure 4 for CMP Facades. Having the semantic matching head and different feature maps, each focusing on a specific object, could generate more semantically consistent details, e.g., the windows and balconies are less blurry and with more details for the facades.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…Results of SPADE baseline [16] are obtained by re-training their model with their hyper-parameters publicly available. 2 c) Qualitative results: Qualitative results are shown in Figure 3 for Cistyscapes and in Figure 4 for CMP Facades. Having the semantic matching head and different feature maps, each focusing on a specific object, could generate more semantically consistent details, e.g., the windows and balconies are less blurry and with more details for the facades.…”
Section: Methodsmentioning
confidence: 99%
“…Take the joint distribution of training data as p * (x, s), the goal is to find an approximate joint distribution p θ (x, s). The full objective function was defined in Equation (1) and Equation (2). For simplicity, we ignore the reconstruction loss here and therefore the objective function is as follows:…”
Section: E Stabilizing the Trainingmentioning
confidence: 99%
See 1 more Smart Citation
“…In these models, each joint is represented as a node and each relation between joints as an edge. In the literature, this problem is still handled in single-modal setting, despite a recent attempt to better consider randomness [49].…”
Section: Multiple Trajectory Prediction In Roboticsmentioning
confidence: 99%
“…The scene plays an important role in vehicle trajectory prediction as it constrains the future positions of the agents. Therefore, modeling the scene is common in spite of some human trajectory prediction models [13,39]. In order to reason over the scene in the predictions, some suggested using a semantic segmented map to build circular distributions and outputting the most probable regions [21].…”
Section: Related Workmentioning
confidence: 99%