2021
DOI: 10.1109/tpami.2019.2949414
|View full text |Cite
|
Sign up to set email alerts
|

Forecasting People Trajectories and Head Poses by Jointly Reasoning on Tracklets and Vislets

Abstract: In this work, we explore the correlation between people trajectories and their head orientations. We argue that people trajectory and head pose forecasting can be modelled as a joint problem. Recent approaches on trajectory forecasting leverage short-term trajectories (aka tracklets) of pedestrians to predict their future paths. In addition, sociological cues, such as expected destination or pedestrian interaction, are often combined with tracklets. In this paper, we propose MiXing-LSTM (MX-LSTM) to capture th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
25
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
2
1
1

Relationship

1
9

Authors

Journals

citations
Cited by 43 publications
(25 citation statements)
references
References 87 publications
(174 reference statements)
0
25
0
Order By: Relevance
“…Our approach delivers stateof-the-art results as it won the first place at the Argoverse Motion Forecasting Challenge, as presented on the NeurIPS 2019 workshop on "Machine Learning for Autonomous Driving". In our future work, we want to explore multiple trajectory prediction for different domains such as human motion prediction, and extend existing approaches such as [34]. Also, we want to evaluate the potential of robust regression methods as proposed in [35] for the prediction.…”
Section: Discussionmentioning
confidence: 99%
“…Our approach delivers stateof-the-art results as it won the first place at the Argoverse Motion Forecasting Challenge, as presented on the NeurIPS 2019 workshop on "Machine Learning for Autonomous Driving". In our future work, we want to explore multiple trajectory prediction for different domains such as human motion prediction, and extend existing approaches such as [34]. Also, we want to evaluate the potential of robust regression methods as proposed in [35] for the prediction.…”
Section: Discussionmentioning
confidence: 99%
“…Bisagno et al [51] proposed to consider only pedestrians not belonging to the same group during social pooling. While modelling social interactions, Hasan et al [59], [60] based on domain knowledge, only consider the pedestrians in the visual frustum of attention [62]. Gupta et al [52] propose to encode neighbourhood information through the use of a permutation-invariant (symmetric) max-pooling function.…”
Section: Related Workmentioning
confidence: 99%
“…Neural Networks (NN) hold the promise to master the complexity of predicting the intention of humans in a relatively simple way. Similarly to out work, [13] modelled an area of visual attention and social interaction for the pedestrian, jointed to the head orientation, and used to strengthen the training of a Long Short Term Memory network (LSTM). A deep learning-based classifier is used in [14] to learn behaviour patterns from visual cues is mixed with a game theory model encoding the SFM to forecast the interaction between multiple pedestrians.…”
Section: A Related Workmentioning
confidence: 99%