2021
DOI: 10.48550/arxiv.2106.04419
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Asymmetrical Bi-RNN for pedestrian trajectory encoding

Raphaël Rozenberg,
Joseph Gesnouin,
Fabien Moutarde

Abstract: Pedestrian motion behavior involves a combination of individual goals and social interactions with other agents. In this article, we present an asymmetrical bidirectional recurrent neural network architecture called U-RNN to encode pedestrian trajectories and evaluate its relevance to replace LSTMs for various forecasting models. Experimental results on the Trajnet++ benchmark show that the U-LSTM variant yields better results regarding every available metrics (ADE, FDE, Collision rate) than common trajectory … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
2

Relationship

2
0

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 24 publications
0
2
0
Order By: Relevance
“…Poses sequences are converted into 2D image-like spatio-temporal representations and selfspatio-temporal attention is applied via CNN-based models for multiple time resolutions. Each remaining feature is independently processed via either U-GRUs [45] or feed forward neural network and fused by either applying temporal and modality attention or sent to a fc layer to predict crossing behaviors.…”
Section: B Baselines and State-of-the-art Modelsmentioning
confidence: 99%
“…Poses sequences are converted into 2D image-like spatio-temporal representations and selfspatio-temporal attention is applied via CNN-based models for multiple time resolutions. Each remaining feature is independently processed via either U-GRUs [45] or feed forward neural network and fused by either applying temporal and modality attention or sent to a fc layer to predict crossing behaviors.…”
Section: B Baselines and State-of-the-art Modelsmentioning
confidence: 99%
“…• We secondly represent a skeleton sequence as its evolution of Euclidean pairwise distances of skeletal joints over time and encode them with U-GRUs [36]: a non-symmetrical bidirectional recurrent architecture designed to exploit the bidirectional temporal context and long-term temporal information for challenging skeletal dynamics having similar patterns but different outputs. This compensates for the inabilities of the first stream in learning temporal patterns invariant to locations and viewpoints.…”
Section: Introductionmentioning
confidence: 99%