Asymmetrical Bi-RNN for pedestrian trajectory encoding

Rozenberg, Raphaël; Gesnouin, Joseph; Moutarde, Fabien

doi:10.48550/arxiv.2106.04419

Cited by 2 publications

(2 citation statements)

References 24 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Poses sequences are converted into 2D image-like spatio-temporal representations and selfspatio-temporal attention is applied via CNN-based models for multiple time resolutions. Each remaining feature is independently processed via either U-GRUs [45] or feed forward neural network and fused by either applying temporal and modality attention or sent to a fc layer to predict crossing behaviors.…”

Section: B Baselines and State-of-the-art Modelsmentioning

confidence: 99%

Assessing Cross-dataset Generalization of Pedestrian Crossing Predictors

Gesnouin¹,

Pechberti²,

Stanciulescu³

et al. 2022

Preprint

Self Cite

View full text Add to dashboard Cite

Pedestrian crossing prediction has been a topic of active research, resulting in many new algorithmic solutions. While measuring the overall progress of those solutions over time tends to be more and more established due to the new publicly available benchmark and standardized evaluation procedures, knowing how well existing predictors react to unseen data remains an unanswered question. This evaluation is imperative as serviceable crossing behavior predictors should be set to work in various scenarii without compromising pedestrian safety due to misprediction. To this end, we conduct a study based on direct cross-dataset evaluation. Our experiments show that current state-of-the-art pedestrian behavior predictors generalize poorly in cross-dataset evaluation scenarii, regardless of their robustness during a direct training-test set evaluation setting. In the light of what we observe, we argue that the future of pedestrian crossing prediction, e.g. reliable and generalizable implementations, should not be about tailoring models, trained with very little available data, and tested in a classical train-test scenario with the will to infer anything about their behavior in real life. It should be about evaluating models in a crossdataset setting while considering their uncertainty estimates under domain shift.

show abstract

Section: B Baselines and State-of-the-art Modelsmentioning

confidence: 99%

Assessing Cross-dataset Generalization of Pedestrian Crossing Predictors

Gesnouin¹,

Pechberti²,

Stanciulescu³

et al. 2022

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…• We secondly represent a skeleton sequence as its evolution of Euclidean pairwise distances of skeletal joints over time and encode them with U-GRUs [36]: a non-symmetrical bidirectional recurrent architecture designed to exploit the bidirectional temporal context and long-term temporal information for challenging skeletal dynamics having similar patterns but different outputs. This compensates for the inabilities of the first stream in learning temporal patterns invariant to locations and viewpoints.…”

Section: Introductionmentioning

confidence: 99%

TrouSPI-Net: Spatio-temporal attention on parallel atrous convolutions and U-GRUs for skeletal pedestrian crossing prediction

Gesnouin¹,

Pechberti²,

Stanciulescu³

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

Understanding the behaviors and intentions of pedestrians is still one of the main challenges for vehicle autonomy, as accurate predictions of their intentions can guarantee their safety and driving comfort of vehicles. In this paper, we address pedestrian crossing prediction in urban traffic environments by linking the dynamics of a pedestrian's skeleton to a binary crossing intention. We introduce TrouSPI-Net: a context-free, lightweight, multi-branch predictor. TrouSPI-Net extracts spatio-temporal features for different time resolutions by encoding pseudo-images sequences of skeletal joints' positions and processes them with parallel attention modules and atrous convolutions. The proposed approach is then enhanced by processing features such as relative distances of skeletal joints, bounding box positions, or ego-vehicle speed with U-GRUs. Using the newly proposed evaluation procedures for two large public naturalistic data sets for studying pedestrian behavior in traffic: JAAD and PIE, we evaluate TrouSPI-Net and analyze its performance. Experimental results show that TrouSPI-Net achieved 76% F1 score on JAAD and 80% F1 score on PIE, therefore outperforming current state-of-the-art while being lightweight and context-free.

show abstract

Asymmetrical Bi-RNN for pedestrian trajectory encoding

Cited by 2 publications

References 24 publications

Assessing Cross-dataset Generalization of Pedestrian Crossing Predictors

Assessing Cross-dataset Generalization of Pedestrian Crossing Predictors

TrouSPI-Net: Spatio-temporal attention on parallel atrous convolutions and U-GRUs for skeletal pedestrian crossing prediction

Contact Info

Product

Resources

About