2021 IEEE Winter Conference on Applications of Computer Vision (WACV) 2021
DOI: 10.1109/wacv48630.2021.00130
|View full text |Cite
|
Sign up to set email alerts
|

Benchmark for Evaluating Pedestrian Action Prediction

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
94
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 76 publications
(96 citation statements)
references
References 44 publications
1
94
1
Order By: Relevance
“…It integrates four feature sources: semantic map, pedestrian trajectory, grid position and vehicle speed. Kotseruba [13] considered four feature sources: local environment, pedestrian posture, pedestrian bounding box and vehicle speed. A threedimensional volume integral branch is used to encode visual information and a single RNN branch is used to process other information in parallel.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…It integrates four feature sources: semantic map, pedestrian trajectory, grid position and vehicle speed. Kotseruba [13] considered four feature sources: local environment, pedestrian posture, pedestrian bounding box and vehicle speed. A threedimensional volume integral branch is used to encode visual information and a single RNN branch is used to process other information in parallel.…”
Section: Related Workmentioning
confidence: 99%
“…For j = 0 (the bottom level of the stack), x t 0 = c t p and for j > 0, x t j = h t−1 j + c t p . Meanwhile, inspired by [3,13], we introduced the attention mechanism [18] into GRU to form At-GRU (attention-GRU). The attention module can selectively focus on some features, so as to better deal with key objects.…”
Section: Model Constructionmentioning
confidence: 99%
See 2 more Smart Citations
“…Also, we can formulate it as a higher-level semantic prediction such as the early anticipation of the future action of the pedestrian, for example, walking, running, performing hand gestures, or most importantly crossing or not crossing the street in front of the AV. Recently, trajectory and action prediction solutions have been proposed based on sequential reasoning that mainly use algorithms built on recurrent neural networks (i.e., RNN, LSTM) [4], [5], [6], [7]. However, it has recently become clear that LSTM lacks many capabilities to model sequential data.…”
Section: Introductionmentioning
confidence: 99%