2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019
DOI: 10.1109/iccv.2019.00636
|View full text |Cite
|
Sign up to set email alerts
|

PIE: A Large-Scale Dataset and Models for Pedestrian Intention Estimation and Trajectory Prediction

Abstract: Predicting pedestrian behavior is one of the main challenges for intelligent driving systems. In this paper, we present a new paradigm for evaluating egocentric pedestrian trajectory prediction algorithms. Based on various contextual information, we extract driving scenarios for a meaningful and systematic approach to identifying challenges for prediction models. In this regard, we also propose a new metric for more effective ranking within the scenario-based evaluation. We conduct extensive empirical studies … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
220
1
8

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 269 publications
(230 citation statements)
references
References 72 publications
1
220
1
8
Order By: Relevance
“…In [ 26 ], the creators of JAAD released a new dataset, called PIE (details in Table 1 ), recorded continuously in a single day in Toronto, with clear weather. In addition to label more relevant objects than in JAAD, it includes ego-motion information of the car.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In [ 26 ], the creators of JAAD released a new dataset, called PIE (details in Table 1 ), recorded continuously in a single day in Toronto, with clear weather. In addition to label more relevant objects than in JAAD, it includes ego-motion information of the car.…”
Section: Related Workmentioning
confidence: 99%
“…This new information is used in combination with estimated pose keypoints, bounding box coordinates, and bounding boxes image crops to train a stacked GRU model in [ 5 ]. In the same release article of PIE [ 26 ], a multi-task and multi-branch model is proposed. It combines the use of LSTM layers for non-image information (bounding box coordinates and ego-motion information through ego-vehicle OBD measures) and convolutional LSTM layers for image information (bounding boxes crops).…”
Section: Related Workmentioning
confidence: 99%
“…Most often, recurrent layers are also used for this purpose. [15], [16], [17]. Then, having the calibration parameters of the camera, we can understand the position of the object in space [18].…”
Section: Fig 1 An Example Of Input Representationmentioning
confidence: 99%
“…Unsupervised learning was combined with the Gaussian process to quickly detect the changes in pedestrian intention. Rasouli et al [43] proposed an RNN encoder-decoder architecture for pedestrian trajectory prediction. Pedestrian identified latent intention and the predicted vehicle speed were combined to the trajectory prediction module by the decoder.…”
Section: Introductionmentioning
confidence: 99%
“…Rasouli et al. [43] proposed an RNN encoder‐decoder architecture for pedestrian trajectory prediction. Pedestrian identified latent intention and the predicted vehicle speed were combined to the trajectory prediction module by the decoder.…”
Section: Introductionmentioning
confidence: 99%