2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017
DOI: 10.1109/cvpr.2017.233
|View full text |Cite
|
Sign up to set email alerts
|

DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents

Abstract: We introduce a Deep Stochastic IOC 1 RNN Encoderdecoder framework, DESIRE, for the task of future predictions of multiple interacting agents in dynamic scenes. DESIRE effectively predicts future locations of objects in multiple scenes by 1) accounting for the multi-modal nature of the future prediction (i.e., given the same context, future may vary), 2) foreseeing the potential future outcomes and make a strategic prediction based on that, and 3) reasoning not only from the past motion history, but also from t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

6
915
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 961 publications
(921 citation statements)
references
References 49 publications
6
915
0
Order By: Relevance
“…Neural nets can also learn how multiple agents move in each others presence Yi et al 2016), even from a vehicle perspective (Karasev et al 2016;Lee et al 2017). In the IV domain, interaction of road users with the ego-vehicle is especially important.…”
Section: Static Environment Cuesmentioning
confidence: 99%
“…Neural nets can also learn how multiple agents move in each others presence Yi et al 2016), even from a vehicle perspective (Karasev et al 2016;Lee et al 2017). In the IV domain, interaction of road users with the ego-vehicle is especially important.…”
Section: Static Environment Cuesmentioning
confidence: 99%
“…An alternative is to employ a nonparametric prediction architecture (107). A sample generation module consisting of a conditional variational autoencoder was able to learn a sampling model that, given observations of past trajectories, produces a diverse set of prediction hypotheses to capture the multimodality of the space of plausible futures.…”
Section: Learning-based Approachesmentioning
confidence: 99%
“…Encoder-decoder architectures excel in many sequential problems like trajectory prediction [1], [2]. Encoder-decoder architectures consist of two separate recurrent networks: the encoder processes the input sequence x i = (x i,1 , .…”
Section: Encoder-decodermentioning
confidence: 99%
“…Additionally, we introduce a new metric tailored for ambiguous problems, as we found previous ones to be unsuited. Indeed, many previous works use some form of oracle metric, which only considers the best hypothesis [1], [2]. Although theoretically sound, this could easily be fooled by guessing a multitude of diverse solutions in hopes of approximating the correct one.…”
Section: Introductionmentioning
confidence: 99%