2020
DOI: 10.48550/arxiv.2010.09662
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Attention Augmented ConvLSTM for Environment Prediction

Abstract: Safe and proactive planning in robotic systems generally requires accurate predictions of the environment. Prior work on environment prediction applied video frame prediction techniques to bird's-eye view environment representations, such as occupancy grids. ConvLSTM-based frameworks used previously often result in significant blurring and vanishing of moving objects, thus hindering their applicability for use in safety-critical applications. In this work, we propose two extensions to the ConvLSTM to address t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 20 publications
0
3
0
Order By: Relevance
“…Lastly, it is expected that the performance can be further improved by applying attention to features that have greater influence on the future sequence using an attention block. In the future, we will also evaluate incorporating more recent ConvLSTM [25,42] and transformer-based modules (handles missing/noisy observations more naturally) [43,44] in our proposed seq2seq model along with evaluating these on a much larger 1000 patient lung radiotherapy cohort we are assembling.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Lastly, it is expected that the performance can be further improved by applying attention to features that have greater influence on the future sequence using an attention block. In the future, we will also evaluate incorporating more recent ConvLSTM [25,42] and transformer-based modules (handles missing/noisy observations more naturally) [43,44] in our proposed seq2seq model along with evaluating these on a much larger 1000 patient lung radiotherapy cohort we are assembling.…”
Section: Discussionmentioning
confidence: 99%
“…Zhang et al [23] studied tumor growth prediction using ConvLSTM with cropped 32 × 32 patches of tumor CT, tumor contour, and intercellular volume fraction images. Even though the results achieved 0.80 dice overlap, their model suffered from the known blurring and limited long-term dependency ConvLSTM issues [25,26]. Moreover, the study focused on predicting a single timepoint from two earlier timepoints.…”
Section: Introduction and Purposementioning
confidence: 99%
“…Itkina et al proposed to use evidential occupancy grid and implement PredNet architecture for the prediction [16]. The approach is then carried forward to develop the double-pronged architecture [17] and attention-augmented ConvLSTM [18]. The latter work is able to make long-term predictions, however at turns the predictions still lose the scene structure.…”
Section: B Occupancy Grid Predictionmentioning
confidence: 99%