2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
DOI: 10.1109/cvpr46437.2021.01518
|View full text |Cite
|
Sign up to set email alerts
|

MotionRNN: A Flexible Model for Video Prediction with Spacetime-Varying Motions

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
65
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 119 publications
(65 citation statements)
references
References 9 publications
0
65
0
Order By: Relevance
“…This feature allows them to be trained on sequences to reflect the underlying temporal dynamics. Thus, recurrent structures have become popular solutions for video prediction [27], embedded dynamics [28], trajectory forecasting [29] and translation [30], etc.…”
Section: Related Workmentioning
confidence: 99%
“…This feature allows them to be trained on sequences to reflect the underlying temporal dynamics. Thus, recurrent structures have become popular solutions for video prediction [27], embedded dynamics [28], trajectory forecasting [29] and translation [30], etc.…”
Section: Related Workmentioning
confidence: 99%
“…Guen et al proposed the PhyCell [19] to disentangle physical dynamics from unknown factors to predict more reliable motions. And Wu et al [20] proposed the Motion-GRU to independently model the transient variation and motion trend for more satisfactory predictions.…”
Section: Related Workmentioning
confidence: 99%
“…Some works [17], [18] attempted to utilize the preserved visual details during the feature extraction to augment the visual quality of the predictions. [19], [20] have explored to disentangle the physical dynamics (motion patterns) to help predict more satisfactory human actions. In addition, some works [21], [22], [23], [24], [25], [26] have begun to augment the long-term memorizing ability for LSTMs with the help of the attention mechanism, which can also help broaden the spatiotemporal receptive field of the predictive units.…”
Section: Introductionmentioning
confidence: 99%
“…wider (e.g., TrajGRU [14], PredRNN [7], MIM [15], and SA-ConvLSTM [10]) or deeper (e.g., PredRNN++ [9] and MotionRNN [18]). As a result, they bring limited improvement in model performance but introduce significant growth in GPU memory usage.…”
Section: Introductionmentioning
confidence: 99%