2022
DOI: 10.48550/arxiv.2206.03010
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

MS-RNN: A Flexible Multi-Scale Framework for Spatiotemporal Predictive Learning

Abstract: Spatiotemporal predictive learning, which predicts future frames through historical prior knowledge with the aid of deep learning, is widely used in many fields. Previous work essentially improves the model performance by widening or deepening the network, but it also brings surging memory overhead, which seriously hinders the development and application of this technology. In order to improve the performance without increasing memory consumption, we focus on scale, which is another dimension to improve model … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(9 citation statements)
references
References 33 publications
0
9
0
Order By: Relevance
“…Their origin is the ConvLSTM [16] proposed by Shi et al in 2015. Later, a large number of followers continue to refresh the forecasting performance, for example, Traj-GRU [8], PredRNN [28], PredRNN++ [45], MIM [29], Mo-tionRNN [31], PrecipLSTM [30], MS-RNN [32], and MS-LSTM [26]. GAN [17], [19], [46]- [48] uses UNet or Con-vRNN as the generator and uses one or more discriminators with different roles to play the minimax game.…”
Section: A Radar Video Prediction Modelsmentioning
confidence: 99%
See 3 more Smart Citations
“…Their origin is the ConvLSTM [16] proposed by Shi et al in 2015. Later, a large number of followers continue to refresh the forecasting performance, for example, Traj-GRU [8], PredRNN [28], PredRNN++ [45], MIM [29], Mo-tionRNN [31], PrecipLSTM [30], MS-RNN [32], and MS-LSTM [26]. GAN [17], [19], [46]- [48] uses UNet or Con-vRNN as the generator and uses one or more discriminators with different roles to play the minimax game.…”
Section: A Radar Video Prediction Modelsmentioning
confidence: 99%
“…Although they gain stronger forecasting capabilities, they will consume huge memory and computing resources. Multiscale RNN (MS-RNN [32]) proposes to adopt a multiscale architecture to improve these ConvRNN models, which will make them have less memory and computing requirements but stronger spatiotemporal modeling capabilities while keeping the number of parameters constant. Specifically, MS-RNN reintegrates these increasingly complex tensor flows and embeds the UNet [40] IV.…”
Section: B Ms-rnnmentioning
confidence: 99%
See 2 more Smart Citations
“…The RNN models are dominated by ConvLSTM [16] and its variants, such as Traj-GRU [17], PredRNN [18], PredRNN++ [22], E3D-LSTM [23], MIM [19], CubicLSTM [37], SA-ConvLSTM [24], Motion-RNN [20], and so on. These RNN models are getting wider and deeper [38]. Although they alleviate the prediction ambiguity problem to some extent, it also brings a significant increase in computational cost.…”
Section: Related Work Model Classificationmentioning
confidence: 99%