2019 IEEE International Conference on Image Processing (ICIP) 2019
DOI: 10.1109/icip.2019.8803624
|View full text |Cite
|
Sign up to set email alerts
|

Effect of Architectures and Training Methods on the Performance of Learned Video Frame Prediction

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
14
0
5

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 8 publications
(19 citation statements)
references
References 7 publications
0
14
0
5
Order By: Relevance
“…In terms of network architecture, most previous methods use some form of recurrent convolutional encoder-decoder architecture [6,9]. There are also some methods that use 3D convolutions for handling temporal information [10].…”
Section: Frame Predictionmentioning
confidence: 99%
See 2 more Smart Citations
“…In terms of network architecture, most previous methods use some form of recurrent convolutional encoder-decoder architecture [6,9]. There are also some methods that use 3D convolutions for handling temporal information [10].…”
Section: Frame Predictionmentioning
confidence: 99%
“…Several works formulate the frame prediction problem as a synthesis problem directly in the pixel domain [6,9], whereas others model similarity between successive frames by means of explicit transformations [11,12].…”
Section: Frame Predictionmentioning
confidence: 99%
See 1 more Smart Citation
“…LSTM is a widely applicable kind of RNN which contains feedback connections for both single data points and entire data sequences in deep learning [ 50 ]. The optimization task regarding accurate future image prediction has been a highlighted problem in artificial intelligence in recent several years [ 51 , 52 , 53 , 54 , 55 , 56 , 57 , 58 , 59 , 60 , 61 , 62 , 63 , 64 , 65 , 66 , 67 ]. Kalchbrenner et al have developed a video pixel network to predict the joint distribution of future image in pixel videos [ 60 ].…”
Section: Introductionmentioning
confidence: 99%
“…Xue et al proposed a cross convolutional network to synthesize future images in a probabilistic manner, based on auto-encoders of future maps and convolutional kernels, respectively, with the single input image and unknown motions [ 52 ]. Additionally, subsequent layers model [ 66 ], generative adversarial networks [ 56 ], CNNs [ 55 ], convolutional LSTM [ 68 ], and cubic LSTM [ 58 ] play significant roles in the prediction of future images.…”
Section: Introductionmentioning
confidence: 99%