2020
DOI: 10.3390/app10228288
|View full text |Cite
|
Sign up to set email alerts
|

Sequence-to-Sequence Video Prediction by Learning Hierarchical Representations

Abstract: Video prediction which maps a sequence of past video frames into realistic future video frames is a challenging task because it is difficult to generate realistic frames and model the coherent relationship between consecutive video frames. In this paper, we propose a hierarchical sequence-to-sequence prediction approach to address this challenge. We present an end-to-end trainable architecture in which the frame generator automatically encodes input frames into different levels of latent Convolutional Neural N… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 24 publications
0
2
0
Order By: Relevance
“…Authors in [9] explored a topic, namely whether network topologies are necessary, and instead propose a new approach: the main goal is to find the video prediction algorithm with the least inductive bias while optimising network capacity. Finally, it is shown that by just boosting the capacity of a regular neural network, a high-quality video prediction may be produced even without the usage of the previously stated techniques (such as adversarial objectives, optical flows, and so on) [9].…”
Section: Related Workmentioning
confidence: 99%
“…Authors in [9] explored a topic, namely whether network topologies are necessary, and instead propose a new approach: the main goal is to find the video prediction algorithm with the least inductive bias while optimising network capacity. Finally, it is shown that by just boosting the capacity of a regular neural network, a high-quality video prediction may be produced even without the usage of the previously stated techniques (such as adversarial objectives, optical flows, and so on) [9].…”
Section: Related Workmentioning
confidence: 99%
“…Further, they can be used to generate novel examples not found in the original dataset [ 21 ]. Such feature learning also supports a variety of other applications, such as super-resolution [ 22 ], multimodal application [ 23 , 24 , 25 , 26 , 27 ], medical imaging [ 28 , 29 ], video prediction [ 30 , 31 , 32 , 33 , 34 ], natural language processing [ 35 , 36 , 37 ], transfer learning and zero-shot learning [ 38 ].…”
Section: Introductionmentioning
confidence: 99%