2020
DOI: 10.48550/arxiv.2010.08188
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Vid-ODE: Continuous-Time Video Generation with Neural Ordinary Differential Equation

Abstract: Video generation models often operate under the assumption of fixed frame rates, which leads to suboptimal performance when it comes to handling flexible frame rates (e.g., increasing the frame rate of more dynamic portion of the video as well as handling missing video frames). To resolve the restricted nature of existing video generation models' ability to handle arbitrary timesteps, we propose continuous-time video generation by combining neural ODE (Vid-ODE) with pixellevel video processing techniques. Usin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
7
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(7 citation statements)
references
References 20 publications
0
7
0
Order By: Relevance
“…To the best of our knowledge, most existing works in computer vision process videos as discrete collections of frames. The only exception is Vid-ODE [27], which represent videos by continuous latent states. The latent state can be evaluated at any given timestamp, allowing the video to be rendered with an infinitely high frame rate.…”
Section: Video Representationmentioning
confidence: 99%
“…To the best of our knowledge, most existing works in computer vision process videos as discrete collections of frames. The only exception is Vid-ODE [27], which represent videos by continuous latent states. The latent state can be evaluated at any given timestamp, allowing the video to be rendered with an infinitely high frame rate.…”
Section: Video Representationmentioning
confidence: 99%
“…In contrast to the prior work, our generator is continuous in time. In this way it is similar to Vid-ODE [47]: a continuous-time video interpolation and prediction model based on neural ODEs [13].…”
Section: Related Workmentioning
confidence: 99%
“…Equipped with widely used numerical solvers such as Runge-Kutta and Dormand-Prince method, neural ODE has the capacity to express the latent state in continuous-depth, or equivalently continuous-time. The continuous nature of neural ODE paved a way to design the continuous time-series modeling as shown in following studies [7,9,24,27,41]. Latent ODE [27] introduced ODE-RNN as an encoder and demonstrated the effectiveness of handling the time-series data taken at non-uniform intervals.…”
Section: Related Workmentioning
confidence: 99%
“…Latent ODE [27] introduced ODE-RNN as an encoder and demonstrated the effectiveness of handling the time-series data taken at non-uniform intervals. Furthermore, ODE 2 VAE [41] and Vid-ODE [24] performed continuous-time video prediction conditioned on input video frames, demonstrating the potential to apply neural ODE to computer vision. given two noise vectors, the motion noise vector z m ∈ Z M and the appearance noise vector z a ∈ Z A , where T denotes the number of frames, H and W the height and width of the generated image, respectively.…”
Section: Related Workmentioning
confidence: 99%