2018
DOI: 10.48550/arxiv.1812.01717
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Towards Accurate Generative Models of Video: A New Metric & Challenges

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
109
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 71 publications
(109 citation statements)
references
References 0 publications
0
109
0
Order By: Relevance
“…To quantitatively evaluate MAGE, we apply conventional pixel-based similarity metrics SSIM and PSNR for deterministic video generation. We also report several perceptual similarity metrics including imagelevel Fréchet Inception Distance (FID) [13] and Learned Perceptual Image Patch Similarity (LPIPS) [9], as well as video-level Fréchet-Video-Distance (FVD) [34]. To evaluate the diversity of generated videos given ambiguous text, following previous work [8], we measure the average mutual distance of generated video sequences in the feature space of both VGG-16 [31] and I3D [32] network.…”
Section: B Quantitative Resultsmentioning
confidence: 99%
“…To quantitatively evaluate MAGE, we apply conventional pixel-based similarity metrics SSIM and PSNR for deterministic video generation. We also report several perceptual similarity metrics including imagelevel Fréchet Inception Distance (FID) [13] and Learned Perceptual Image Patch Similarity (LPIPS) [9], as well as video-level Fréchet-Video-Distance (FVD) [34]. To evaluate the diversity of generated videos given ambiguous text, following previous work [8], we measure the average mutual distance of generated video sequences in the feature space of both VGG-16 [31] and I3D [32] network.…”
Section: B Quantitative Resultsmentioning
confidence: 99%
“…Although given only one frame as condition (Cond. ), N ÜWA still significantly pushes the state-of-the-art FVD [38] score from 94±2 to 86.9.…”
Section: Comparison With State-of-the-artmentioning
confidence: 92%
“…However, perfect image quality and static consistency can be achieved by simply not animating anything at all and they cannot reflect the quality of generated videos. Thus we use Fréchet Video Distance [25] to measure the quality of generated videos, especially on their motion. User studies are further conducted to evaluate their quality from human sense.…”
Section: Methodsmentioning
confidence: 99%