2019
DOI: 10.1016/j.cviu.2018.10.009
|View full text |Cite
|
Sign up to set email alerts
|

Pros and cons of GAN evaluation measures

Abstract: Generative models, in particular generative adversarial networks (GANs), have gained significant attention in recent years. A number of GAN variants have been proposed and have been utilized in many applications. Despite large strides in terms of theoretical progress, evaluating and comparing GANs remains a daunting task. While several measures have been introduced, as of yet, there is no consensus as to which measure best captures strengths and limitations of models and should be used for fair model compariso… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
438
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 720 publications
(508 citation statements)
references
References 158 publications
(277 reference statements)
2
438
0
1
Order By: Relevance
“…29 Despite the fact that there is no universal criteria for the simple ranking of different methods in terms of the degree of resemblance to realistic images, FID is widely used as a GAN performance evaluation metric. 31,35,36 For our purposes, FID was suitable for showing an improvement in the quality levels of images reconstructed with GAN-based methods compared to non-GAN-based methods. The FID scores in our study were slightly higher than those on other tasks, such as CIFAR-10, CelebA, and MNIST, among others 31,35 ; this is presumably related to differences in the sample sizes.…”
Section: Discussionmentioning
confidence: 98%
See 1 more Smart Citation
“…29 Despite the fact that there is no universal criteria for the simple ranking of different methods in terms of the degree of resemblance to realistic images, FID is widely used as a GAN performance evaluation metric. 31,35,36 For our purposes, FID was suitable for showing an improvement in the quality levels of images reconstructed with GAN-based methods compared to non-GAN-based methods. The FID scores in our study were slightly higher than those on other tasks, such as CIFAR-10, CelebA, and MNIST, among others 31,35 ; this is presumably related to differences in the sample sizes.…”
Section: Discussionmentioning
confidence: 98%
“…Despite the fact that there is no universal criteria for the simple ranking of different methods in terms of the degree of resemblance to realistic images, FID is widely used as a GAN performance evaluation metric . For our purposes, FID was suitable for showing an improvement in the quality levels of images reconstructed with GAN‐based methods compared to non‐GAN‐based methods.…”
Section: Discussionmentioning
confidence: 99%
“…Despite progress in the theoretical understanding of generative models and increased attention in GAN research, evaluating and comparing the performance of these models still remain a hard task. While several measures have been introduced, there is no consensus yet as to which measure best captures the strengths and limitations of the models and yield a fair model comparison [53]. Moreover, evaluation metrics are usually problem specific.…”
Section: Kinematic Evaluation Of Synthetic Signaturesmentioning
confidence: 99%
“…We train the CGAN for a total of 45 epochs, gradually increasing the training batch size from 32 to 256. In general, quantitative evaluation of GAN-generated images is a topic of ongoing debate with no clear consensus (Borji, 2018). We performed the training using a single Nvidia Tesla K80 general-purpose graphics processing unit; the full training required approximately 40 hr.…”
Section: 1029/2019gl082532mentioning
confidence: 99%