2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.00152
|View full text |Cite
|
Sign up to set email alerts
|

Mode Seeking Generative Adversarial Networks for Diverse Image Synthesis

Abstract: Latent code z ImageThis bird has feathers that are black and has a red belly Text Figure 1: Mode seeking generative adversarial networks (MSGANs). (Left) Existing conditional generative adversarial networks tend to ignore the input latent code z and generate images of similar modes. (Right) We propose a simple yet effective mode seeking regularization term that can be applied to arbitrary conditional generative adversarial networks in different tasks to alleviate the mode collapse issue and improve the diversi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
325
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
3
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 379 publications
(326 citation statements)
references
References 16 publications
(65 reference statements)
1
325
0
Order By: Relevance
“…For BicycleGAN [Zhu et al 2017b], although it can capture the content information and output roughly correct glyph shapes for the English dataset, it can not process the style information well. The more recent work, MS-Pix2Pix [Mao et al 2019], can not even handle the content information well, and sometimes outputs glyph images with incorrect contents. What is worse, both of these two methods are prone to mode collapse, and can not even converge on the Chinese dataset which is more challenging.…”
Section: Pre-training Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…For BicycleGAN [Zhu et al 2017b], although it can capture the content information and output roughly correct glyph shapes for the English dataset, it can not process the style information well. The more recent work, MS-Pix2Pix [Mao et al 2019], can not even handle the content information well, and sometimes outputs glyph images with incorrect contents. What is worse, both of these two methods are prone to mode collapse, and can not even converge on the Chinese dataset which is more challenging.…”
Section: Pre-training Resultsmentioning
confidence: 99%
“…Examples of some glyph images generated by BicycleGAN[Zhu et al 2017b], MS-Pix2Pix[Mao et al 2019] and our AGIS-Net, on English and Chinese glyph image datasets during pre-training.…”
mentioning
confidence: 99%
“…By varying interpolation coefficients a, we expect to generate diverse images, but one common problem for GAN is mode collapse [27], which means that the generated images may collapse into a few modes. In our fusion generator, when sampling two different interpolation coefficients a 1 and a 2 , the generated images G(a 1 , X S ) and G(a 2 , X S ) are likely to collapse into the same mode.…”
Section: Fusion Discriminatormentioning
confidence: 99%
“…To ensure the diversity of generated images, we additionally employ a mode seeking loss and an interpolation regression loss, both of which are related to interpolation coefficients. Specifically, we use a variant of mode seeking loss [27] to prevent the images generated based on different interpolation coefficients from collapsing to a few modes. Moreover, we propose a novel interpolation regression loss by regressing the interpolation coefficients based on the features of conditional images and generated image, which means that each generated image can recognize its corresponding interpolation coefficients.…”
Section: Introductionmentioning
confidence: 99%
“…Generative adversarial network [23], as an emerging generative algorithm, has made great progress in the quality and variety of generated images [24][25][26][27][28][29][30][31][32][33][34][35][36]. GAN can imitate the distribution of original data and output new samples with similar characteristics.…”
Section: Introductionmentioning
confidence: 99%