2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
DOI: 10.1109/cvpr46437.2021.00787
|View full text |Cite
|
Sign up to set email alerts
|

Diverse Semantic Image Synthesis via Probability Distribution Modeling

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
24
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
3
1

Relationship

2
7

Authors

Journals

citations
Cited by 56 publications
(24 citation statements)
references
References 22 publications
0
24
0
Order By: Relevance
“…Baselines. We compare our method with several semantic image editing baselines: SESAME [Ntavelis et al 2020], INADE [Tan et al 2021], Taming Transformers (TT) [Esser et al 2021a], and ImageBART [Esser et al 2021b]. SESAME and INADE are based on convolutional networks and only support image resolutions of up to 256 × 256 pixels.…”
Section: Resultsmentioning
confidence: 99%
“…Baselines. We compare our method with several semantic image editing baselines: SESAME [Ntavelis et al 2020], INADE [Tan et al 2021], Taming Transformers (TT) [Esser et al 2021a], and ImageBART [Esser et al 2021b]. SESAME and INADE are based on convolutional networks and only support image resolutions of up to 256 × 256 pixels.…”
Section: Resultsmentioning
confidence: 99%
“…The aforementioned methods mainly focus on generating real and semantically-corresponding unimodal result. Paralleled with these methods, some other methods [57,12,59,44] explore multimodal generation, which is also a core target for one-to-many problems like semantic image synthesis. To tackle this issue, BicycleGAN [57] encourages bidirectional mapping between the generated image and latent code, and DSCGAN [12] propose a simple regularization loss to penalize the generator from mode collapse.…”
Section: Semantic Image Synthesismentioning
confidence: 99%
“…To tackle this issue, BicycleGAN [57] encourages bidirectional mapping between the generated image and latent code, and DSCGAN [12] propose a simple regularization loss to penalize the generator from mode collapse. More recently, INADE [44] proposes a framework that supports diverse generation at the instance level by instanceadaptive stochastic sampling. However, these multimodal methods still fail to obtain satisfactory results on generation quality and learned correspondence.…”
Section: Semantic Image Synthesismentioning
confidence: 99%
“…Current conditional generative models can synthesize images according to additional conditions such as image (Choi et al 2020;Lee et al 2020;Liu et al 2021), label (Chen et al 2019;Yang et al 2021), segmentation mask (Park et al 2019;Huang et al 2020;Tan et al 2021), text (Xu et al 2018;Qiao et al 2019;Li et al 2019a;Zhu et al 2019;Li et al 2019b) and layout (Sun and Wu 2019; Ashual and Wolf 2019;Zhao et al 2019). The text conditions can either be natural language sentences or scene graphs (Li et al 2019c;Tseng et al 2020).…”
Section: Related Work Conditional Image Generationmentioning
confidence: 99%