Special Interest Group on Computer Graphics and Interactive Techniques Conference Proceedings 2022
DOI: 10.1145/3528233.3530757
|View full text |Cite
|
Sign up to set email alerts
|

Palette: Image-to-Image Diffusion Models

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
483
0
2

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3
1

Relationship

2
7

Authors

Journals

citations
Cited by 1,000 publications
(487 citation statements)
references
References 41 publications
2
483
0
2
Order By: Relevance
“…Diffusion-based generative models have emerged as a powerful new framework for neural image synthesis, in both unconditional [15,33,42] and conditional [16,32,33,35,36,37,38,42] settings, even surpassing the quality of GANs [12] in certain situations [9]. They are also rapidly finding use in other domains such as audio [27,34] and video [18] generation, image segmentation [4,50] and language translation [31].…”
Section: Introductionmentioning
confidence: 99%
“…Diffusion-based generative models have emerged as a powerful new framework for neural image synthesis, in both unconditional [15,33,42] and conditional [16,32,33,35,36,37,38,42] settings, even surpassing the quality of GANs [12] in certain situations [9]. They are also rapidly finding use in other domains such as audio [27,34] and video [18] generation, image segmentation [4,50] and language translation [31].…”
Section: Introductionmentioning
confidence: 99%
“…We find that L 1 norm gives slightly better FID scores than L 2 . However, subsequent work shows that L 2 tends to generate greater diversity in SR3 outputs [34].…”
Section: Cascaded High-resolution Image Synthesismentioning
confidence: 97%
“…We expect both of these variants to work reasonably well if p(γ) is modified to account for the scale of the loss function. Further investigation of the loss function for training the denoising model is an interesting area for future research (e.g., see [34]).…”
Section: Optimizing the Denoising Modelmentioning
confidence: 99%
“…The denoising process can be extended for the purpose of conditional generation by adding other signals as the condition [10]. When implementing the network backbone as a UNet, diffusion models are well suited for image-like data and achieved state-of-the-art results in image generation [44,46,12], superresolution [49,25,35], image-to-image translation [48] and so on. In this paper, Fig.…”
Section: Related Workmentioning
confidence: 99%