“…Diffusion-based generative models have emerged as a powerful new framework for neural image synthesis, in both unconditional [15,33,42] and conditional [16,32,33,35,36,37,38,42] settings, even surpassing the quality of GANs [12] in certain situations [9]. They are also rapidly finding use in other domains such as audio [27,34] and video [18] generation, image segmentation [4,50] and language translation [31].…”