Abstract:The task of text-to-image generation has achieved remarkable progress due to the advances in the conditional generative adversarial networks (GANs). However, existing conditional text-to-image GANs approaches mostly concentrate on improving both image quality and semantic relevance but ignore the explainability of the model which plays a vital role in real-world applications. In this paper, we present a variety of techniques to take a deep look into the latent space and semantic space of the conditional text-t… Show more
“…Zhang et al [9] proposed DiverGAN inserting a dense layer into the pipeline to address the lack-of-diversity problem present in current single-stage text-to-image GAN models. Zhang et al [27] introduced linear-interpolation and triangularinterpolation techniques to explain the single-stage text-toimage GAN model. Moreover, a Good/Bad data set was created to select successfully generated images and corresponding good latent codes.…”
“…Zhang et al [9] proposed DiverGAN inserting a dense layer into the pipeline to address the lack-of-diversity problem present in current single-stage text-to-image GAN models. Zhang et al [27] introduced linear-interpolation and triangularinterpolation techniques to explain the single-stage text-toimage GAN model. Moreover, a Good/Bad data set was created to select successfully generated images and corresponding good latent codes.…”
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.