2023
DOI: 10.1007/s00521-023-09185-6
|View full text |Cite
|
Sign up to set email alerts
|

Optimizing and interpreting the latent space of the conditional text-to-image GANs

Zhenxing Zhang,
Lambert Schomaker

Abstract: Text-to-image generation intends to automatically produce a photo-realistic image, conditioned on a textual description. To facilitate the real-world applications of text-to-image synthesis, we focus on studying the following three issues: (1) How to ensure that generated samples are believable, realistic or natural? (2) How to exploit the latent space of the generator to edit a synthesized image? (3) How to improve the explainability of a text-to-image generation framework? We introduce two new data sets for … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
references
References 47 publications
0
0
0
Order By: Relevance