2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021
DOI: 10.1109/iccv48922.2021.01370
|View full text |Cite
|
Sign up to set email alerts
|

DAE-GAN: Dynamic Aspect-aware GAN for Text-to-Image Synthesis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
27
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 76 publications
(27 citation statements)
references
References 26 publications
0
27
0
Order By: Relevance
“…DSE-GAN also outperforms others for R-precision (from 71.08 to 76.31, a 7.36% relative improvement), which indicates the better text-image alignment of our models. Although our DSE-GAN is inferior to some previous works (e.g., DMGAN [3] and DAE-GAN [53]) for IS on MSCOCO, IS score is known for failing to evaluate the generation quality on MSCOCO [4,12,20] and the visual inspection in Fig. 5 indicates DSE-GAN's image quality is much higher than others.…”
Section: Comparison With State-of-the-art Methodsmentioning
confidence: 57%
See 2 more Smart Citations
“…DSE-GAN also outperforms others for R-precision (from 71.08 to 76.31, a 7.36% relative improvement), which indicates the better text-image alignment of our models. Although our DSE-GAN is inferior to some previous works (e.g., DMGAN [3] and DAE-GAN [53]) for IS on MSCOCO, IS score is known for failing to evaluate the generation quality on MSCOCO [4,12,20] and the visual inspection in Fig. 5 indicates DSE-GAN's image quality is much higher than others.…”
Section: Comparison With State-of-the-art Methodsmentioning
confidence: 57%
“…. We qualitatively compare the generated images from our method with three recent state-of-the-art GAN methods, i.e., DMGAN [3], DFGAN [12] and DAE-GAN [53]. For the CUB-200 benchmark, as shown in the first four columns in Fig.…”
Section: Qualitative Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Similar to [129], Dynamic Aspect-awarE GAN (DAE-GAN) [136] refers to the importance of aspect in the input text. The model represents text information from multiple granularities of sentence-level, word-level, and aspect-level, for which, besides other attention mechanisms, the aspect-aware dynamic re-drawer (ADR) module is employed.…”
Section: Direct T2imentioning
confidence: 99%
“…So, one future direction could be the use of the latent space of these transformer-based models for text embeddings. Another interpretation in text embeddings is the level of attention, ranging from sentences [7] to words [34], and aspects [136], where further extensions such as sensitivity to grammar, positional, and numerical information are neglected. Therefore, the new level of attention to the models is seemingly interesting.…”
Section: Generative Modelsmentioning
confidence: 99%