2019
DOI: 10.1109/access.2019.2958864
|View full text |Cite
|
Sign up to set email alerts
|

Dualattn-GAN: Text to Image Synthesis With Dual Attentional Generative Adversarial Network

Abstract: Recent generative adversarial network based methods have shown promising results for the charming but challenging task of synthesizing images from text descriptions. These approaches can generate images with general shape and color but often produce distorted global structures with unnatural local semantic details. It is due to ineffectiveness of convolutional neural networks in capturing the high-level semantic information for pixel-level image synthesis. In this paper, we propose a Dual Attentional Generativ… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 43 publications
(15 citation statements)
references
References 19 publications
0
15
0
Order By: Relevance
“…We have introduced several kinds of evasion attack strategies against DGMs, all of which are for computer vision. However, DGMs have wide applications in NLP, such as text to image (T2I) [93], [94], or text generation [95], [96], e.g., for writing poems [97] or medical record synthesis [98]. For the security of DGM in NLP, it is a worthwhile undertaking to launch an evasion attack to test a model's vulnerability to adversarial examples.…”
Section: Evasion Attacks On Nlpmentioning
confidence: 99%
“…We have introduced several kinds of evasion attack strategies against DGMs, all of which are for computer vision. However, DGMs have wide applications in NLP, such as text to image (T2I) [93], [94], or text generation [95], [96], e.g., for writing poems [97] or medical record synthesis [98]. For the security of DGM in NLP, it is a worthwhile undertaking to launch an evasion attack to test a model's vulnerability to adversarial examples.…”
Section: Evasion Attacks On Nlpmentioning
confidence: 99%
“…Amodio et al [29] proposed TraVeLGAN to reduce the difficulty of CycleGAN training. To further improve the generation performance, the attention mechanism has been recently investigated in image translation, such as in [30].…”
Section: B Image-to-image Translationmentioning
confidence: 99%
“…In cGAN, the generation of samples can be conditioned on class information [40], text description [24], [41], audio [42], [43], skeleton [44], [45], and attributes [46]. Using an encoder-decoder architecture, the conditions can be applied to conduct domain changes on images such as image editing [47], image segmentation [48] and image inpainting [49].…”
Section: A Generative Adversarial Networkmentioning
confidence: 99%