2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018
DOI: 10.1109/cvpr.2018.00593
|View full text |Cite
|
Sign up to set email alerts
|

DA-GAN: Instance-Level Image Translation by Deep Attention Generative Adversarial Networks

Abstract: Unsupervised image translation, which aims in translating two independent sets of images, is challenging in discovering the correct correspondences without paired data. Existing works build upon Generative Adversarial Network (GAN) such that the distribution of the translated images are indistinguishable from the distribution of the target set. However, such set-level constraints cannot learn the instance-level correspondences (e.g. aligned semantic parts in object configuration task). This limitation often re… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
96
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
5
5

Relationship

0
10

Authors

Journals

citations
Cited by 144 publications
(96 citation statements)
references
References 21 publications
0
96
0
Order By: Relevance
“…Attention mechanism is an emerging topic in natural language tasks [20] and image/video generation task [26,37,22,36]. Pumarola et al [26] generated facial expression conditioned on action units annotations.…”
Section: Attention Mechanismmentioning
confidence: 99%
“…Attention mechanism is an emerging topic in natural language tasks [20] and image/video generation task [26,37,22,36]. Pumarola et al [26] generated facial expression conditioned on action units annotations.…”
Section: Attention Mechanismmentioning
confidence: 99%
“…Although images with realistic texture have been synthesized on simple datasets, such as birds [29,16] and flowers [33], most existing approaches do not specifically model objects and their relations in images and thus have difficulties in generating complex scenes such as those in the COCO dataset [15]. For example, generating images from a sentence "several people in their ski gear are in the snow" requires modeling of different objects (people, ski gear) and their interactions (people on top of ski gear), as well as filling the missing information (e.g., the rocks in the background).…”
Section: Introductionmentioning
confidence: 99%
“…To further improve the generation performance, the attention mechanism has been recently investigated in image translation, such as [3,45,39,24,26]. However, to the best of our knowledge, our model is the first attempt to incorporate a multi-channel attention selection module within a GAN framework for image-to-image translation task.…”
Section: Related Workmentioning
confidence: 99%