2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020
DOI: 10.1109/cvpr42600.2020.00790
|View full text |Cite
|
Sign up to set email alerts
|

ManiGAN: Text-Guided Image Manipulation

Abstract: The goal of our paper is to semantically edit parts of an image matching a given text that describes desired attributes (e.g., texture, colour, and background), while preserving other contents that are irrelevant to the text. To achieve this, we propose a novel generative adversarial network (ManiGAN), which contains two key components: text-image affine combination module (ACM) and detail correction module (DCM). The ACM selects image regions relevant to the given text and then correlates the regions with cor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

1
291
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 262 publications
(292 citation statements)
references
References 25 publications
1
291
0
Order By: Relevance
“…Recently, several methods to perform image manipulation using natural language descriptions without style transfer have been increasingly proposed [30]- [33]. These methods semantically associate a part of the input image with the user's textual description.…”
Section: Text-guided Image Manipulationmentioning
confidence: 99%
See 1 more Smart Citation
“…Recently, several methods to perform image manipulation using natural language descriptions without style transfer have been increasingly proposed [30]- [33]. These methods semantically associate a part of the input image with the user's textual description.…”
Section: Text-guided Image Manipulationmentioning
confidence: 99%
“…These methods semantically associate a part of the input image with the user's textual description. Particularly, one of the state-ofthe-art methods in text-guided image manipulation, Mani-GAN [33] has shown excellent results. ManiGAN is one of the generative adversarial networks for image manipulation consisting of text-image affine combination (ACM) moldules and detail correction (DCM) modules.…”
Section: Text-guided Image Manipulationmentioning
confidence: 99%
“…On the other hand, if image manipulation by the natural language description that humans generally use becomes feasible, the development of various user-friendly applications accelerates. Hence, some text-guided image manipulation methods have been proposed recently [11][12][13]. These methods modify some parts of the input image in accordance This work was partly supported by JSPS KAKENHI Grant Number JP17H01744.…”
Section: Introductionmentioning
confidence: 99%
“…However, the state-ofthe-art methods are still insufficient for accurate image manipulation (see SISGAN [11] and TAGAN [12] in Fig. 1), and even the best performing method, ManiGAN [13], modifies undesired parts such as berries and tree branches (see ManiGAN [13] in Fig. 1).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation