2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018
DOI: 10.1109/cvpr.2018.00176
|View full text |Cite
|
Sign up to set email alerts
|

Generating a Fusion Image: One's Identity and Another's Shape

Abstract: Generating a novel image by manipulating two input images is an interesting research problem in the study of generative adversarial networks (GANs). We propose a new GAN-based network that generates a fusion image with the identity of input image x and the shape of input image y. Our network can simultaneously train on more than two image datasets in an unsupervised manner. We define an identity loss L I to catch the identity of image x and a shape loss L S to get the shape of y. In addition, we propose a nove… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
31
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 46 publications
(31 citation statements)
references
References 20 publications
0
31
0
Order By: Relevance
“…However, these approaches have limitations: for example, they rely on pre-trained models for extracting object representations that require costly ground-truth data annotations [9,47,3]. Furthermore, these works do not address the problem of animating arbitrary objects: instead, consid-ering a single object category [50] or learning to translate videos from one specific domain to another [4,25]. This paper addresses some of these limitations by introducing a novel deep learning framework for animating a static image using a driving video.…”
Section: Introductionmentioning
confidence: 99%
“…However, these approaches have limitations: for example, they rely on pre-trained models for extracting object representations that require costly ground-truth data annotations [9,47,3]. Furthermore, these works do not address the problem of animating arbitrary objects: instead, consid-ering a single object category [50] or learning to translate videos from one specific domain to another [4,25]. This paper addresses some of these limitations by introducing a novel deep learning framework for animating a static image using a driving video.…”
Section: Introductionmentioning
confidence: 99%
“…Similarly, we apply our representation of motion (pose stick figures) to different target subjects to generate new motions while in contrast our work specializes on synthesizing detailed dance movements. Modern approaches have shown success in generating detailed images of human subjects in novel poses [10,16,22,23,31]. Furthermore, recent methods can synthesize such images for temporally coherent video [2] and future prediction [31].…”
Section: Related Workmentioning
confidence: 99%
“…StarGAN [23] is a GAN that learns the mappings among multiple domains using only a single generator and a discriminator, training effectively from images of all domains. FusionGAN [24] generates a fusion image with the identity of input image x and the shape of input image y. This network can simultaneously train on more than two image datasets in an unsupervised manner.…”
Section: Related Workmentioning
confidence: 99%