2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019
DOI: 10.1109/iccv.2019.00279
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised Robust Disentangling of Latent Characteristics for Image Synthesis

Abstract: Deep generative models come with the promise to learn an explainable representation for visual objects that allows image sampling, synthesis, and selective modification. The main challenge is to learn to properly model the independent latent characteristics of an object, especially its appearance and pose. We present a novel approach that learns disentangled representations of these characteristics and explains them individually. Training requires only pairs of images depicting the same object appearance, but … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
23
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
5
5

Relationship

0
10

Authors

Journals

citations
Cited by 38 publications
(23 citation statements)
references
References 37 publications
0
23
0
Order By: Relevance
“…Siarohin et al learn unsupervised keypoints in order to warp object parts into novel poses [56,57]. Lorenz et al [37] show that body parts can also be represented by unsupervised learning, which also helps to disentangle body pose and shape [11]. Nevertheless, without the benefit of explicit pose information from a human detection model, these methods struggle to generate good results for challenging driving poses.…”
Section: Related Workmentioning
confidence: 99%
“…Siarohin et al learn unsupervised keypoints in order to warp object parts into novel poses [56,57]. Lorenz et al [37] show that body parts can also be represented by unsupervised learning, which also helps to disentangle body pose and shape [11]. Nevertheless, without the benefit of explicit pose information from a human detection model, these methods struggle to generate good results for challenging driving poses.…”
Section: Related Workmentioning
confidence: 99%
“…Image methods. Controllable generative models have also been developed for images (Härkönen et al, 2020;Esser et al, 2019;Singh et al, 2019;Lample et al, 2017;Karras et al, 2020;Brock et al, 2019;Collins et al, 2020;Shen et al, 2020;Esser et al, 2020;Goetschalckx et al, 2019;Pavllo et al, 2020;, which control the object class, pose, lighting, etc., of an image. Many image style transform methods have also been developed Gatys et al, 2016).…”
Section: Related Workmentioning
confidence: 99%
“…Other works. Esser et al adopt the disentanglement of the human body pose from the corresponding appearance (style) information in the context of a dual-encoder VAE setting, however they use the body-related factors for other human appearance transfer [75] and synthesis [108].…”
Section: Tips and Tricksmentioning
confidence: 99%