2020
DOI: 10.1609/aaai.v34i07.6969
|View full text |Cite
|
Sign up to set email alerts
|

Human Synthesis and Scene Compositing

Abstract: Generating good quality and geometrically plausible synthetic images of humans with the ability to control appearance, pose and shape parameters, has become increasingly important for a variety of tasks ranging from photo editing, fashion virtual try-on, to special effects and image compression. In this paper, we propose a HUSC (HUman Synthesis and Scene Compositing) framework for the realistic synthesis of humans with different appearance, in novel poses and scenes. Central to our formulation is 3d reasoning … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
6
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 22 publications
(6 citation statements)
references
References 28 publications
0
6
0
Order By: Relevance
“…A recent promising direction synthesizes realistic looking people in images [57,61]. Zanfir et al [57] use a learned human synthesis method to insert generated people in images such that they make sense relative to the scene geometry and lighting.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…A recent promising direction synthesizes realistic looking people in images [57,61]. Zanfir et al [57] use a learned human synthesis method to insert generated people in images such that they make sense relative to the scene geometry and lighting.…”
Section: Related Workmentioning
confidence: 99%
“…A recent promising direction synthesizes realistic looking people in images [57,61]. Zanfir et al [57] use a learned human synthesis method to insert generated people in images such that they make sense relative to the scene geometry and lighting. While they can condition the generated person on pose and shape, the resulting images contain artifacts that are common to generative models, making the results unsuitable as ground truth.…”
Section: Related Workmentioning
confidence: 99%
“…Others have also considered detailed 3D modeling of people and the input scene (Zanfir et al 2020), or modeling individual component attributes (e.g., mixing clothing items such as shorts and a tank top from different input photographs (Men et al 2020). While the outputs of these models are certainly impressive, their focus (and their evaluation) is on generating visually pleasing fashion photographs in, typically, indoor scenes with a limited variation of imaging conditions.…”
Section: Related Work and Positioningmentioning
confidence: 99%
“…Video generation is a challenging problem. GAN [34] based generative models are proposed benefiting from the development of computing power and achieve great success in image generation [35]- [41], but video generation is much harder for GANs due to more data contents and extra temporal information. Many works [14], [16], [18], [42] directly implement 3D convolutions to encode temporal information.…”
Section: Related Workmentioning
confidence: 99%