2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
DOI: 10.1109/cvpr52688.2022.01115
|View full text |Cite
|
Sign up to set email alerts
|

Diverse Plausible 360-Degree Image Outpainting for Efficient 3DCG Background Creation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 18 publications
(5 citation statements)
references
References 28 publications
0
5
0
Order By: Relevance
“…Spherical structure and texture are modeled using cube maps (Han and Suh 2020), cylinder convolution (Liao et al 2022), predicted panoramic three-dimensional structures (Song et al 2018), and scene symmetry (Hara, Mukuta, and Harada 2021).Most previous generative models are limited in handling fixed scenes and low-resolution images. Recent research (Sumantri and Park 2020;Akimoto, Matsuo, and Aoki 2022a) addresses these limitations using hierarchical synthesis networks (Sumantri and Park 2020), U-Net structures (Akimoto, Matsuo, and Aoki 2022a), and separate models for generating and upscaling low-resolution images (Chen, Wang, and Liu 2022). Our proposed method tackles high-resolution panorama context and preserves spherical structure within a single module.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Spherical structure and texture are modeled using cube maps (Han and Suh 2020), cylinder convolution (Liao et al 2022), predicted panoramic three-dimensional structures (Song et al 2018), and scene symmetry (Hara, Mukuta, and Harada 2021).Most previous generative models are limited in handling fixed scenes and low-resolution images. Recent research (Sumantri and Park 2020;Akimoto, Matsuo, and Aoki 2022a) addresses these limitations using hierarchical synthesis networks (Sumantri and Park 2020), U-Net structures (Akimoto, Matsuo, and Aoki 2022a), and separate models for generating and upscaling low-resolution images (Chen, Wang, and Liu 2022). Our proposed method tackles high-resolution panorama context and preserves spherical structure within a single module.…”
Section: Related Workmentioning
confidence: 99%
“…However, simply adopting current state-of-the-art methods such as DALL•E-like (Esser, Rombach, and Ommer 2021) for high-resolution panorama generation can result in spherical distortion and low efficiency. Recent efforts have attempted to address this issue by using separate models to first generate low-resolution panoramas and then upscaling to high-resolution, such as (Akimoto, Matsuo, and Aoki 2022a;Chen, Wang, and Liu 2022). These methods, however, can still result in artifacts caused by error accumulation.…”
Section: Introductionmentioning
confidence: 99%
“…Subsequently, GANs were introduced to the task [16,23,39,49] which do not suffer from the same computational limitations as pixel-wise autoregressive approaches. More recent works have utilized a Vector Quantised-Variational Autoencoder (VQ-VAE) [47] to great success [2,4]. Similar to image outpainting, our task requires generated images to appear coherent in weather and location; however, we also seek to generate distinct, partially overlapping camera views and require that portions of these views are conditionally generated from a BEV layout.…”
Section: Related Workmentioning
confidence: 99%
“…A key problem that arises when generating multiple images in parallel is the quadratic complexity of the selfattention mechanism. One solution to this issue would be to limit the sequence length of our transformer by using performing image extrapolation as in [2]. However, this limits the scene context and can cause later images to appear far different from the first image, despite having local image consistency.…”
Section: Random Maskingmentioning
confidence: 99%
“…Spherical panoramic images, also known as 360 • panoramic images or omnidirectional panoramic images, are used in various domains such as autonomous driving (de La Garanderie, Abarghouei, and Breckon 2018; Ma et al 2021), virtual reality (Xu, Zhang, and Gao 2021;Ai et al 2022), etc. Numerous studies (Yan et al 2022;Hara, Mukuta, and Harada 2021;Akimoto, Matsuo, and Aoki 2022) have been proposed for the synthesis of spherical panoramic images, with a primary focus on reconstructing scenes from narrow field of view (NFOV) images. However, these generation methods often produce images of inferior quality and lack controllability, which are crucial in real applications.…”
Section: Introductionmentioning
confidence: 99%