VConv-DAE: Deep Volumetric Shape Learning Without Object Labels

Sharma, Abhishek; Grau, Oliver; Fritz, Mario

doi:10.1007/978-3-319-49409-8_20

Cited by 223 publications

(157 citation statements)

References 26 publications

Supporting

Mentioning

155

Contrasting

Unclassified

Order By: Relevance

“…The volumetric representation is processed by 3D ShapeNets to identify the observed shape, the free space and the occluded space. The method presented by [31] proposes a network for deep volumetric shape learning. Given a collection of shapes of various objects and their different poses, the network learns the distributions of shapes of various classes by predicting the missing sections.…”

Section: Related Workmentioning

confidence: 99%

“…Although these methods have shown promising results on 3D shapes’ extraction, in most cases, they are limited to specific objects. Furthermore, these methods do not present polyhedral structures’ extraction on buildings or outdoor scenes [31,32,33]. On the other hand, the approaches that have promising results in outdoor scenes, in most cases, are those limited to plane sections’ extraction without providing polyhedral structures’ extraction [15,29].…”

Section: Related Workmentioning

confidence: 99%

See 1 more Smart Citation

High Level 3D Structure Extraction from a Single Image Using a CNN-Based Approach

Osuna-Coutiño

Martínez-Carranza

2019

Sensors

View full text Add to dashboard Cite

High-Level Structure (HLS) extraction in a set of images consists of recognizing 3D elements with useful information to the user or application. There are several approaches to HLS extraction. However, most of these approaches are based on processing two or more images captured from different camera views or on processing 3D data in the form of point clouds extracted from the camera images. In contrast and motivated by the extensive work developed for the problem of depth estimation in a single image, where parallax constraints are not required, in this work, we propose a novel methodology towards HLS extraction from a single image with promising results. For that, our method has four steps. First, we use a CNN to predict the depth for a single image. Second, we propose a region-wise analysis to refine depth estimates. Third, we introduce a graph analysis to segment the depth in semantic orientations aiming at identifying potential HLS. Finally, the depth sections are provided to a new CNN architecture that predicts HLS in the shape of cubes and rectangular parallelepipeds.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

High Level 3D Structure Extraction from a Single Image Using a CNN-Based Approach

Osuna-Coutiño

Martínez-Carranza

2019

Sensors

View full text Add to dashboard Cite

show abstract

“…The decoder uses two deconvolution layers: the filter size of the first layers and feature map were set to

()(, i, j, k) = 6, f_{out} = 64

, with a stride of two, and the second layer has

()(, i, j, k) = 7, f_{out} = 1

with a stride of three. This modified network was named AE‐CNN, according to the autoencoder introduced in [15].…”

Section: Methodsmentioning

confidence: 99%

“…In addition, autoencoder‐based deep learning approaches have been used to enhance the performance of 3D shape reconstruction. Sharma et al [15] developed an end‐to‐end reconstruction technique, known as a fully volumetric convolutional denoising autoencoder (VConv‐DAE). They used convolution layers to obtain a latent representation of the input object and a learnable upsampling convolution filter (i.e.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Trilateral convolutional neural network for 3D shape reconstruction of objects from a single depth view

et al. 2019

View full text Add to dashboard Cite

In this study, the authors propose a novel three‐dimensional (3D) convolutional neural network for shape reconstruction via a trilateral convolutional neural network (Tri‐CNN) from a single depth view. The proposed approach produces a 3D voxel representation of an object, derived from a partial object surface in a single depth image. The proposed Tri‐CNN combines three dilated convolutions in 3D to expand the convolutional receptive field more efficiently to learn shape reconstructions. To evaluate the proposed Tri‐CNN in terms of reconstruction performance, the publicly available ShapeNet and Big Data for Grasp Planning data sets are utilised. The reconstruction performance was evaluated against four conventional deep learning approaches: namely, fully connected convolutional neural network, baseline CNN, autoencoder CNN, and a generative adversarial reconstruction network. The proposed experimental results show that Tri‐CNN produces superior reconstruction results in terms of intersection over union values and Brier scores with significantly less number of model parameters and memory.

show abstract

Learning spectral normalized adversarial systems with stacked structure for high‐quality 3D object generation

Zhang

Qiu

Wang

et al. 2019

Concurrency and Computation

View full text Add to dashboard Cite

Summary This paper proposes a new method for generating 3D objects based on generative adversarial networks (GANs). Recently, GANs have been used in 3D object generation, but it is still very challenging to generate high‐quality 3D objects because of the complex data distribution over 3D objects. In this paper, we propose a system based on GAN that makes the generated objects more realistic. We use multiple generators and discriminators to enhance the ability of the model for learning complex distributions. Such a stacked structure can be considered as a coarse‐to‐fine or low‐to‐high–resolution mechanism. We employ the spectral normalization technology to control the Lipschitz constant of the discriminators by literally constraining the spectral norm of each layer to get a more stable training process. In this way, the proposed model can generate realistic and high‐quality 3D objects. Moreover, our system can also recover incomplete 3D objects into complete 3D objects. Experiments demonstrate that our model performs better in the quality of the generated objects than the baselines.

show abstract

VConv-DAE: Deep Volumetric Shape Learning Without Object Labels

Cited by 223 publications

References 26 publications

High Level 3D Structure Extraction from a Single Image Using a CNN-Based Approach

High Level 3D Structure Extraction from a Single Image Using a CNN-Based Approach

Trilateral convolutional neural network for 3D shape reconstruction of objects from a single depth view

Learning spectral normalized adversarial systems with stacked structure for high‐quality 3D object generation

Contact Info

Product

Resources

About