2019
DOI: 10.1609/aaai.v33i01.33018376
|View full text |Cite
|
Sign up to set email alerts
|

View Inter-Prediction GAN: Unsupervised Representation Learning for 3D Shapes by Learning Global Shape Memories to Support Local View Predictions

Abstract: In this paper we present a novel unsupervised representation learning approach for 3D shapes, which is an important research challenge as it avoids the manual effort required for collecting supervised data. Our method trains an RNN-based neural network architecture to solve multiple view inter-prediction tasks for each shape. Given several nearby views of a shape, we define view inter-prediction as the task of predicting the center view between the input views, and reconstructing the input views in a low-level… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
57
0
1

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
3
1

Relationship

2
7

Authors

Journals

citations
Cited by 121 publications
(58 citation statements)
references
References 6 publications
0
57
0
1
Order By: Relevance
“…MN40(%) MN10(%) 3DGAN [Wu and others, 2016] 83.3 91.0 PointNet++ [Qi and others, 2017] 91.9 -FoldingNet [Yang et al, 2018] 88.4 94.4 PANO [Sfikas and others, 2017] 90.7 91.1 Pairwise [Johns et al, 2016] 90.7 92.8 GIFT [Bai and others, 2017] 89.5 91.5 Domi [Wang and others, 2017] 92.2 -MVCNN [Su and others, 2015] 90.1 -Spherical [Cao et al, 2017] 93.31 -Rotation [Kanezaki et al, 2018] 92.37 94.39 SO-Net [Li and others, 2018] 90.9 94.1 SVSL [Han and others, 2019] 93.31 94.82 VIPGAN [Han et al, 2019a] 91 Attention visualization. We visualize the attention learned by 3DViewGraph under ModelNet40, which demonstrates how 3DViewGraph understands 3D shapes by analyzing views on a view graph.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…MN40(%) MN10(%) 3DGAN [Wu and others, 2016] 83.3 91.0 PointNet++ [Qi and others, 2017] 91.9 -FoldingNet [Yang et al, 2018] 88.4 94.4 PANO [Sfikas and others, 2017] 90.7 91.1 Pairwise [Johns et al, 2016] 90.7 92.8 GIFT [Bai and others, 2017] 89.5 91.5 Domi [Wang and others, 2017] 92.2 -MVCNN [Su and others, 2015] 90.1 -Spherical [Cao et al, 2017] 93.31 -Rotation [Kanezaki et al, 2018] 92.37 94.39 SO-Net [Li and others, 2018] 90.9 94.1 SVSL [Han and others, 2019] 93.31 94.82 VIPGAN [Han et al, 2019a] 91 Attention visualization. We visualize the attention learned by 3DViewGraph under ModelNet40, which demonstrates how 3DViewGraph understands 3D shapes by analyzing views on a view graph.…”
Section: Methodsmentioning
confidence: 99%
“…Global features of 3D shapes can be learned from raw 3D representations, such as meshes, voxels, and point clouds. As an alternative, a number of works in 3D shape analysis employed multiple views [Su and others, 2015;Han et al, 2019b] as raw 3D representation, exploiting the advantage that multiple * Corresponding author: Yu-Shen Liu views can facilitate understanding of both manifold and nonmanifold 3D shapes via computer vision techniques. Therefore, effectively and efficiently aggregating comprehensive information over multiple views, is critical for the discriminability of learned features, especially in deep learning models.…”
Section: Introductionmentioning
confidence: 99%
“…There are also methods jointly learning features from point clouds and multi-view projections [47]. It is also possible to treat point clouds and views as sequences [26,17,15], or to use unsupervised learning [16].…”
Section: Related Workmentioning
confidence: 99%
“…Recently, a series of unsupervised-learning models are proposed to learn from 3D point clouds [14], [15], [16], [17], [18]. For example, 3D GAN converts 3D points to 3D voxels [9], which introduces a lot of empty voxels and loses precision; LatentGAN handles 3D point clouds directly [10]; however, the decoder uses fully-connected layers, which does not explore specific geometric structures of 3D point clouds and requires a huge number of training parameters; and VIP-GAN uses recurrentneural-network-based architecture to solve multiple view interprediction tasks for each shape [19]; [20] learns a continuous signed distance function representation of a class of shapes that enables high quality shape representation, interpolation and completion from partial and noisy 3D input data. In this work, we use deep autoencoder to directly handle unorganized 3D points and propose graph-based operations to explore geometric structures of 3D point clouds.…”
Section: A Unsupervised Learningmentioning
confidence: 99%