2018
DOI: 10.1007/978-3-030-01252-6_4
|View full text |Cite
|
Sign up to set email alerts
|

Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images

Abstract: We propose an end-to-end deep learning architecture that produces a 3D shape in triangular mesh from a single color image. Limited by the nature of deep neural network, previous methods usually represent a 3D shape in volume or point cloud, and it is non-trivial to convert them to the more ready-to-use mesh model. Unlike the existing methods, our network represents 3D mesh in a graph-based convolutional neural network and produces correct geometry by progressively deforming an ellipsoid, leveraging perceptual … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
1,147
0
2

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 1,121 publications
(1,206 citation statements)
references
References 29 publications
1
1,147
0
2
Order By: Relevance
“…Tulsianni et al [29] introduced ray-tracing into the picture to predict multiple semantics from an image including a 3D voxel model. Howbeit, voxel representation is known to be inefficient and computationally unfriendly [4,30]. For mesh representation, Wang et al [30] gradually deformed an elliptical mesh given an input image by using graph convolution, but mesh representation requires overhead construction, and graph convolution may result in computing redundancy as masking is needed.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Tulsianni et al [29] introduced ray-tracing into the picture to predict multiple semantics from an image including a 3D voxel model. Howbeit, voxel representation is known to be inefficient and computationally unfriendly [4,30]. For mesh representation, Wang et al [30] gradually deformed an elliptical mesh given an input image by using graph convolution, but mesh representation requires overhead construction, and graph convolution may result in computing redundancy as masking is needed.…”
Section: Related Workmentioning
confidence: 99%
“…Howbeit, voxel representation is known to be inefficient and computationally unfriendly [4,30]. For mesh representation, Wang et al [30] gradually deformed an elliptical mesh given an input image by using graph convolution, but mesh representation requires overhead construction, and graph convolution may result in computing redundancy as masking is needed. There has been a number of studies trying to reconstruct objects without 3D supervision [9,19,28].…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Another possible solution is to use the shape‐from‐shading (SFS) technique to recover water surfaces from videos, but this does not handle highlights and occlusions well and brings geometric distortions. Machine learning–based methods have shown promising progress in three‐dimensional (3D) reconstruction and depth estimation recently. Moreover, some machine learning–based fluid simulators have also been developed .…”
Section: Introductionmentioning
confidence: 99%