2017
DOI: 10.1109/tpami.2016.2574713
|View full text |Cite
|
Sign up to set email alerts
|

Learning Category-Specific Deformable 3D Models for Object Reconstruction

Abstract: We address the problem of fully automatic object localization and reconstruction from a single image. This is both a very challenging and very important problem which has, until recently, received limited attention due to difficulties in segmenting objects and predicting their poses. Here we leverage recent advances in learning convolutional networks for object detection and segmentation and introduce a complementary network for the task of camera viewpoint prediction. These predictors are very powerful, but s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
53
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 63 publications
(53 citation statements)
references
References 63 publications
0
53
0
Order By: Relevance
“…PASCAL3D+ [39] Similar to [16,35], we evaluate our method on the the PASCAL3D+ dataset which consists of PASCAL VOC and ImageNet images for 12 rigid object categories with a set of sparse keypoints annotated on each…”
Section: Datasetsmentioning
confidence: 99%
“…PASCAL3D+ [39] Similar to [16,35], we evaluate our method on the the PASCAL3D+ dataset which consists of PASCAL VOC and ImageNet images for 12 rigid object categories with a set of sparse keypoints annotated on each…”
Section: Datasetsmentioning
confidence: 99%
“…Alternatively, 3D shape can be also generated by deforming an initialization, which is more related to our work. Tulsiani et al [43] and Kanazawa et al [19] learn a category-specific 3D deformable model and reasons the shape deformations in different images. Wang et al [45] learn to deform an initial ellipsoid to the desired shape in a coarse to fine fashion.…”
Section: Related Workmentioning
confidence: 99%
“…Later work attempts to predict full 3D models as opposed to just depth maps using voxel mappings and joint classification, multiple 2D views of an object, and using an ensemble of complimentary schemes …”
Section: D Computer Vision With Deep Learningmentioning
confidence: 99%
“…Later work attempts to predict full 3D models as opposed to just depth maps using voxel mappings and joint classification, 39 multiple 2D views of an object, 48,49 and using an ensemble of complimentary schemes. 50 As discussed in the last section, Wu et al 39 develop a voxel prediction network for the purpose of 3D model classification given a single RGB-D image of the model. The proposed network is designed for classification and uses randomization (through Gibbs sampling) to predict the occluded points as free space or filled.…”
Section: Generationmentioning
confidence: 99%