2021 IEEE International Conference on Multimedia and Expo (ICME) 2021
DOI: 10.1109/icme51207.2021.9428171
|View full text |Cite
|
Sign up to set email alerts
|

Vanet: a View Attention Guided Network for 3d Reconstruction from Single and Multi-View Images

Abstract: Reconstructing 3D meshes of objects from 2D images is an important but challenging task. Previous 3D reconstruction methods either only focus on generating the mesh from a single image, or multi-view images. Instead of investigating these problems separately, we present a novel view attention guided network called VANet which addresses both single and multi-view 3D reconstruction under a unified framework. To explore non-visible parts of an object during the reconstruction, a channel-wise view attention mechan… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
12
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(13 citation statements)
references
References 13 publications
0
12
0
1
Order By: Relevance
“…The loss function consists of the absolute squared difference between the initial depth map and the GT (Ground Truth) depth map and the absolute squared difference between the refined depth map and the GT depth map, and the expression of the loss function is Eq. (8).…”
Section: Loss Functionmentioning
confidence: 99%
“…The loss function consists of the absolute squared difference between the initial depth map and the GT (Ground Truth) depth map and the absolute squared difference between the refined depth map and the GT depth map, and the expression of the loss function is Eq. (8).…”
Section: Loss Functionmentioning
confidence: 99%
“…Wen Chao et al established a multi-view deformation network based on Pixel2mesh network and realized the 3D reconstruction of neural network for graph convolutional [19]. Yuan et al added the auxiliary view features with weights to the main view features, deformed the preliminary reconstructed 3D model using the graph convolutional neural network, and gradually refined the grid model [20] combined with multiple image features. However, the multi-view 3D reconstruction method based entirely on deep learning performs poorly in reconstructing the model details.…”
Section: Introductionmentioning
confidence: 99%
“…Existing research on single-view reconstruction employs various 3D shape representations, including implicit functions [4][5][6][7], voxels [8][9][10][11], point clouds [12][13][14], meshes [15][16][17][18][19][20][21][22][23], and others. Voxel-based methods are challenging to reconstruct high-precision shapes due to the large amount of memory and computational time required to apply 3D CNN.…”
Section: Introductionmentioning
confidence: 99%