2021 3rd International Conference on Video, Signal and Image Processing 2021
DOI: 10.1145/3503961.3503978
|View full text |Cite
|
Sign up to set email alerts
|

Attention-based Multi-View Stereo Network

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(8 citation statements)
references
References 13 publications
0
8
0
Order By: Relevance
“…MVSNet [42] applies a 3D CNN on a plane-swept cost volume at the reference view for depth estimation, achieving high-quality 3D reconstruction that outperforms classical traditional methods [13,32]. Following works have extended this technique with recurrent plane sweeping [43], point-based densification [7], confidence-based aggregation [26], and multiple cost volumes [8,15], improving the reconstruction quality. We propose to combine the cost-volume based deep MVS technique with differentiable volume rendering, enabling efficient reconstruction of radiance fields for neural rendering.…”
Section: Related Workmentioning
confidence: 99%
“…MVSNet [42] applies a 3D CNN on a plane-swept cost volume at the reference view for depth estimation, achieving high-quality 3D reconstruction that outperforms classical traditional methods [13,32]. Following works have extended this technique with recurrent plane sweeping [43], point-based densification [7], confidence-based aggregation [26], and multiple cost volumes [8,15], improving the reconstruction quality. We propose to combine the cost-volume based deep MVS technique with differentiable volume rendering, enabling efficient reconstruction of radiance fields for neural rendering.…”
Section: Related Workmentioning
confidence: 99%
“…Earlier work in this area uses CNN's for two-view [121] and multi-view stereo [33]. Lately, the learning-based MVS rely on the construction of 3D cost volume and use the deep neural networks for regularization and depth regression [18,38,113,46,71,114,111]. As most of these approaches utilize 3D CNN for cost volume regularization -which in general is computationally expensive, the majority of the recent work is motivated to meet the computational requirement with it.…”
Section: Related Workmentioning
confidence: 99%
“…Few methods attempt to address it by down-sampling the input [114,111]. Other attempts to improve the computational requirements uses sequential processing of cost volume [115], cascade of 3D cost volumes [19,32,112,105], small cost volume with point-based refinement [18], sparse cost volume with RGB and 2D CNN to densify the result [120], learning-based patch-wise matching [71,105] with RGB guided depth map super-resolution [105].…”
Section: Related Workmentioning
confidence: 99%
“…The learned representation shows more robustness to low-texture regions and various lightings [22,45,47,7,32].…”
Section: Classical Stereo Matchingmentioning
confidence: 99%
“…Multi-view stereo (MVS) is one of the most fundamental problems in computer vision and has been studied over decades. Recently, learning-based MVS methods have witnessed significant improvement against their traditional counterparts [45,23,47,7]. In general, these methods formulate the task as an optimization problem, where the target is to minimize the overall summation of pixelwise depth discrepancy.…”
Section: Introductionmentioning
confidence: 99%