2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019
DOI: 10.1109/iccv.2019.00441
|View full text |Cite
|
Sign up to set email alerts
|

MVSCRF: Learning Multi-View Stereo With Conditional Random Fields

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
50
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 91 publications
(51 citation statements)
references
References 16 publications
0
50
0
1
Order By: Relevance
“…Earlier work in this area uses CNN's for two-view [121] and multi-view stereo [33]. Lately, the learning-based MVS rely on the construction of 3D cost volume and use the deep neural networks for regularization and depth regression [18,38,113,46,71,114,111]. As most of these approaches utilize 3D CNN for cost volume regularization -which in general is computationally expensive, the majority of the recent work is motivated to meet the computational requirement with it.…”
Section: Related Workmentioning
confidence: 99%
“…Earlier work in this area uses CNN's for two-view [121] and multi-view stereo [33]. Lately, the learning-based MVS rely on the construction of 3D cost volume and use the deep neural networks for regularization and depth regression [18,38,113,46,71,114,111]. As most of these approaches utilize 3D CNN for cost volume regularization -which in general is computationally expensive, the majority of the recent work is motivated to meet the computational requirement with it.…”
Section: Related Workmentioning
confidence: 99%
“…Yao et al [31] proposed to replace 3D CNNs with recurrent neural networks, which leads to improved memory efficiency. Xue et al [32] proposed MVSCRF, where multi-scale conditional random fields (MSCRFs) are adopted to constraint the smoothness of depth prediction explicitly. Instead of using voxel grids, in this paper we propose to use a point-based network for MVS tasks to take advantage of 3D geometry learning without being burdened by the inefficiency found in 3D CNN computation.…”
Section: Related Workmentioning
confidence: 99%
“…Recently, learning-based MVS methods [6,7,12,23,24,30,31,[33][34][35][36][37][38][39] have shown superior performance over traditional counterparts on MVS benchmarks [14,19]. These learning-based methods make use of convolutional neural networks (CNNs) to infer a depth map for each view, and carry out a separate multi-view depth fusion process to reconstruct 3D point clouds.…”
Section: Introductionmentioning
confidence: 99%
“…The lack of global context usually leads to local ambiguities in untextured or texture-less regions, thus reducing the robustness of matching. Although some recent works [31,34] try to obtain large context using deformable convolution or multi-scale information aggregation, the solution of mining the global context in each view has not been explored yet for MVS. Besides, in previous methods, the feature of each view is extracted independently from other views.…”
Section: Introductionmentioning
confidence: 99%