2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.00567
|View full text |Cite
|
Sign up to set email alerts
|

Recurrent MVSNet for High-Resolution Multi-View Stereo Depth Inference

Abstract: Deep learning has recently demonstrated its excellent performance for multi-view stereo (MVS). However, one major limitation of current learned MVS approaches is the scalability: the memory-consuming cost volume regularization makes the learned MVS hard to be applied to highresolution scenes. In this paper, we introduce a scalable multi-view stereo framework based on the recurrent neural network. Instead of regularizing the entire 3D cost volume in one go, the proposed Recurrent Multi-view Stereo Network (R-MV… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
529
0

Year Published

2019
2019
2020
2020

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 552 publications
(529 citation statements)
references
References 29 publications
0
529
0
Order By: Relevance
“…Im et al [24] applied a plane sweeping approach to build a cost volume from deep features, then regularized the cost volume via a context-aware aggregation to improve depth regression. Very recently, Yao et al [31] introduced a scalable MVS framework based on the recurrent neural network to reduce the memory-consuming. Unsupervised Geometric Learning: Unsupervised learning has been developed in monocular depth estimation and binocular stereo matching by exploiting the photometric consistency and regularization.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Im et al [24] applied a plane sweeping approach to build a cost volume from deep features, then regularized the cost volume via a context-aware aggregation to improve depth regression. Very recently, Yao et al [31] introduced a scalable MVS framework based on the recurrent neural network to reduce the memory-consuming. Unsupervised Geometric Learning: Unsupervised learning has been developed in monocular depth estimation and binocular stereo matching by exploiting the photometric consistency and regularization.…”
Section: Related Workmentioning
confidence: 99%
“…Recently, the success of deep convolutional neural networks (CNNs) in monocular depth estimation [18,9,19] and binocular depth estimation [33,34] has been extended to MVS. Existing deep CNNs based MVS approaches [30,31,11,24] tend to represent MVS as an end-to-end regression problem. By exploiting large-scale ground truth 3D training data, these methods outperform traditional geometrybased approaches and dominate the leading boards on different benchmarking datasets [30,31].…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…Given multiple images with known camera poses and intrinsic calibration, DeepMVS [10] generates cost volumes using learned feature maps and then estimates the disparity map by fusing multiple cost volumes. MVDepthNet [11], DPSNet [12] and MVSNet [13], [14] solve the same reconstruction problem but differ in the calculation of cost volumes and the structure of networks. On the other hand, given an RGB-D keyframe, DeepTAM [15] incrementally tracks the pose of a camera using synthetic viewpoints and can further estimate the depth map of the tracked frame.…”
Section: Related Workmentioning
confidence: 99%