2018
DOI: 10.1007/978-3-030-01237-3_48
|View full text |Cite
|
Sign up to set email alerts
|

ActiveStereoNet: End-to-End Self-supervised Learning for Active Stereo Systems

Abstract: In this paper we present ActiveStereoNet, the first deep learning solution for active stereo systems. Due to the lack of ground truth, our method is fully self-supervised, yet it produces precise depth with a subpixel precision of 1/30th of a pixel; it does not suffer from the common over-smoothing issues; it preserves the edges; and it explicitly handles occlusions. We introduce a novel reconstruction loss that is more robust to noise and texture-less patches, and is invariant to illumination changes. The pro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
99
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 100 publications
(99 citation statements)
references
References 55 publications
0
99
0
Order By: Relevance
“…Leveraging this data, StereoNet [25] uses a hierarchical disparity prediction network with a deep visual feature backbone which is capable of running at 60 FPS on a consumer GPU. Its successor [56] extends the work with self-supervision to the domain of active sensing while maintaining the core efficiency. We build upon their work to leverage this computational advantage.…”
Section: Monocular Visionmentioning
confidence: 98%
See 2 more Smart Citations
“…Leveraging this data, StereoNet [25] uses a hierarchical disparity prediction network with a deep visual feature backbone which is capable of running at 60 FPS on a consumer GPU. Its successor [56] extends the work with self-supervision to the domain of active sensing while maintaining the core efficiency. We build upon their work to leverage this computational advantage.…”
Section: Monocular Visionmentioning
confidence: 98%
“…To produce high quality depth maps, our architecture takes inspiration from two state-of-the-art pipelines for realtime disparity prediction, namely StereoNet [25] and Ac-tiveStereoNet [56] which estimate a subpixel precise lowresolution disparity map that is consecutively upsampled and refined with RGB-guidance from the reference image. Our depth estimation network consists of two Siamese towers with shared weights that extract deep image features at 1/8 of the stereo pair resolution following the architecture described in [56]. We construct a cost volume (CV) by concatenation of the displaced features along the epipolar lines of the rectified input images.…”
Section: Disparity Estimationmentioning
confidence: 99%
See 1 more Smart Citation
“…The proposed refinement network described in Table. 7 is inspired by the refinement procedures proposed in CRL [17], iResNet [12], StereoNet [8], and ActiveStere-oNet [31]. We adopted the basic architecture for refinement as described in StereoNet [8] with dilated residual blocks [28] to increase the receptive field of filtering without compromising resolution.…”
Section: D Dilation In Cost Filteringmentioning
confidence: 99%
“…Inspired by recent progress in self-supervised depth [5], [41], flow [12], [20] and representation learning [3], [24], we propose a novel approach for self-supervised image deblurring which only relies on real-world blurry image sequences for training. Self-supervised learning improves the network's generalization performance, by enabling the network to adapt to scenarios where ground truth sharp images are not available.…”
Section: Introductionmentioning
confidence: 99%