2018 International Conference on 3D Vision (3DV) 2018
DOI: 10.1109/3dv.2018.00017
|View full text |Cite
|
Sign up to set email alerts
|

Sparse and Dense Data with CNNs: Depth Completion and Semantic Segmentation

Abstract: Convolutional neural networks are designed for dense data, but vision data is often sparse (stereo depth, point clouds, pen stroke, etc.). We present a method to handle sparse depth data with optional dense RGB, and accomplish depth completion and semantic segmentation changing only the last layer. Our proposal efficiently learns sparse features without the need of an additional validity mask. We show how to ensure network robustness to varying input sparsities. Our method even works with densities as low as 0… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
194
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
4
4
1

Relationship

1
8

Authors

Journals

citations
Cited by 271 publications
(196 citation statements)
references
References 25 publications
2
194
0
Order By: Relevance
“…Ma et al concatenated the sparse depth and color image as the inputs of an off-the-shelf network [26] and further explored the feasibility of self-supervised Li-DAR completion [23]. Moreover, [14,16,33,4] proposed different network architectures to better exploit the potential of the encoder-decoder framework. However, the encoderdecoder architecture tends to predict the depth maps comprehensively but fails to concentrate on the local areas.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Ma et al concatenated the sparse depth and color image as the inputs of an off-the-shelf network [26] and further explored the feasibility of self-supervised Li-DAR completion [23]. Moreover, [14,16,33,4] proposed different network architectures to better exploit the potential of the encoder-decoder framework. However, the encoderdecoder architecture tends to predict the depth maps comprehensively but fails to concentrate on the local areas.…”
Section: Related Workmentioning
confidence: 99%
“…With the advances of deep learning methods, many depth completion approaches based on convolutional neural networks (CNNs) have been proposed. The mainstream of these methods is to directly input the sparse depth maps (with/without color images) into an encoder-decoder network and predict dense depth maps [26,16,36,15,10,23,2]. These black-box methods force the CNN to learn a mapping from sparse depth measurements to dense maps, which is generally a challenging task and leads to unsatisfactory completion results, as shown in Fig.…”
Section: Introductionmentioning
confidence: 99%
“…Also, Ma [22] proposed a self-supervised framework without the need for dense labels, achieving great performance on KITTI [8] dataset. Some works combined semantic segmentation [16] to improve the prediction.…”
Section: Depth Reconstruction From Sparse Samplesmentioning
confidence: 99%
“…Triggered by the KITTI depth completion benchmark the following depth augmentation methods have been proposed: [14] explicitly tackle the problem of sparsity in depth maps originating from different depth sensing modalities and employ a convolutional encoder-decoder architecture to predict depth or semantic labels. [15] propose another unsupervised learning method to overcome the lack of suitable training data by employing an unsupervised adversarial learning framework for depth augmentation.…”
Section: A Inference Of 3d Structurementioning
confidence: 99%