Depth Completion From Sparse LiDAR Data With Depth-Normal Constraints

Xu, Yan; Zhu, Xinge; Shi, Jianping; Zhang, Guofeng; Bao, Hujun; Li, Hongsheng

doi:10.1109/iccv.2019.00290

Cited by 236 publications

(151 citation statements)

References 41 publications

(61 reference statements)

Supporting

Mentioning

149

Contrasting

Unclassified

Order By: Relevance

“…First, UberATG-Fusenet [29] was divided because it exploits the 3D point cloud of LiDAR data. DeepLidar [26] and PwP [27] use the surface normal information to determine the representation power for the depth completion task. Then, we divided the model trained with only the official dataset and the model trained with the official dataset and additional data.…”

Section: ) Comparison To State-of-the-art Methodsmentioning

confidence: 99%

“…Ma et al [20] developed a deep regression model with an early fusion method for direct mapping from the sparse depth to the dense depth and a self-supervised training framework. The surface normal was introduced by [26], [27] to obtain more accurate three-dimensional (3D) geometric information for the depth completion task. The confidence mask for refining the results from the network was the same as in [19].…”

Section: Related Work a Depth Completionmentioning

confidence: 99%

See 1 more Smart Citation

Deep Architecture With Cross Guidance Between Single Image and Sparse LiDAR Data for Depth Completion

Lee

Kim

et al. 2020

IEEE Access

View full text Add to dashboard Cite

It is challenging to apply depth maps generated from sparse laser scan data to computer vision tasks, such as robot vision and autonomous driving, because of the sparsity and noise in the data. To overcome this problem, depth completion tasks have been proposed to produce a dense depth map from sparse LiDAR data and a single RGB image. In this study, we developed a deep convolutional architecture with cross guidance for multi-modal feature fusion to compensate for the lack of representation power of their modality. Two encoders, which are part of the proposed architecture, receive different modalities as inputs. They interact with each other by exchanging information in each stage through the attention mechanism during encoding. We also propose a residual atrous spatial pyramid block, comprising multiple dilated convolutions with different dilation rates, which are used to derive highly significant features. The experimental results of the KITTI depth completion benchmark dataset demonstrate that the proposed architecture shows higher performance than that of the other models trained in a two-dimensional space without pre-training or finetuning other datasets. INDEX TERMS Depth estimation, depth completion, LiDAR data, cross guidance, multi-scale dilated convolutional block. Recently, artificial neural network models with deep learning have been used in state-of-the-art technologies of pattern recognition and machine learning. In particular, convolutional neural networks (CNNs) exhibit excellent performance in many computer vision tasks. While conventional CNNs [3]-[5] comprise blocks of stacking convolution

show abstract

Section: ) Comparison To State-of-the-art Methodsmentioning

confidence: 99%

Section: Related Work a Depth Completionmentioning

confidence: 99%

Deep Architecture With Cross Guidance Between Single Image and Sparse LiDAR Data for Depth Completion

Lee

Kim

et al. 2020

IEEE Access

View full text Add to dashboard Cite

show abstract

“…Gansbeke et al [33] proposed the use of confidences to fuse two network streams utilizing sparse depth and RGB images respectively. Similarly, Xu et al [36] predict a confidence mask that is used to mitigate the impact of noisy measurements on different components of their network. However, none of these methods provided any prediction uncertainty measure for the final prediction.…”

Section: Related Workmentioning

confidence: 99%

“…Modeling uncertainty for this task is crucial due to the inherent noisy and sparse nature of depth sensors, caused by multi-path interference and depth ambiguities [11]. Previous approaches proposed to learn some intermediate confidence masks to mitigate the impact of disturbed measurements inside their networks [28,33,36]. However, none of these approaches has demonstrated the probabilistic validity of the intermediate confidence masks.…”

Section: Introductionmentioning

confidence: 99%

Uncertainty-Aware CNNs for Depth Completion: Uncertainty from Beginning to End

Eldesokey

Felsberg

Holmquist

et al. 2020

2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

115

View full text Add to dashboard Cite

The focus in deep learning research has been mostly to push the limits of prediction accuracy. However, this was often achieved at the cost of increased complexity, raising concerns about the interpretability and the reliability of deep networks. Recently, an increasing attention has been given to untangling the complexity of deep networks and quantifying their uncertainty for different computer vision tasks. Differently, the task of depth completion has not received enough attention despite the inherent noisy nature of depth sensors. In this work, we thus focus on modeling the uncertainty of depth data in depth completion starting from the sparse noisy input all the way to the final prediction. We propose a novel approach to identify disturbed measurements in the input by learning an input confidence estimator in a self-supervised manner based on the normalized convolutional neural networks (NCNNs). Further, we propose a probabilistic version of NCNNs that produces a statistically meaningful uncertainty measure for the final prediction. When we evaluate our approach on the KITTI dataset for depth completion, we outperform all the existing Bayesian Deep Learning approaches in terms of prediction accuracy, quality of the uncertainty measure, and the computational efficiency. Moreover, our small network with 670k parameters performs on-par with conventional approaches with millions of parameters. These results give strong evidence that separating the network into parallel uncertainty and prediction streams leads to state-of-the-art performance with accurate uncertainty estimates.

show abstract

“…Comparing with binary mask, this strategy allows network to adaptive adjust confidence signal and maximize output probability at the same time. Xu et al [18] utilize the constraint between surface normal and depth map to guide the depth estimation process. This constraint is formed by the distances between piece-wise planes and the origin.…”

Section: B Depth Completion From Rgbd Datamentioning

confidence: 99%

Revisiting Sparsity Invariant Convolution: A Network for Image Guided Depth Completion

Lin

Belyaev

2020

IEEE Access

View full text Add to dashboard Cite

The limitation of LiDAR (Light Detection And Ranging) sensor causes the general sparsity of produced depth measurement. However, the sparse representation of the world is insufficient for applications such as 3D reconstruction. Thus, depth completion is an important computer vision task in which a synchronized RGB image is commonly available. In this paper, we propose a deep neural network to tackle this image guided depth completion problem. By revisiting the sparsity invariant convolution and revealing how it can be used in a novel approach, we propose three mask aware operations to process, downscale, and fuse sparse features. These operations explicitly consider the observation mask of its corresponding feature map. In addition, the structure of this network follows a novel scheme in which data from image and depth domain are processed by these proposed operations independently. Our proposed model achieves state-of-the-art performance on the KITTI depth completion benchmark. Furthermore, it presents a strong robustness for bearing input sparsity under different densities and patterns.

show abstract

Depth Completion From Sparse LiDAR Data With Depth-Normal Constraints

Cited by 236 publications

References 41 publications

Deep Architecture With Cross Guidance Between Single Image and Sparse LiDAR Data for Depth Completion

Deep Architecture With Cross Guidance Between Single Image and Sparse LiDAR Data for Depth Completion

Uncertainty-Aware CNNs for Depth Completion: Uncertainty from Beginning to End

Revisiting Sparsity Invariant Convolution: A Network for Image Guided Depth Completion

Contact Info

Product

Resources

About