The goal of our work is to complete the depth channel of an RGB-D image. Commodity-grade depth cameras often fail to sense depth for shiny, bright, transparent, and distant surfaces. To address this problem, we train a deep network that takes an RGB image as input and predicts dense surface normals and occlusion boundaries. Those predictions are then combined with the raw depth observations provided by the RGB-D camera to solve for the depths of all pixels, including those missing from the original observation. This approach was chosen over alternatives (e.g., inpainting depths directly) after extensive experiments with a new depth completion benchmark dataset, in which holes in the training data are filled by rendering surface reconstructions created from multiview RGB-D scans. Experiments with different network inputs, depth representations, loss functions, optimization methods, inpainting methods, and deep depth estimation networks show that our proposed approach produces better depth completions than these alternatives.
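As a minimal sketch of this two-stage design (the names normal_net, boundary_net, and solve_depth are illustrative placeholders rather than the actual modules; one possible form of solve_depth is sketched under "Depth representation" below):

def complete_depth(rgb, raw_depth, normal_net, boundary_net, solve_depth):
    """Two-stage completion: color-only predictions, then fusion with raw depth."""
    normals = normal_net(rgb)       # (H, W, 3) unit surface normals
    boundary = boundary_net(rgb)    # (H, W) occlusion-boundary probability
    # The prediction network sees only color; the raw depth enters only
    # through the global optimization, which keeps observed pixels near
    # their sensed values and fills in missing ones (encoded here as 0).
    return solve_depth(raw_depth, normals, boundary)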
Depth representation: The obvious approach to our problem is to use the new dataset as supervision to train a fully convolutional network that regresses depth directly from RGB-D. However, that approach does not work very well, especially for large holes like the one shown in the bottom row of Figure 1. Estimating absolute depth from a monocular color image is difficult even for people [53]. Instead, we train the network to predict only local differential properties of depth (surface normals and occlusion boundaries), which are much easier to estimate [35]. We then solve for the absolute depths with a global optimization.
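One concrete way to pose such a global optimization is as a sparse linear least-squares problem over per-pixel depths. The sketch below is illustrative rather than our exact energy: it converts each predicted normal into target depth gradients under an orthographic approximation, down-weights those constraints near predicted occlusion boundaries, and treats valid raw depths as soft data constraints; w_data and w_normal are placeholder weights.

import numpy as np
import scipy.sparse as sp
from scipy.sparse.linalg import spsolve

def solve_depth(raw_depth, normals, boundary, w_data=1e3, w_normal=1.0):
    """Sparse linear least squares for dense depth (illustrative sketch).

    Assumes missing raw depth is encoded as 0, normals satisfy nz > 0
    (pointing toward the camera), and boundary lies in [0, 1]. Under an
    orthographic approximation, a unit normal (nx, ny, nz) implies the
    target gradients dz/dx = -nx/nz and dz/dy = -ny/nz.
    """
    h, w = raw_depth.shape
    n = h * w
    idx = np.arange(n).reshape(h, w)
    rows, cols, vals, rhs = [], [], [], []
    eq = 0

    def add(r, c, v):
        rows.append(r)
        cols.append(c)
        vals.append(v)

    # Data term: keep solved depth close to valid raw observations.
    for p in idx[raw_depth > 0]:
        add(eq, p, w_data)
        rhs.append(w_data * raw_depth.flat[p])
        eq += 1

    nz = np.clip(normals[..., 2], 1e-3, None)
    gx = -normals[..., 0] / nz
    gy = -normals[..., 1] / nz
    # Trust the normal constraints less where an occlusion boundary is
    # likely, so the solved depth is allowed to jump there.
    conf = w_normal * (1.0 - boundary)

    for y in range(h):                      # D(y, x+1) - D(y, x) = gx
        for x in range(w - 1):
            c = conf[y, x]
            add(eq, idx[y, x + 1], c)
            add(eq, idx[y, x], -c)
            rhs.append(c * gx[y, x])
            eq += 1
    for y in range(h - 1):                  # D(y+1, x) - D(y, x) = gy
        for x in range(w):
            c = conf[y, x]
            add(eq, idx[y + 1, x], c)
            add(eq, idx[y, x], -c)
            rhs.append(c * gy[y, x])
            eq += 1

    A = sp.csr_matrix((vals, (rows, cols)), shape=(eq, n))
    b = np.asarray(rhs)
    d = spsolve((A.T @ A).tocsc(), A.T @ b)  # solve the normal equations
    return d.reshape(h, w)

Because every term is linear in the unknown depths, the whole image can be solved in one pass with a sparse factorization, which is what makes predicting only differential quantities practical.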
Deep network design: There is no previous work studying how best to design and train an end-to-end deep network for completing depth images from RGB-D inputs. At first glance, it seems straightforward to extend previous deep depth estimation networks to this task.