Semantic Evidential Grid Mapping Using Monocular and Stereo Cameras

Richter, Sven; Wang, Yiqun; Beck, Johannes; Wirges, Sascha; Stiller, Christoph

doi:10.3390/s21103380

Cited by 14 publications

(9 citation statements)

References 30 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Currently, most of the work related to improving edge depth requires the introduction of an additional network, e.g., semantic segmentation [10][11][12], edge map detection networks [13][14][15], or optical flow [16]. We found that research on uncertainty, which has only recently entered the limelight, can also improve the quality of edge depth and without learning other complex networks.…”

Section: Introductionmentioning

confidence: 92%

The Constraints between Edge Depth and Uncertainty for Monocular Depth Estimation

Liang

et al. 2021

Electronics

View full text Add to dashboard Cite

The self-supervised monocular depth estimation paradigm has become an important branch of computer vision depth-estimation tasks. However, the depth estimation problem arising from object edge depth pulling or occlusion is still unsolved. The grayscale discontinuity of object edges leads to a relatively high depth uncertainty of pixels in these regions. We improve the geometric edge prediction results by taking uncertainty into account in the depth-estimation task. To this end, we explore how uncertainty affects this task and propose a new self-supervised monocular depth estimation technique based on multi-scale uncertainty. In addition, we introduce a teacher–student architecture in models and investigate the impact of different teacher networks on the depth and uncertainty results. We evaluate the performance of our paradigm in detail on the standard KITTI dataset. The experimental results show that the accuracy of our method increased from 87.7% to 88.2%, the AbsRel error rate decreased from 0.115 to 0.11, the SqRel error rate decreased from 0.903 to 0.822, and the RMSE error rate decreased from 4.863 to 4.686 compared with the benchmark Monodepth2. Our approach has a positive impact on the problem of texture replication or inaccurate object boundaries, producing sharper and smoother depth images.

show abstract

Section: Introductionmentioning

confidence: 92%

The Constraints between Edge Depth and Uncertainty for Monocular Depth Estimation

Liang

et al. 2021

Electronics

View full text Add to dashboard Cite

show abstract

“…None of the above-mentioned publications models occupancy and semantic estimates in a joint evidential context. In [19]- [21] a sensor grid mapping pipeline was presented estimating a BBA on a FOD containing ground and object hypotheses for range sensors and cameras. Here, we present an advancement of the evidential model and rethink the BBA estimation.…”

Section: B Grid Mapping With Camerasmentioning

confidence: 99%

Mapping LiDAR and Camera Measurements in a Dual Top-View Grid Representation Tailored for Automated Vehicles

Richter¹,

Bieder²,

Wirges³

et al. 2022

Preprint

Self Cite

View full text Add to dashboard Cite

We present a generic evidential grid mapping pipeline designed for imaging sensors such as LiDAR and cameras. Our grid-based evidential model contains semantic estimates for cell occupancy and ground separately. We specify the estimation steps for input data represented by point sets, but mainly focus on input data represented by images such as disparity maps or LiDAR range images. Instead of relying on an external ground segmentation only, we deduce occupancy evidence by analyzing the surface orientation around measurements. We conduct experiments and evaluate the presented method using LiDAR and stereo camera data recorded in real traffic scenarios. Our method estimates cell occupancy robustly and with a high level of detail while maximizing efficiency and minimizing the dependency to external processing modules.

show abstract

“…Since semantic landmarks are associated with Wi-Fi landmarks, Wi-Fi fingerprinting is used to determine a one-to-one association between semantic landmarks. Additionally, the distance between semantic landmarks is calculated using Equations ( 6)- (8). Utilizing pre-matching significantly improves the efficiency of matching landmarks.…”

Section: 𝑆𝑐𝑜𝑟𝑒mentioning

confidence: 99%

“…Scholars have proposed many sensor-based mapping solutions, such as lidar-based [7], camera-based [8], Wi-Fi-based, inertial measurement unit (IMU)-based, and magneticbased [9], to solve the mapping problem in unknown indoor environments. Due to these sensors' inherent characteristics, single-sensor-based map construction methods have limited application scopes.…”

Section: Introductionmentioning

confidence: 99%

Crowdsourcing-Based Indoor Semantic Map Construction and Localization Using Graph Optimization

Chai

Yang

et al. 2022

Sensors

View full text Add to dashboard Cite

The advancement of smartphones with multiple built-in sensors facilitates the development of crowdsourcing-based indoor map construction and localization. This paper proposes a crowdsourcing-based indoor semantic map construction and localization method using graph optimization. Using waypoints, semantic landmarks, and Wi-Fi landmarks as nodes and the relevance between waypoints and landmarks (i.e., waypoint–waypoint, waypoint–semantic, waypoint–Wi-Fi, semantic–semantic, and Wi-Fi–Wi-Fi) as edges, the optimization graph is constructed. Initializing the venue map is the single-track semantic map with the highest quality, as determined by a proposed map quality evaluation function. The aligned venue and candidate maps are optimized while satisfying the constraints, with the candidate map exhibiting the highest degree of similarity to the venue map. The lightweight venue map is then updated in terms of waypoint and landmark attributes, as well as the relationship between waypoints and landmarks. To determine a pedestrian’s location on a venue map, similarities between a local map and a venue map are evaluated. Experiments conducted in an office building and shopping mall scenes demonstrate that crowdsourcing-based venue maps are superior to single-track semantic maps. Additionally, the landmark matching-based localization method can achieve a mean localization error of less than 0.5 m on the venue map, compared to 0.6 m in a single-track semantic map.

show abstract

Semantic Evidential Grid Mapping Using Monocular and Stereo Cameras

Cited by 14 publications

References 30 publications

The Constraints between Edge Depth and Uncertainty for Monocular Depth Estimation

The Constraints between Edge Depth and Uncertainty for Monocular Depth Estimation

Mapping LiDAR and Camera Measurements in a Dual Top-View Grid Representation Tailored for Automated Vehicles

Crowdsourcing-Based Indoor Semantic Map Construction and Localization Using Graph Optimization

Contact Info

Product

Resources

About