LiDAR-Flow: Dense Scene Flow Estimation from Sparse LiDAR and Stereo Images

Sun

IEEE Open J. Intell. Transp. Syst.

2024

Dynamic object detection, state estimation, and map building are crucial for autonomous robot systems and intelligent transportation applications in urban scenarios. Most current LiDAR Simultaneous Localization and Mapping (SLAM) systems assume that the observed environment is static, so dynamic objects in the environment can compromise their overall accuracy and robustness. To address the inaccurate odometry estimation and incorrect mapping produced by existing LiDAR SLAM methods that cannot detect dynamic objects, we study the SLAM problem for robots and unmanned vehicles equipped with LiDAR traveling in dynamic urban scenes. We propose a fast, LiDAR-only, model-free dynamic object detection method that uses the spatial and temporal information of the point cloud through a convolutional neural network (CNN); detection accuracy is improved by 35% to 86% compared with methods that use only spatial information. We further integrate it into a state-of-the-art LiDAR SLAM framework to improve SLAM performance. First, the range image constructed from the LiDAR point cloud is used for ground extraction and non-ground point clustering. Then, the motion of objects in the scene is estimated from the difference between adjacent frames, and the segmented objects are classified as dynamic or static by their motion features. After that, stable feature points are extracted from the static objects. Finally, the pose transformation between adjacent frames is solved by matching feature point pairs. We evaluated the accuracy and robustness of our system on datasets with different challenging dynamic environments, and the results show significant improvements in the accuracy and robustness of odometry and mapping while still maintaining real-time performance, which is sufficient for autonomous robot systems and intelligent transportation applications in urban scenarios.
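The first two steps of the pipeline described in this abstract (building a range image from a LiDAR scan, then comparing adjacent frames) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `to_range_image` and `changed_pixels`, the 16 x 360 image size, the ±15° vertical field of view, and the 0.5 m threshold are all assumptions chosen for illustration, and a plain thresholded range difference stands in for the CNN-based motion estimation the abstract refers to.

```python
import numpy as np


def to_range_image(points, rows=16, cols=360, fov_up_deg=15.0, fov_down_deg=-15.0):
    """Project an (N, 3) point cloud onto a rows x cols range image.

    Pixels without a LiDAR return are 0. Sensor geometry is hypothetical.
    """
    points = np.asarray(points, dtype=float)
    rng = np.linalg.norm(points, axis=1)
    points, rng = points[rng > 0], rng[rng > 0]  # drop points at the origin

    yaw = np.arctan2(points[:, 1], points[:, 0])   # horizontal angle in [-pi, pi]
    pitch = np.arcsin(points[:, 2] / rng)          # vertical angle

    fov_up, fov_down = np.radians(fov_up_deg), np.radians(fov_down_deg)
    u = ((yaw + np.pi) / (2 * np.pi) * cols).astype(int) % cols
    v = np.floor((fov_up - pitch) / (fov_up - fov_down) * rows).astype(int)

    inside = (v >= 0) & (v < rows)  # discard points outside the vertical FOV
    image = np.full((rows, cols), np.inf)
    # Keep the nearest return when several points fall into one pixel.
    np.minimum.at(image, (v[inside], u[inside]), rng[inside])
    image[np.isinf(image)] = 0.0
    return image


def changed_pixels(prev_image, curr_image, threshold=0.5):
    """Mark pixels observed in both frames whose range changed by more than threshold."""
    both = (prev_image > 0) & (curr_image > 0)
    return both & (np.abs(curr_image - prev_image) > threshold)
```

The sketch ignores ego-motion: in a real system the previous frame would have to be transformed into the current sensor pose before the comparison, otherwise static structure also appears to move.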

Section: B Lidar Dynamic Object Detection (mentioning)

confidence: 99%

DLOAM: Real-time and Robust LiDAR SLAM System Based on CNN in Dynamic Urban Environments

Sun

IEEE Open J. Intell. Transp. Syst.

2024

“…Camera-LiDAR Fusion Cameras and LiDARs have complementary characteristics, facilitating many computer vision tasks, such as depth estimation [13,30,55], scene flow estimation [2,41], 3D object detection [10,27,36,45,51], etc. Some researchers [2,36,45,55] build a modular network and perform result-level fusion, while the others [13,27,30,41,51] explore feature-level fusion schemes including early-fusion and late-fusion. Instead, we propose a multi-stage and bidirectional fusion pipeline, which not only fully utilizes the characteristic of each modality, but maximizes the inter-modality complementarity as well.…”

Section: Related Work (mentioning)

confidence: 99%

CamLiFlow: Bidirectional Camera-LiDAR Fusion for Joint Optical Flow and Scene Flow Estimation

Liu, Lü, Xu et al.

2021

Preprint

In this paper, we study the problem of jointly estimating optical flow and scene flow from synchronized 2D and 3D data. Previous methods either employ a complex pipeline that splits the joint task into independent stages, or fuse 2D and 3D information in an "early-fusion" or "late-fusion" manner. Such one-size-fits-all approaches face a dilemma: they fail either to fully utilize the characteristics of each modality or to maximize the inter-modality complementarity. To address the problem, we propose a novel end-to-end framework, called CamLiFlow. It consists of 2D and 3D branches with multiple bidirectional connections between them in specific layers. Different from previous work, we apply a point-based 3D branch to better extract the geometric features and design a symmetric learnable operator to fuse dense image features and sparse point features. We also propose a transformation for point clouds to solve the non-linear issue of 3D-2D projection. Experiments show that CamLiFlow achieves better performance with fewer parameters. Our method ranks 1st on the KITTI Scene Flow benchmark, outperforming the previous art with 1/7 of the parameters. Code will be made available.

“…RIC-FLow does not use the raw matches directly but generate a superpixel flow from input matching to improve the efficiency of the model estimation. This concept was also adapted for scene flow estimation from sparse LiDAR and RGB image input [15]. Our approach, which we describe in detail in Section III, applies the superpixel method in the context of depth completion.…”

Section: Related Work (mentioning)

confidence: 99%

PDC: Piecewise Depth Completion utilizing Superpixels

Teutscher, Mangat, Wasenmüller

2021

Preprint

Depth completion from sparse LiDAR and high-resolution RGB data is one of the foundations for autonomous driving techniques. Current approaches often rely on CNN-based methods with several known drawbacks: flying pixels at depth discontinuities, overfitting to both a given dataset and an error metric, and many more. Thus, we propose our novel Piecewise Depth Completion (PDC), which works completely without deep learning. PDC segments the RGB image into superpixels corresponding to regions with similar depth values. Superpixels corresponding to the same objects are gathered using a cost map. In the end, we obtain detailed depth images with state-of-the-art accuracy. In our evaluation, we show both the influence of the individual proposed processing steps and the overall performance of our method on the challenging KITTI dataset.
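The piecewise idea in this abstract, where each superpixel is treated as a region of similar depth, can be sketched in a few lines of NumPy. This is a deliberately simplified illustration, not the PDC algorithm: the function name `fill_by_segment` is hypothetical, the superpixel label map is assumed to be given, each segment is filled with the median of its LiDAR samples (a constant-depth assumption), and the cost map that gathers superpixels of the same object is not reproduced.

```python
import numpy as np


def fill_by_segment(labels, sparse_depth):
    """Fill each superpixel with the median of its sparse depth samples.

    labels:       (H, W) integer superpixel ids (assumed to be precomputed)
    sparse_depth: (H, W) depth map, 0 where there is no LiDAR return
    Segments without any sample stay 0.
    """
    labels = np.asarray(labels)
    sparse_depth = np.asarray(sparse_depth, dtype=float)
    dense = np.zeros_like(sparse_depth)
    for seg_id in np.unique(labels):
        mask = labels == seg_id
        samples = sparse_depth[mask]
        samples = samples[samples > 0]  # keep only real measurements
        if samples.size:
            dense[mask] = np.median(samples)
    return dense
```

A constant depth per segment cannot represent slanted surfaces such as roads. Fitting a plane per segment would be a natural refinement, though whether PDC does this is not stated in the abstract.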