Real-Time Road Segmentation Using LiDAR Data Processing on an FPGA

Lyu, Yecheng; Bai, Lin; Huang, Xinming

doi:10.1109/iscas.2018.8351244

Cited by 49 publications

(38 citation statements)

References 36 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…These networks are not suitable for embedded systems because of their network size and complexity. Thus, some studies proposed specific lightweight networks [11,32] for semantic segmentation to be implemented on an FPGA. Lyu et al [32] proposed a small network for road segmentation with light detection and ranging (LiDAR), and its FPGA implementation meets a real-time processing requirement (59.2 FPS).…”

Section: Fpga Implementation For Cnn-based Semantic Segmentationmentioning

confidence: 99%

“…Thus, some studies proposed specific lightweight networks [11,32] for semantic segmentation to be implemented on an FPGA. Lyu et al [32] proposed a small network for road segmentation with light detection and ranging (LiDAR), and its FPGA implementation meets a real-time processing requirement (59.2 FPS). In exchange for the FPGA realization by the proposed small model, it can deal with road class only, and therefore the implementation challenges of the task for many categories still remain.…”

Section: Fpga Implementation For Cnn-based Semantic Segmentationmentioning

confidence: 99%

See 1 more Smart Citation

FPGA-Based Inter-layer Pipelined Accelerators for Filter-Wise Weight-Balanced Sparse Fully Convolutional Networks with Overlapped Tiling

Shimoda

Sada

Nakahara

2021

J Sign Process Syst

View full text Add to dashboard Cite

Convolutional neural networks (CNNs) exhibit state-of-the-art performance while performing computer-vision tasks. CNNs require high-speed, low-power, and high-accuracy hardware for various scenarios, such as edge environments. However, the number of weights is so large that embedded systems cannot store them owing to their limited on-chip memory. A different method is used to minimize the input image size, for real-time processing, but it causes a considerable drop in accuracy. Although pruned sparse CNNs and special accelerators are proposed, the requirement of random access incurs a large number of wide multiplexers for a high degree of parallelism, which becomes more complicated and unsuitable for FPGA implementation. To address this problem, we propose filter-wise pruning with distillation and block RAM (BRAM)-based zero-weight skipping accelerator. It eliminates weights such that each filter has the same number of nonzero weights, performing retraining with distillation, while retaining comparable accuracy. Further, filter-wise pruning enables our accelerator to exploit inter-filter parallelism, where a processing block for a layer executes filters concurrently, with a straightforward architecture. We also propose an overlapped tiling algorithm, where tiles are extracted with overlap to prevent both accuracy degradation and high utilization of BRAMs storing high-resolution images. Our evaluation using semantic-segmentation tasks showed a 1.8 times speedup and 18.0 times increase in power efficiency of our FPGA design compared with a desktop GPU. Additionally, compared with the conventional FPGA implementation, the speedup and accuracy improvement were 1.09 times and 6.6 points, respectively. Therefore, our approach is useful for FPGA implementation and exhibits considerable accuracy for applications in embedded systems.

show abstract

Section: Fpga Implementation For Cnn-based Semantic Segmentationmentioning

confidence: 99%

Section: Fpga Implementation For Cnn-based Semantic Segmentationmentioning

confidence: 99%

FPGA-Based Inter-layer Pipelined Accelerators for Filter-Wise Weight-Balanced Sparse Fully Convolutional Networks with Overlapped Tiling

Shimoda

Sada

Nakahara

2021

J Sign Process Syst

View full text Add to dashboard Cite

show abstract

“…In this work, however, the LiDAR points are not uniformly distributed on the ground plane but heavily gathered together near the LiDAR scanner, which results in massive dropped points in the near-range and redundant space in the far-range. Lyu et al [17,18] and RangeNet++ [19] improve the projection scheme by replacing the target plane with a sphere surface, in which LiDAR points are nearly uniformly distributed. SqueezeSeg V1 [33], V2 [34], and LO-Net [12] also employ this projection scheme and result in a good performance in LiDAR point semantic segmentation.…”

Section: Related Workmentioning

confidence: 99%

“…Fortunately, Lyu et al [17,18] and RangeNet++ [19] have introduced an algorithm to project the LiDAR data on to a spherical view so that a LiDAR point cloud with geometry features can be transferred to an image-like feature map with minor point losses. By employing this projection, we can efficiently generate image representations of LiDAR frames for feature extraction.…”

Section: Introductionmentioning

confidence: 99%

LodoNet

Zheng

Lyu

et al. 2020

Proceedings of the 28th ACM International Conference on Multimedia

Self Cite

View full text Add to dashboard Cite

Deep learning based LiDAR odometry (LO) estimation attracts increasing research interests in the field of autonomous driving and robotics. Existing works feed consecutive LiDAR frames into neural networks as point clouds and match pairs in the learned feature space. In contrast, motivated by the success of image based feature extractors, we propose to transfer the LiDAR frames to image space and reformulate the problem as image feature extraction. With the help of scale-invariant feature transform (SIFT) for feature extraction, we are able to generate matched keypoint pairs (MKPs) that can be precisely returned to the 3D space. A convolutional neural network pipeline is designed for LiDAR odometry estimation by extracted MKPs. The proposed scheme, namely LodoNet, is then evaluated in the KITTI odometry estimation benchmark, achieving on par with or even better results than the state-of-the-art.

show abstract

“…Automated driving systems (ADS) and advanced driver assistant systems (ADAS) equipped on intelligent vehicles rely on multiple sensors to perceive their surroundings. In recent research works, LiDAR-based algorithms have shown their advantage on drivable region segmentation [8] [9], object detection [18], and simultaneous localization and mapping [19] [15]. LIDARs are also fused with cameras to improve the accuracy of 3D object detection [2].…”

Section: Introductionmentioning

confidence: 99%

An Interactive LiDAR to Camera Calibration

Lyu

Bai

Elhousni

et al. 2019

2019 IEEE High Performance Extreme Computing Conference (HPEC)

Self Cite

View full text Add to dashboard Cite

Recent progress in the automated driving system (ADS) and advanced driver assistant system (ADAS) has shown that the combined use of 3D light detection and ranging (LiDAR) and the camera is essential for an intelligent vehicle to perceive and understand its surroundings. LiDAR-camera fusion requires precise intrinsic and extrinsic calibrations between the sensors. However, due to the limitation of the calibration equipment and susceptibility to noise, algorithms in existing methods tend to fail in finding LiDAR-camera correspondences in long-range. In this paper, we introduced an interactive LiDAR to camera calibration toolbox to estimate the intrinsic and extrinsic transforms. This toolbox automatically detects the corner of a planer board from a sequence of LiDAR frames and provides a convenient user interface for annotating the corresponding pixels on camera frames. Since the toolbox only detects the top corner of the board, there is no need to prepare a precise polygon planar board or a checkerboard with different reflectivity areas as in the existing methods. Furthermore, the toolbox uses genetic algorithms to estimate the transforms and supports multiple camera models such as the pinhole camera model and the fisheye camera model. Experiments using Velodyne VLP-16 LiDAR and Point Grey Chameleon 3 camera show robust results.

show abstract

Real-Time Road Segmentation Using LiDAR Data Processing on an FPGA

Cited by 49 publications

References 36 publications

FPGA-Based Inter-layer Pipelined Accelerators for Filter-Wise Weight-Balanced Sparse Fully Convolutional Networks with Overlapped Tiling

FPGA-Based Inter-layer Pipelined Accelerators for Filter-Wise Weight-Balanced Sparse Fully Convolutional Networks with Overlapped Tiling

LodoNet

An Interactive LiDAR to Camera Calibration

Contact Info

Product

Resources

About