RPVNet: A Deep and Efficient Range-Point-Voxel Fusion Network for LiDAR Point Cloud Segmentation

Xu, Jianyun; Zhang, Ruixiang; Dou, Jie; Yushi, Zhu; Sun, Jie; Pu, Shiliang

doi:10.1109/iccv48922.2021.01572

Cited by 233 publications

(83 citation statements)

References 39 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Fusion-based methods (Liong et al, 2020;Xu et al, 2021;Cheng et al, 2021) use two or more of the previous methods to extract features from point clouds such as combination of voxel and point-based method, or voxel, point, projection-based method. Those methods infer the different backbones in parallel and fuse the point-wise features from each backbone using concatenation, attention, and add.…”

Section: Fusion-based Methodsmentioning

confidence: 99%

PCSCNet: Fast 3D Semantic Segmentation of LiDAR Point Cloud for Autonomous Car using Point Convolution and Sparse Convolution Network

Park¹,

Kim²,

Jo³

2022

Preprint

View full text Add to dashboard Cite

The autonomous car must recognize the driving environment quickly for safe driving. As the Light Detection And Range (LiDAR) sensor is widely used in the autonomous car, fast semantic segmentation of LiDAR point cloud, which is the point-wise classification of the point cloud within the sensor framerate, has attracted attention in recognition of the driving environment. Although the voxel and fusion-based semantic segmentation models are the state-of-the-art model in point cloud semantic segmentation recently, their real-time performance suffer from high computational load due to high voxel resolution. In this paper, we propose the fast voxel-based semantic segmentation model using Point Convolution and 3D Sparse Convolution (PCSCNet). The proposed model is designed to outperform at both high and low voxel resolution using point convolution-based feature extraction. Moreover, the proposed model accelerates the feature propagation using 3D sparse convolution after the feature extraction. The experimental results demonstrate that the proposed model outperforms the state-of-the-art real-time models in semantic segmentation of SemanticKITTI and nuScenes, and achieves the real-time performance in LiDAR point cloud inference.

show abstract

Section: Fusion-based Methodsmentioning

confidence: 99%

PCSCNet: Fast 3D Semantic Segmentation of LiDAR Point Cloud for Autonomous Car using Point Convolution and Sparse Convolution Network

Park¹,

Kim²,

Jo³

2022

Preprint

View full text Add to dashboard Cite

show abstract

“…Voxelization based methods [10,11,12,13] transform the irregular unordered point cloud into regular 3D grids, and then the powerful 3D convolution is applied in feature extraction and prediction. However, the problem of granular information loss can be caused by using a large voxel size.…”

Section: Voxelization Based Methodsmentioning

confidence: 99%

MVP-Net: Multiple View Pointwise Semantic Segmentation of Large-Scale Point Clouds

Luo¹,

Li²,

Cheng³

et al. 2022

Preprint

View full text Add to dashboard Cite

Semantic segmentation of 3D point cloud is an essential task for autonomous driving environment perception. The pipeline of most pointwise point cloud semantic segmentation methods includes points sampling, neighbor searching, feature aggregation, and classification. Neighbor searching method like K-nearest neighbors algorithm, KNN, has been widely applied. However, the complexity of KNN is always a bottleneck of efficiency. In this paper, we propose an end-to-end neural architecture, Multiple View Pointwise Net, MVP-Net, to efficiently and directly infer large-scale outdoor point cloud without KNN or any complex pre/postprocessing. Instead, assumption-based sorting and multi-rotation of point cloud methods are introduced to point feature aggregation and receptive field expanding. Numerical experiments show that the proposed MVP-Net is 11 times faster than the most efficient pointwise semantic segmentation method RandLA-Net [1] and achieves the same accuracy on the large-scale benchmark SemanticKITTI dataset. KeywordsPoint Cloud • Semantic Segmentation • Autonomous Driving Recently, PointNet-based works [4, 1] were proposed to directly process large-scale point clouds. These pipelines include multi-level point cloud sampling, neighbor searching, and PointNet-based local feature aggregation. RandLA-

show abstract

“…Voxels provide the coarse-grained local features, and points preserve the finegrained geometric features through a simple MLP. Xu et al [12] fuse three different feature representations, including point, range map and voxel, which achieve the promising fusion results by interacting features at various stages.…”

Section: Related Workmentioning

confidence: 99%

“…Nevertheless, they fail to preserve the original neighborhood relationship. In practice, the hybrid methods [10]- [12] fuse two or more of the above feature representations, which can obtain the better results. Unfortunately, this incurs the extra computational load.…”

Section: Introductionmentioning

confidence: 99%

Meta-RangeSeg: LiDAR Sequence Semantic Segmentation Using Multiple Feature Aggregation

Wang¹,

Zhu²,

Zhang³

2022

Preprint

Self Cite

View full text Add to dashboard Cite

LiDAR sensor is essential to the perception system in autonomous vehicles and intelligent robots. To fulfill the realtime requirements in real-world applications, it is necessary to efficiently segment the LiDAR scans. Most of previous approaches directly project 3D point cloud onto the 2D spherical range image so that they can make use of the efficient 2D convolutional operations for image segmentation. Although having achieved the encouraging results, the neighborhood information is not wellpreserved in the spherical projection. Moreover, the temporal information is not taken into consideration in the single scan segmentation task. To tackle these problems, we propose a novel approach to semantic segmentation for LiDAR sequences named Meta-RangeSeg, where a novel range residual image representation is introduced to capture the spatial-temporal information. Specifically, Meta-Kernel is employed to extract the meta features, which reduces the inconsistency between the 2D range image coordinates input and Cartesian coordinates output. An efficient U-Net backbone is used to obtain the multi-scale features. Furthermore, Feature Aggregation Module (FAM) aggregates the meta features and multi-scale features, which tends to strengthen the role of range channel. We have conducted extensive experiments for performance evaluation on SemanticKITTI, which is the de-facto dataset for LiDAR semantic segmentation. The promising results show that our proposed Meta-RangeSeg method is more efficient and effective than the existing approaches. Our full implementation is publicly available at https://github.com/songw-zju/Meta-RangeSeg.

show abstract

RPVNet: A Deep and Efficient Range-Point-Voxel Fusion Network for LiDAR Point Cloud Segmentation

Cited by 233 publications

References 39 publications

PCSCNet: Fast 3D Semantic Segmentation of LiDAR Point Cloud for Autonomous Car using Point Convolution and Sparse Convolution Network

PCSCNet: Fast 3D Semantic Segmentation of LiDAR Point Cloud for Autonomous Car using Point Convolution and Sparse Convolution Network

MVP-Net: Multiple View Pointwise Semantic Segmentation of Large-Scale Point Clouds

Meta-RangeSeg: LiDAR Sequence Semantic Segmentation Using Multiple Feature Aggregation

Contact Info

Product

Resources

About