2023
DOI: 10.48550/arxiv.2303.11301
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking

Abstract: 3D object detectors usually rely on hand-crafted proxies, e.g., anchors or centers, and translate well-studied 2D frameworks to 3D. Thus, sparse voxel features need to be densified and processed by dense prediction heads, which inevitably costs extra computation. In this paper, we instead propose VoxelNext for fully sparse 3D object detection. Our core insight is to predict objects directly based on sparse voxel features, without relying on hand-crafted proxies. Our strong sparse convolutional network Vox-elNe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 38 publications
0
2
0
Order By: Relevance
“…It can effectively suppress noise and improve accuracy. Furthermore, VoxelNeXt [21] is a 3D object detection and tracking method that directly predicts objects based on sparse voxel features without relying on hand-crafted proxies. BTCDet [22] predicts the probability of occupancy in these regions.…”
Section: Deep-learning-based Vehicle Pose Estimation Methodsmentioning
confidence: 99%
“…It can effectively suppress noise and improve accuracy. Furthermore, VoxelNeXt [21] is a 3D object detection and tracking method that directly predicts objects based on sparse voxel features without relying on hand-crafted proxies. BTCDet [22] predicts the probability of occupancy in these regions.…”
Section: Deep-learning-based Vehicle Pose Estimation Methodsmentioning
confidence: 99%
“…CenterPoint [29] presents using point representations to describe objects, which solves the constraints imposed by anchor-based methods on angle and size and diminishes the search space for objects. VoxelNeXt [30] proposes an efficient structure that predicts objects directly from sparse voxel features rather than relying on hand-crafted proxies. The voxel-based methods can achieve decent detection performance with promising efficiency.…”
Section: Lidar-only 3d Object Detection Methodsmentioning
confidence: 99%