2022
DOI: 10.1016/j.patcog.2022.108796
|View full text |Cite
|
Sign up to set email alerts
|

3D Object Detection for Autonomous Driving: A Survey

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

1
63
0
3

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 221 publications
(67 citation statements)
references
References 1 publication
1
63
0
3
Order By: Relevance
“…PointPillars [13] takes advantage of a standard 2D convolutional detection pipeline which codes point clouds into a special partition of voxels (i.e., pillars) in the light of the tradeoff between efficiency and accuracy. However, the step of point clouds discretization in voxel-based detectors will degrade the fine-grained localization accuracy [17]. Some point-voxel-based methods [20,4,35,10] have been proposed, which take the advantage of localization accuracy of point-based detectors and computational efficiency of voxel-based detectors.…”
Section: Lidar-based 3d Object Detectionmentioning
confidence: 99%
See 1 more Smart Citation
“…PointPillars [13] takes advantage of a standard 2D convolutional detection pipeline which codes point clouds into a special partition of voxels (i.e., pillars) in the light of the tradeoff between efficiency and accuracy. However, the step of point clouds discretization in voxel-based detectors will degrade the fine-grained localization accuracy [17]. Some point-voxel-based methods [20,4,35,10] have been proposed, which take the advantage of localization accuracy of point-based detectors and computational efficiency of voxel-based detectors.…”
Section: Lidar-based 3d Object Detectionmentioning
confidence: 99%
“…The 3D Sparse Convolutions [31] with positional encoding can focus on voxels with point clouds thus reducing computation cost. The common voxel representation learning operators include mean operator, random sampling, and MLP operator [17]. Reconstructing the voxel features as a regression task is very difficult as the pre-training network needs to learn the distributions of each point.…”
Section: Introductionmentioning
confidence: 99%
“…We briefly introduce the 3D object detection approaches in this section, and refer reader to Qian et al [19] for more detailed description. ImVoteNet [16] proposes to use 2D image RGB, geometric coordinates, semantics, and pixel texture information to assist 3D point clouds object detection.…”
Section: D Object Detectionmentioning
confidence: 99%
“…There exist many deep-learning-based object detection methods, and they have been applied in many fields, such as autonomous driving [ 3 ], facial recognition [ 4 ], defect detection [ 5 , 6 ], and medical imaging [ 7 , 8 ]; the historical champion network ResNet from the ImageNet LSVRC (Large-Scale Visual Recognition Challenge) competition outperformed the human recognition level [ 9 ]. Models based on CNNs have shown good accuracy and robustness for recognition of certain species.…”
Section: Introductionmentioning
confidence: 99%