2020
DOI: 10.48550/arxiv.2006.11275
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Center-based 3D Object Detection and Tracking

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
132
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 50 publications
(132 citation statements)
references
References 45 publications
0
132
0
Order By: Relevance
“…SECOND [38] improves the voxel feature learning process by introducing 3D sparse convolutions. CenterPoints [42] proposes a center-based assignment that can be applied on feature maps for accurate location prediction. Pillar-based approaches generally transform point clouds into Bird-Eye-View (BEV) pillars and apply 2D CNNs for 3D object detection.…”
Section: Related Workmentioning
confidence: 99%
“…SECOND [38] improves the voxel feature learning process by introducing 3D sparse convolutions. CenterPoints [42] proposes a center-based assignment that can be applied on feature maps for accurate location prediction. Pillar-based approaches generally transform point clouds into Bird-Eye-View (BEV) pillars and apply 2D CNNs for 3D object detection.…”
Section: Related Workmentioning
confidence: 99%
“…Configuration Details: In this experiment, the backbone task f is a CenterPoint LiDAR detector [41] for both vehicles and pedestrians, trained on the simulated data from Carla in addition to proprietary real-world data. The downstream planner g is a modified version of the BasicAgent included in the Carla Python API, where changes were made to improve the performance of the planner.…”
Section: Carla Leaderboard Evaluationmentioning
confidence: 99%
“…• We perform extensive large-scale experiments demonstrating the efficacy of our approach using Carla [8] simulator for driving tasks. We create surrogate models of two well-known LiDAR detectors, PIXOR [40] and Centerpoint [41], as our backbone task. • We show that our approach is closest to the backbone task compared to the baselines evaluated on several metrics, and yields a 20 times reduction in compute time.…”
Section: Introductionmentioning
confidence: 99%
“…We use a dense, single-stage convolutional header for detecting objects using the per-cell features. Similarly to [4,43], we first predict if a cell contains the center of an object for some class. For each center cell, we then predict an associated bounding box and use non-maximum suppression to remove duplicates.…”
Section: Output Predictionmentioning
confidence: 99%