2022
DOI: 10.48550/arxiv.2203.09704
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

VISTA: Boosting 3D Object Detection via Dual Cross-VIew SpaTial Attention

Abstract: Detecting objects from LiDAR point clouds is of tremendous significance in autonomous driving. In spite of good progress, accurate and reliable 3D detection is yet to be achieved due to the sparsity and irregularity of LiDAR point clouds. Among existing strategies, multi-view methods have shown great promise by leveraging the more comprehensive information from both bird's eye view (BEV) and range view (RV). These multi-view methods either refine the proposals predicted from single view via fused features, or … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(4 citation statements)
references
References 24 publications
0
4
0
Order By: Relevance
“…While reproducing the results of VISTA [37] on nuScenes [16], it was discovered that the detection performance of VISTA heavily relies on the class-balanced grouping and sampling (CBGS) preprocessing step [45]. Without the resampling step, poor results were achieved (Table 2).…”
Section: Resultsmentioning
confidence: 84%
See 3 more Smart Citations
“…While reproducing the results of VISTA [37] on nuScenes [16], it was discovered that the detection performance of VISTA heavily relies on the class-balanced grouping and sampling (CBGS) preprocessing step [45]. Without the resampling step, poor results were achieved (Table 2).…”
Section: Resultsmentioning
confidence: 84%
“…As a benchmark algorithm, VISTA [37] was chosen since it scored highest on the nuScenes leaderboard at the time. It uses a voxelized point cloud representation, various convolution operations [38][39][40], projection into two different views and cross-view transformers to bring the projections back together [41].…”
Section: D Object Detectionmentioning
confidence: 99%
See 2 more Smart Citations