2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
DOI: 10.1109/cvpr46437.2021.00329
|View full text |Cite
|
Sign up to set email alerts
|

PVGNet: A Bottom-Up One-Stage 3D Object Detector with Integrated Multi-Level Features

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 34 publications
(6 citation statements)
references
References 25 publications
0
5
0
Order By: Relevance
“…8) Confidence Score: Confidence score indicates the optimum threshold to categorize false positives to ensure the predicted bounding box contains minimum standard score and often used for model performance evaluation [19,79,106]. Non optimal settings for any proposed model requires more minimized confidence score for precise bounding box detection for 3D object detection.…”
Section: ) Average Orientation Similarity (Aos): Averagementioning
confidence: 99%
“…8) Confidence Score: Confidence score indicates the optimum threshold to categorize false positives to ensure the predicted bounding box contains minimum standard score and often used for model performance evaluation [19,79,106]. Non optimal settings for any proposed model requires more minimized confidence score for precise bounding box detection for 3D object detection.…”
Section: ) Average Orientation Similarity (Aos): Averagementioning
confidence: 99%
“…We adopt the voxel-based CNN as the backbone due to its efficiency. In order to prevent the loss of geometry information, which is crucial for implicit boundary generation, we simultaneously extract point-wise and voxelwise features in one backbone [23,48]. As the yellow block shown in Fig.…”
Section: Backbone Networkmentioning
confidence: 99%
“…setting as in KITTI benchmark. In particular, we focus on the 'Car' category as many recent works [6,23,46] and adopt IoU = 0.7 for evaluation. When performing experimental studies on the val set, we use the train data for training.…”
Section: Dataset and Protocolsmentioning
confidence: 99%
“…Instead of using voxels, PointPillars [15] introduced a column (pillar) representation to further reduce the complexity. Hybrid methods such as PV-RCNN [34,35] or STD [53] simultaneously process point and voxel information to obtain multi-scale features and fine-grained localization [12,21,22,54]. For example, Part-A 2 [37] uses pointwise features to predict intra-object parts and then aggregates the part information for box refinement to improve the robustness.…”
Section: Related Workmentioning
confidence: 99%