2021
DOI: 10.48550/arxiv.2105.00268
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Lite-FPN for Keypoint-based Monocular 3D Object Detection

Abstract: 3D object detection with a single image is an essential and challenging task for autonomous driving. Recently, keypoint-based monocular 3D object detection has made tremendous progress and achieved great speed-accuracy trade-off. However, there still exists a huge gap with LIDAR-based methods in terms of accuracy. To improve their performance without sacrificing efficiency, we propose a sort of lightweight feature pyramid network called Lite-FPN to achieve multi-scale feature fusion in an effective and efficie… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(4 citation statements)
references
References 48 publications
0
4
0
Order By: Relevance
“…3D feature estimation-based methods. To avoid ambiguous results from geometric constraints and further improve accuracy and efficiency, many new methods [15,17,18,[23][24][25][26][27][28] focus on direct 3D information extraction in images which can be used for 3D bounding box regression by CNNs. Mono3DBox [23] proposes an end-to-end detector that can directly predict the center point of the vehicle bottom in the image.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…3D feature estimation-based methods. To avoid ambiguous results from geometric constraints and further improve accuracy and efficiency, many new methods [15,17,18,[23][24][25][26][27][28] focus on direct 3D information extraction in images which can be used for 3D bounding box regression by CNNs. Mono3DBox [23] proposes an end-to-end detector that can directly predict the center point of the vehicle bottom in the image.…”
Section: Related Workmentioning
confidence: 99%
“…Then, vehicle location, dimension, and orientation can be solved by minimizing the reprojection error of 2D-3D bounding boxes. Lite-FPN [27] proposes a light-weight keypoint-based feature pyramid net-work, which contains three parts: backbone, detection head and post-processing. Top-k operation is used in the detection head to connect keypoints and regression sub-network.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations