2022
DOI: 10.1109/tpami.2021.3136899
|View full text |Cite
|
Sign up to set email alerts
|

MonoEF: Extrinsic Parameter Free Monocular 3D Object Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 14 publications
(5 citation statements)
references
References 56 publications
0
5
0
Order By: Relevance
“…standard monocular detectors [16,18,22,41,62] take as input only a single image and mostly adopt center-guided pipelines following conventional 2D detectors [38,44,60]. M3D-RPN [2] designs a depth-aware convolution along with 3D anchors to generate better 3D region proposals.…”
Section: Related Workmentioning
confidence: 99%
“…standard monocular detectors [16,18,22,41,62] take as input only a single image and mostly adopt center-guided pipelines following conventional 2D detectors [38,44,60]. M3D-RPN [2] designs a depth-aware convolution along with 3D anchors to generate better 3D region proposals.…”
Section: Related Workmentioning
confidence: 99%
“…Researchers have been improving detection performance via upgrading detection frameworks (Wang et al 2021d;Liu et al 2019), losses (Simonelli et al 2020) and non-maximum suppression (Kumar, Brazil, and Liu 2021) as well as performing joint detection and 3D reconstruction . To reduce 2D to 3D ambiguities, various strategies have been developed, e.g., Pseudo-LiDAR (Wang et al 2019;Ma et al 2019Ma et al , 2020Simonelli et al 2021;Park et al 2021), novel convolutions (Ding et al 2020) and backbones (Kumar et al 2022), considering camera geometry (Zhou et al 2021), using shape models Chabot et al 2017) and leveraging videos (Brazil et al 2020).…”
Section: Related Workmentioning
confidence: 99%
“…MonoDETR [54] constructed a foreground depth map to help the model understand the scene object depth spatial distributions with the aid of the DETR-liked model structure [55]. MonoEF [56], [57] and MoGDE [58] proposed to model the camera extrinsic changes and provided a grounded depth carrying the camera information, making the model achieve extremely accurate performance. For perspective projection: Ivan et al [11] combined the keypoint method and the projection to do geometry reasoning.…”
Section: B Monocular 3d Object Detectionmentioning
confidence: 99%