2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
DOI: 10.1109/cvpr46437.2021.01024
|View full text |Cite
|
Sign up to set email alerts
|

MonoRUn: Monocular 3D Object Detection by Reconstruction and Uncertainty Propagation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
54
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 94 publications
(54 citation statements)
references
References 29 publications
0
54
0
Order By: Relevance
“…BB8 [37] and RTM3D [28] locate the corners of the 3D bounding box as keypoints, while PVNet [36] defines the keypoints by farthest point sampling and Deep MANTA [11] by handcrafted templates. On the other hand, dense correspondence methods [13,29,35,46,52] predict pixel-wise 3D coordinates within a cropped 2D region. Most existing geometry-based methods follow a two-stage strategy, where the intermediate representations (i.e., 2D-3D correspondences) are learned with a surrogate loss function, which is sub-optimal compared to end-to-end learning.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…BB8 [37] and RTM3D [28] locate the corners of the 3D bounding box as keypoints, while PVNet [36] defines the keypoints by farthest point sampling and Deep MANTA [11] by handcrafted templates. On the other hand, dense correspondence methods [13,29,35,46,52] predict pixel-wise 3D coordinates within a cropped 2D region. Most existing geometry-based methods follow a two-stage strategy, where the intermediate representations (i.e., 2D-3D correspondences) are learned with a surrogate loss function, which is sub-optimal compared to end-to-end learning.…”
Section: Related Workmentioning
confidence: 99%
“…This inspired works such as DSAC [4], a smooth RANSAC with a finite hypothesis pool. Meanwhile, simple parametric distributions (e.g., normal distribution) are often used in predicting continuous variables [13,18,22,25,26,51], and mixture distributions can be employed to further capture ambiguity [3,5,31], e.g., ambiguous 6DoF pose [7]. In this paper, we propose yet a unique contribution: backpropagating a complicated continuous distribution derived from a nested optimization layer (the PnP layer), essentially making the continuous counterpart of Softmax tractable.…”
Section: End-to-end Correspondence Learningmentioning
confidence: 99%
See 2 more Smart Citations
“…M3DSSD [26] proposes a two-step feature alignment approach to overcome feature mismatching. MonoRUN [5] learns dense correspondences and geometry in a self-supervised manner with simple 3D bounding box annotations. Fig.…”
Section: Monocular 3d Object Detectionmentioning
confidence: 99%