2020
DOI: 10.1007/978-3-030-58545-7_2
|View full text |Cite
|
Sign up to set email alerts
|

Monocular 3D Object Detection via Feature Domain Adaptation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
19
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 35 publications
(19 citation statements)
references
References 34 publications
0
19
0
Order By: Relevance
“…The learning-based methods [43,6,36,37] directly regress the distance of objects by adding distance branches to 2D object detectors, which are simple and efficient. The pseudo-LiDAR-based methods [40,27,42] first predict the depth map of an input image using an external monocular depth estimator, then predict the distance of objects from the estimated depth map using a point-cloudbased 3D object detector. Though the explicit depth cues from the the estimated depth map can ease the distance prediction, the generalization of the methods is bounded by that of the monocular depth estimators [35].…”
Section: Monocular 3d Object Detectionmentioning
confidence: 99%
See 2 more Smart Citations
“…The learning-based methods [43,6,36,37] directly regress the distance of objects by adding distance branches to 2D object detectors, which are simple and efficient. The pseudo-LiDAR-based methods [40,27,42] first predict the depth map of an input image using an external monocular depth estimator, then predict the distance of objects from the estimated depth map using a point-cloudbased 3D object detector. Though the explicit depth cues from the the estimated depth map can ease the distance prediction, the generalization of the methods is bounded by that of the monocular depth estimators [35].…”
Section: Monocular 3d Object Detectionmentioning
confidence: 99%
“…MonoRCNN directly predicts 3D bounding boxes from RGB images with a compact architecture, making the training and inference much more simple and efficient than those depth-based and video-based methods. Specifically, MonoRCNN runs 3 times faster than D 4 LCN [7], and 5 times faster than AM3D [27], DA-3Ddet [42], and PatchNet [26].…”
Section: Comparisons On the Kitti Benchmarkmentioning
confidence: 99%
See 1 more Smart Citation
“…Following Frustum PointNet [33], most coordinate-based methods [43,46,28,44] Gradient boosting [13,14] is a general learning framework that combines multiple weak learners into a single strong one in an iterative fashion. Let L(x) be the risk of ensemble models, the algorithm devotes to seek an approximation h(x) to minimize L(y * , x) = Ψ(y * , h(x)), where y * is a target value, Ψ(•) is the loss function.…”
Section: Confidence-aware Localization Boostingmentioning
confidence: 99%
“…Typical coordinate-based methods [43,46,28,44] perform 3D detection based on 2D RoIs, which is similar to two-stage 2D detection frameworks, such as Faster R-CNN [35]. In a two-stage 2D object detection framework, the second stage reuses the features from the first stage via RoIPooling [35] or RoIAlign operators [19] guided by ROI proposals, and then a small decoder is used for localization refinement.…”
Section: Global Context Encodingmentioning
confidence: 99%