2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020
DOI: 10.1109/cvpr42600.2020.01303
|View full text |Cite
|
Sign up to set email alerts
|

IDA-3D: Instance-Depth-Aware 3D Object Detection From Stereo Vision for Autonomous Driving

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
29
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 60 publications
(29 citation statements)
references
References 28 publications
0
29
0
Order By: Relevance
“…Compared to previous RGB-based methods that only predict 3D bounding box representation and cannot output detailed and complete object surface normals, our approach complements them with shape reconstruction capability without sacrificing object detection performance. Instance-level modeling in 3D object detection builds a feature representation for an interested object in order to estimate its 3D attributes [34,37,43,44,54]. FQ-Net [43] draws a re-projected 3D cuboid on an instance patch to predict its 3D Intersection over Union (IoU) with the ground truth.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Compared to previous RGB-based methods that only predict 3D bounding box representation and cannot output detailed and complete object surface normals, our approach complements them with shape reconstruction capability without sacrificing object detection performance. Instance-level modeling in 3D object detection builds a feature representation for an interested object in order to estimate its 3D attributes [34,37,43,44,54]. FQ-Net [43] draws a re-projected 3D cuboid on an instance patch to predict its 3D Intersection over Union (IoU) with the ground truth.…”
Section: Related Workmentioning
confidence: 99%
“…The binocular human visual system effortlessly perceives various 3D attributes of surrounding objects including their locations, orientations and detailed surface geometry. However, existing visual 3D object detection (3DOD) approaches are often limited to approximate object shapes as 3D bounding boxes [38,40,54,57]. A complete and detailed shape description for the detected objects is desirable as a complement to these 3D bounding boxes for various machine vision applications.…”
Section: Introductionmentioning
confidence: 99%
“…Some traditional methods [26,39] use region-based stereo matching to estimate object's depth, but their performance is poor because they are not based on deep learning. [30] predicts the depth of the target through the method of instance depth perception which can detect the three-dimensional box end-to-end, but it neglects the local structure information. Our approach introduces a structure-aware-depth-estimation module that directly predicts the depth of the 3D bounding box's center, and then rectifies the results by box estimation and dense alignment, which together benefit the accuracy of depth estimation and thus yield better 3D detection performance.…”
Section: Related Workmentioning
confidence: 99%
“…We evaluate the proposed SIDE method with KITTI 3D dataset [10]. Specifically, our AP 3D of car category is better than the state-of-the-art methods IDA-3D [30] in all kinds of cases with IoU=0.7. Especially, in the moderate and hard case, our method performs better than IDA-3D with over 3% AP 3D , which means our method can better detect objects far away or with large occlusions.…”
Section: Introductionmentioning
confidence: 99%
“…The understanding of 3D properties of objects in the real world is critical for visionbased autonomous driving and traffic surveillance systems [1][2][3][4][5]. Compared with a twodimensional (2D) object detection task, the 3D object detection task involves nine degrees of freedom, in which the length, width, height, and pose of the 3D bounding box need to be detected.…”
Section: Introductionmentioning
confidence: 99%