2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
DOI: 10.1109/cvpr46437.2021.00607
|View full text |Cite
|
Sign up to set email alerts
|

Offboard 3D Object Detection from Point Cloud Sequences

Abstract: Occluded and long-range objects are ubiquitous and challenging for 3D object detection. Point cloud sequence data provide unique opportunities to improve such cases, as an occluded or distant object can be observed from different viewpoints or gets better visibility over time. However, the efficiency and effectiveness in encoding long-term sequence data can still be improved. In this work, we propose MoDAR, using motion forecasting outputs as a type of virtual modality, to augment LiDAR point clouds. The MoDAR… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
71
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
5

Relationship

2
8

Authors

Journals

citations
Cited by 129 publications
(71 citation statements)
references
References 91 publications
0
71
0
Order By: Relevance
“…Another motivation for using BEV features to perform perception tasks is that BEV is a desirable bridge to connect temporal and spatial space. For the human visual perception system, temporal information plays a crucial role in inferring the motion state of objects and identifying occluded objects, and many works in vision fields have demonstrated the effectiveness of using video data [2,27,26,33,19]. However, the existing state-of-the-art multi-camera 3D detection methods rarely exploit temporal information.…”
Section: Introductionmentioning
confidence: 99%
“…Another motivation for using BEV features to perform perception tasks is that BEV is a desirable bridge to connect temporal and spatial space. For the human visual perception system, temporal information plays a crucial role in inferring the motion state of objects and identifying occluded objects, and many works in vision fields have demonstrated the effectiveness of using video data [2,27,26,33,19]. However, the existing state-of-the-art multi-camera 3D detection methods rarely exploit temporal information.…”
Section: Introductionmentioning
confidence: 99%
“…An alternative approach by Deuge et al [31] introduced an unsupervised feature learning approach for outdoor object classification by projecting 3D LiDAR scans into 2D depth images. Recently, Qi et al [35] introduced an off-board (e.g. cloud) pipeline to get rid of the computational power limitation of on-board (i.e.…”
Section: A Object Detectionmentioning
confidence: 99%
“…[57] generates multi-frame object proposal features from the object feature vectors in the memory using a nonlocal attention block for aligning features. [58] processes LiDAR point cloud sequences for automatic labeling. A 3D multi-object tracker is used to match single-frame detections.…”
Section: Challenges and Open Fieldsmentioning
confidence: 99%