2024
DOI: 10.1117/1.jei.33.6.063022
|View full text |Cite
|
Sign up to set email alerts
|

Dual-domain deformable feature fusion for multi-modal 3D object detection

Shihao Wang,
Tao Deng

Abstract: Recent advancements in 3D object detection using light detection and ranging (LiDAR)-camera fusion have enhanced autonomous driving perception. However, aligning LiDAR and image data during multimodal fusion remains a significant challenge. We propose a novel multi-modal feature alignment and fusion architecture to effectively align and fuse voxel and image data. The proposed architecture comprises four key modules. Z -axis attention aggregates voxel features along the vertical axis using self-attention. Voxel… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 49 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?