2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023
DOI: 10.1109/wacv56688.2023.00423
|View full text |Cite
|
Sign up to set email alerts
|

Li3DeTr: A LiDAR based 3D Detection Transformer

Abstract: Inspired by recent advances in vision transformers for object detection, we propose Li3DeTr, an end-to-end LiDAR based 3D Detection Transformer for autonomous driving, that inputs LiDAR point clouds and regresses 3D bounding boxes. The LiDAR local and global features are encoded using sparse convolution and multi-scale deformable attention respectively. In the decoder head, firstly, in the novel Li3DeTr cross-attention block, we link the LiDAR global features to 3D predictions leveraging the sparse set of obje… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 14 publications
(4 citation statements)
references
References 45 publications
0
4
0
Order By: Relevance
“…This technology enhances robot autonomy and self-driving cars. Studies include pointnet++ [44], voteNET [45], and 3DETR [46,47] for 3D object detection. Real-time 3D object detection, exemplified by Media pipe objectron [48,49], is an active area of research.…”
Section: D Object Detectionsmentioning
confidence: 99%
“…This technology enhances robot autonomy and self-driving cars. Studies include pointnet++ [44], voteNET [45], and 3DETR [46,47] for 3D object detection. Real-time 3D object detection, exemplified by Media pipe objectron [48,49], is an active area of research.…”
Section: D Object Detectionsmentioning
confidence: 99%
“…Additionally, Refs. [37,38] apply the Detection Transformer (DETR) to LiDAR 3D detection tasks, generating predictions for each object query.…”
Section: Three-dimensional Object Detectionmentioning
confidence: 99%
“…Box-Attention enables the transformer model to better learn spatial information, thereby improving its performance in various visual tasks. Li3DeTr [27] can directly predict the category, position, and size of 3D objects from the LiDAR point cloud data. This work is the first to apply the deformable attention architecture to LiDAR-based 3D object detection models.…”
Section: Egocentric Perceptionmentioning
confidence: 99%