2022
DOI: 10.48550/arxiv.2203.09830
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Laneformer: Object-aware Row-Column Transformers for Lane Detection

Abstract: We present Laneformer, a conceptually simple yet powerful transformer-based architecture tailored for lane detection that is a long-standing research topic for visual perception in autonomous driving. The dominant paradigms rely on purely CNN-based architectures which often fail in incorporating relations of long-range lane points and global contexts induced by surrounding objects (e.g., pedestrians, vehicles). Inspired by recent advances of the transformer encoder-decoder architecture in various vision tasks,… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
7
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(7 citation statements)
references
References 21 publications
0
7
0
Order By: Relevance
“…LSTR [14] utilizes DETR-like [4] structure and geometry constraints of lanes to improve detection performance. Laneformer [7] uses a transformer encoder and tries to exploit the relationship between occluded parts of lanes and vehicles. Under the guidance of the vehicle's position information, the occluded lanes are completed.…”
Section: Curve-based Methodsmentioning
confidence: 99%
“…LSTR [14] utilizes DETR-like [4] structure and geometry constraints of lanes to improve detection performance. Laneformer [7] uses a transformer encoder and tries to exploit the relationship between occluded parts of lanes and vehicles. Under the guidance of the vehicle's position information, the occluded lanes are completed.…”
Section: Curve-based Methodsmentioning
confidence: 99%
“…It detects lanes in the image plane and projects them to 3D space with camera pose. In general, advanced monocular lane detectors can be categorized into segmentation approaches [1]- [4] and regression approaches [5]- [12], [27]. SCNN [1] is proposed to propagate context by slice-byslice convolutions within feature maps.…”
Section: Related Workmentioning
confidence: 99%
“…Lane regression algorithms can be grouped into key points estimation [5], [6], anchor-based regression [7]- [11] and row-wise regression [12], [27]. PINet [5] combines key points estimation and instance segmentation, and GANet [6] represents lanes as a set of key points which are only related to the start point.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…It also demonstrated good adaptability to scenarios with nighttime conditions or with occluded parts; however, it does not address complex lane detection tasks. Likewise, in [41], Han et al developed the Laneformer transformer-based architecture and adapted it to lane detection. This better captures the lane shape characteristics and the global semantic context of the series of points defining the lanes using a row and column selfattenuation mechanism.…”
Section: Related Workmentioning
confidence: 99%