LaneAF: Robust Multi-Lane Detection with Affinity Fields

Abualsaud, Hala; Liu, Sean; Lu, David; Situ, Kenny; Rangesh, Akshay; Trivedi, Mohan M.

doi:10.48550/arxiv.2103.12040

Cited by 6 publications

(11 citation statements)

References 25 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Next, In the process of feature point detection in a sliding window, the window pixels are traversed, and the coordinates of non-zero pixel values are recorded. When the number of effective pixels in the window is less than the threshold, the window width is increased by the window height and width until the minimum number of pixels is met [29]. Furthermore, taking the average value of the abscissa of the effective pixels in the sliding window as the base point coordinate of the next sliding window, iterative detection is carried out until the total number of sliding windows is satisfied [30].…”

Section: Feature Point Extraction (Fpe)mentioning

confidence: 99%

See 1 more Smart Citation

Accurate and Lightweight RailNet for Real-Time Rail Line Detection

et al. 2021

View full text Add to dashboard Cite

Railway transportation has always occupied an important position in daily life and social progress. In recent years, computer vision has made promising breakthroughs in intelligent transportation, providing new ideas for detecting rail lines. Yet the majority of rail line detection algorithms use traditional image processing to extract features, and their detection accuracy and instantaneity remain to be improved. This paper goes beyond the aforementioned limitations and proposes a rail line detection algorithm based on deep learning. First, an accurate and lightweight RailNet is designed, which takes full advantage of the powerful advanced semantic information extraction capabilities of deep convolutional neural networks to obtain high-level features of rail lines. The Segmentation Soul (SS) module is creatively added to the RailNet structure, which improves segmentation performance without any additional inference time. The Depth Wise Convolution (DWconv) is introduced in the RailNet to reduce the number of network parameters and eventually ensure real-time detection. Afterward, according to the binary segmentation maps of RailNet output, we propose the rail line fitting algorithm based on sliding window detection and apply the inverse perspective transformation. Thus the polynomial functions and curvature of the rail lines are calculated, and rail lines are identified in the original images. Furthermore, we collect a real-world rail lines dataset, named RAWRail. The proposed algorithm has been fully validated on the RAWRail dataset, running at 74 FPS, and the accuracy reaches 98.6%, which is superior to the current rail line detection algorithms and shows powerful potential in real applications.

show abstract

Section: Feature Point Extraction (Fpe)mentioning

confidence: 99%

“…Secondly, we label the rail lines of all the rail images by using LABELME to get the JSON file used as the real rail lines during training and finally compared with the predicted rail lines [29]. In the actual training, the 3000 pictures are divided into the training set, verification set, and test set according to the ratio of 0.9:0.05:0.05.…”

Section: Rawrailmentioning

confidence: 99%

Accurate and Lightweight RailNet for Real-Time Rail Line Detection

et al. 2021

View full text Add to dashboard Cite

show abstract

“…Compared with the early rule-based techniques (Dong et al 2012;Deng and Wu 2018), CNNbased lane detection techniques are more adaptive to various weather changes and show less performance deterioration by occlusion. In these techniques, lanes are predicted by a lane detection head based on local features extracted by CNN (He et al 2016), and the performance is improved with development of lane detection heads that exploit the features of lane lines; Segmentation-based techniques such as (Pan et al 2018) detect lanes by assigning classes (e.g., lanes, and backgrounds) to each predicted pixel, which may cause discontinuous lane lines and marks and additional clustering is introduced to compensate (Abualsaud et al 2021). Anchorbased techniques detect lanes through the regression of coordinate change of anchors that are designated initially.…”

Section: Related Workmentioning

confidence: 99%

“…To design the the detection head, we formulize the lane detection problem as a multi-class segmentation problem, where each pixel is assigned a class and a confidence score. This is because there are multiple advantages with the multiclass segmentation-based approaches over other approaches; First, binary-class (i.e., lane or background) segmentationbased approaches (Abualsaud et al 2021) need additional post-processing to assign a new lane class for a new prediction, while multi-class segmentation-based approaches assign a class directly to a prediction. Second, anchorbased approaches use fixed-shaped anchors that limits the lane shape detection.…”

Section: Detection Head and Loss Functionmentioning

confidence: 99%

K-Lane: Lidar Lane Dataset and Benchmark for Urban Roads and Highways

Paek,

Kong,

Wijaya

2021

Preprint

View full text Add to dashboard Cite

Accurate lane detection under various road conditions is a critical function for autonomous driving. Generally, when detected lane lines from a front camera image are projected into a bird's-eye view (BEV) for motion planning, the resulting lane lines are often distorted. And convolutional neural network (CNN)-based feature extractors often lose resolution when increasing the receptive field to detect global features such as lane lines. However, Lidar point cloud has little image distortion in the BEV-projection. Since lane lines are thin and stretch over entire BEV image while occupying only a small portion, lane lines should be detected as a global feature with high resolution. In this paper, we propose Lane Mixer Network (LMN) that extracts local features from Lidar point cloud, recognizes global features, and detects lane lines using a BEV encoder, a Mixer-based global feature extractor, and a detection head, respectively. In addition, we provide a world-first large urban lane dataset for Lidar, K-Lane, which has maximum 6 lanes under various urban road conditions. We demonstrate that the proposed LMN achieves the stateof-the-art performance, an F1 score of 91.67%, with K-Lane. The K-Lane, LMN training code, pre-trained models, and total dataset development platform are available at [github].

show abstract

“…As for those anchor-free, they equate lane detection to high-order polynomial regression, straightforward yet overly relying on certain parameters. The last main group [23,24,25,26], inspired by the fancy thoughts from human pose estimation, usually extract key points with lane semantic and then cluster them into different lane instances via complex post-processing methods. In general, methods other than sementic segmentation have difficulties in modeling more complex lane forms, like those described in BDD100K [27].…”

Section: Introductionmentioning

confidence: 99%

Lane Detection with Versatile AtrousFormer and Local Semantic Guidance

Yang¹,

Zhang²,

Lu³

2022

Preprint

View full text Add to dashboard Cite

Lane detection is one of the core functions in autonomous driving and has aroused widespread attention recently. The networks to segment lane instances, especially with bad appearance, must be able to explore lane distribution properties. Most existing methods tend to resort to CNN-based techniques. A few have a try on incorporating the recent adorable, the seq2seq Transformer [1]. However, their innate drawbacks of weak global information collection ability and exorbitant computation overhead prohibit a wide range of the further applications. In this work, we propose Atrous Transformer (AtrousFormer) to solve the problem. Its variant local AtrousFormer is interleaved into feature extractor to enhance extraction. Their collecting information first by rows and then by columns in a dedicated manner finally equips our network with stronger information gleaning ability and better computation efficiency. To further improve the performance, we also propose a local semantic guided decoder to delineate the identities and shapes of lanes more accurately, in which the predicted Gaussian map of the start- * Corresponding author * *

show abstract

LaneAF: Robust Multi-Lane Detection with Affinity Fields

Cited by 6 publications

References 25 publications

Accurate and Lightweight RailNet for Real-Time Rail Line Detection

Accurate and Lightweight RailNet for Real-Time Rail Line Detection

K-Lane: Lidar Lane Dataset and Benchmark for Urban Roads and Highways

Lane Detection with Versatile AtrousFormer and Local Semantic Guidance

Contact Info

Product

Resources

About