LS-Net: fast single-shot line-segment detector

Nguyen, Van Nhan; Jenssen, Robert; Roverso, Davide

doi:10.1007/s00138-020-01138-6

Cited by 51 publications

(26 citation statements)

References 38 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…To the best of authors’ knowledge, almost all the studies regarding PL detection use the dice scores (DSC) (also known as the F1-score), precision, true positive rate (TPR) (also known as recall or sensitivity), false discovery rate (FDR) and accuracy [ 4 , 6 , 7 , 8 , 12 , 13 , 14 ]. These evaluation parameters are defined as: DSC or F1-score = 2TP/(2TP + FP +FN) Precision = TP/(TP + FP) TPR or Recall or Sensitivity = TP/(TP + FN) FDR = FP/(FP + TP) Accuracy = (TP + TN)/(TP + TN + FP + FN) where TP, TN, FP and FN represent the true positive, true negative, false positive and false negative entries of the confusion matrix, respectively.…”

Section: Methodsmentioning

confidence: 99%

“…To the best of our knowledge, no research work has investigated the application of different loss functions for the highly imbalanced PL detection task with thin RoIs. All the studies on PL detection mainly rely on BCE loss [ 6 , 14 ] and its variants [ 4 , 7 , 12 ] to handle the class imbalance problem, with the BBCE loss being the baseline in the majority of these works. The same ACU-Net network architecture is trained with each of these loss functions and the characteristic evaluation parameters are monitored.…”

Section: Methodsmentioning

confidence: 99%

“…To the best of authors’ knowledge, all the studies on PL detection utilize the BCE loss [ 6 , 14 , 15 , 16 ] and its class imbalance variants [ 4 , 7 , 12 ] for segmenting the PLs. Although BCE loss is easier to optimize with lower training times, it might not always be the best choice for training deep classification networks [ 17 , 18 ].…”

Section: Related Work and Theoretical Foundationmentioning

confidence: 99%

“…The loss functions shift the focus to the minority class, thus, allowing it to contribute more to the loss in comparison to the major background class. Despite the fact that class imbalance affects the PL detection task adversely, only a handful of PL detection works address the class imbalance problem either via the popular variant of the binary cross entropy (BCE) loss function, namely the balanced BCE (BBCE) loss [ 4 , 7 , 12 ], or via a compound loss function [ 8 ] or the attention gated networks [ 13 ]. However, other notable studies on PL detection do not consider the prevalent class imbalance problem and utilize the vanilla BCE loss function for training the deep semantic segmentation networks [ 6 , 14 , 15 , 16 ].…”

Section: Introductionmentioning

confidence: 99%

“…However, other notable studies on PL detection do not consider the prevalent class imbalance problem and utilize the vanilla BCE loss function for training the deep semantic segmentation networks [ 6 , 14 , 15 , 16 ]. Moreover, BCE loss and its variants are also the usual choice for loss function in the PL detection tasks [ 4 , 6 , 7 , 12 , 14 , 15 , 16 ]. Nevertheless, it has been demonstrated that other kinds of losses can outperform the popular BCE loss for typical classification tasks [ 17 , 18 ].…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

A Novel Focal Phi Loss for Power Line Segmentation with Auxiliary Classifier U-Net

Jaffari

Hashmani

Reyes-Aldasoro

2021

Sensors

View full text Add to dashboard Cite

The segmentation of power lines (PLs) from aerial images is a crucial task for the safe navigation of unmanned aerial vehicles (UAVs) operating at low altitudes. Despite the advances in deep learning-based approaches for PL segmentation, these models are still vulnerable to the class imbalance present in the data. The PLs occupy only a minimal portion (1–5%) of the aerial images as compared to the background region (95–99%). Generally, this class imbalance problem is addressed via the use of PL-specific detectors in conjunction with the popular class balanced cross entropy (BBCE) loss function. However, these PL-specific detectors do not work outside their application areas and a BBCE loss requires hyperparameter tuning for class-wise weights, which is not trivial. Moreover, the BBCE loss results in low dice scores and precision values and thus, fails to achieve an optimal trade-off between dice scores, model accuracy, and precision–recall values. In this work, we propose a generalized focal loss function based on the Matthews correlation coefficient (MCC) or the Phi coefficient to address the class imbalance problem in PL segmentation while utilizing a generic deep segmentation architecture. We evaluate our loss function by improving the vanilla U-Net model with an additional convolutional auxiliary classifier head (ACU-Net) for better learning and faster model convergence. The evaluation of two PL datasets, namely the Mendeley Power Line Dataset and the Power Line Dataset of Urban Scenes (PLDU), where PLs occupy around 1% and 2% of the aerial images area, respectively, reveal that our proposed loss function outperforms the popular BBCE loss by 16% in PL dice scores on both the datasets, 19% in precision and false detection rate (FDR) values for the Mendeley PL dataset and 15% in precision and FDR values for the PLDU with a minor degradation in the accuracy and recall values. Moreover, our proposed ACU-Net outperforms the baseline vanilla U-Net for the characteristic evaluation parameters in the range of 1–10% for both the PL datasets. Thus, our proposed loss function with ACU-Net achieves an optimal trade-off for the characteristic evaluation parameters without any bells and whistles. Our code is available at Github.

show abstract