2023
DOI: 10.3390/rs16010025
|View full text |Cite
|
Sign up to set email alerts
|

An Efficient Rep-Style Gaussian–Wasserstein Network: Improved UAV Infrared Small Object Detection for Urban Road Surveillance and Safety

Tuerniyazi Aibibu,
Jinhui Lan,
Yiliang Zeng
et al.

Abstract: Owing to the significant application potential of unmanned aerial vehicles (UAVs) and infrared imaging technologies, researchers from different fields have conducted numerous experiments on aerial infrared image processing. To continuously detect small road objects 24 h/day, this study proposes an efficient Rep-style Gaussian–Wasserstein network (ERGW-net) for small road object detection in infrared aerial images. This method aims to resolve problems of small object size, low contrast, few object features, and… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 43 publications
0
6
0
Order By: Relevance
“…Due to the relatively limited processing power and storage space of UAV, highprecision target detection models such as YOLOv8 are unsuitable for effective operation on UAV due to their high number of parameters and calculations. T. Aibibu et al [33] gives that the mAP of YOLOv8 on the HIT-UAV dataset is 75.5%, while according to the data provided by the official YOLO, the number of references of [34] YOLOv8 is 68.2 M. Cui.C et al [22] gives the LCNet parameter number without improvement as only 3.0 M. The number of parameters of LCNet network is much smaller than that of YOLOv8. Facing the airborne terminal deployment requirements for real-time processing and rapid identification, lightweight networks are the key to achieving efficient real-time identification.…”
Section: Improved Network Based On Lcnetmentioning
confidence: 99%
See 2 more Smart Citations
“…Due to the relatively limited processing power and storage space of UAV, highprecision target detection models such as YOLOv8 are unsuitable for effective operation on UAV due to their high number of parameters and calculations. T. Aibibu et al [33] gives that the mAP of YOLOv8 on the HIT-UAV dataset is 75.5%, while according to the data provided by the official YOLO, the number of references of [34] YOLOv8 is 68.2 M. Cui.C et al [22] gives the LCNet parameter number without improvement as only 3.0 M. The number of parameters of LCNet network is much smaller than that of YOLOv8. Facing the airborne terminal deployment requirements for real-time processing and rapid identification, lightweight networks are the key to achieving efficient real-time identification.…”
Section: Improved Network Based On Lcnetmentioning
confidence: 99%
“…The dataset consists of 2898 thermal infrared images with a resolution of 640 × 512, contains 24,899 labels, and is divided into five categories, namely Person, Car, Bicycle, OtherVehicle, and don-care. Currently, only T. Aibibu et al [33] and X. Zhao et al [35] have launched and published model detection performance optimization work on the HIT-UAV dataset. Among them, T. Aibibu et al [33] were the first team to optimize the recognition performance of five categories of the complete dataset, and they finally achieved an accuracy of 80%.…”
Section: Datasetsmentioning
confidence: 99%
See 1 more Smart Citation
“…Yang et al [ 27 ] merged an enhanced channel attention mechanism with a better version of E-ELAN [ 28 ] to introduce an upgraded YOLOv7 model, which is designed to identify small spots on grape leaves. Aibibu et al [ 29 ] combined the strengths of various networks to improve the detection performance of small target vehicles. Liu et al [ 30 ] utilized dynamic snake convolution [ 31 ] and introduced WISE-IoU [ 32 ] to boost the model’s effectiveness in detecting small traffic-related objects.…”
Section: Related Workmentioning
confidence: 99%
“…Ref. [20] proposed an efficient rep-style Gaussian-Wasserstein network (ERGW-net), which effectively addressed the challenges of small object sizes and low contrast in infrared aerial imagery. Ref.…”
Section: Introductionmentioning
confidence: 99%