2023
DOI: 10.1109/access.2023.3233964
|View full text |Cite
|
Sign up to set email alerts
|

YOLO-Extract: Improved YOLOv5 for Aircraft Object Detection in Remote Sensing Images

Abstract: Compared with natural images, remote sensing targets have small and dense target shapes as well as complex target backgrounds. As a result, insufficient detection accuracy and target location cannot be accurately identified. So, this paper proposes the YOLO-extract algorithm based on the YOLOv5 algorithm. Firstly, The YOLO-extract algorithm optimized the model structure of the YOLOv5 algorithm. The YOLOextract algorithm not only deleted the feature layer and prediction head with poor feature extraction ability… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
10
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 60 publications
(34 citation statements)
references
References 33 publications
0
10
0
Order By: Relevance
“…In-field wheat field backgrounds are complex, with poor illumination conditions, mutual overlapping occlusion between heads and between heads and leaves, and blurring of wheat heads caused by shooting, all of which affect the accuracy of wheat head recognition. Dong [35], Liu [36], and other researchers have all proposed that the overlapping occlusion of wheat heads will cause difficulties in detection. In this study, these three types of images were selected as the test set to compare the correlation between the predicted wheat heads of YOLOv7-MA, YOLOv7, YOLOX, YOLOv5, and Faster-RCNN and manual counting results.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…In-field wheat field backgrounds are complex, with poor illumination conditions, mutual overlapping occlusion between heads and between heads and leaves, and blurring of wheat heads caused by shooting, all of which affect the accuracy of wheat head recognition. Dong [35], Liu [36], and other researchers have all proposed that the overlapping occlusion of wheat heads will cause difficulties in detection. In this study, these three types of images were selected as the test set to compare the correlation between the predicted wheat heads of YOLOv7-MA, YOLOv7, YOLOX, YOLOv5, and Faster-RCNN and manual counting results.…”
Section: Discussionmentioning
confidence: 99%
“…Dong et al [35] introduced a random polarized self-attention mechanism, SPSA, in both spatial and channel dimensions and effectively combined it with a random unit to improve the detection capability of overlapping and occluded wheat heads. Liu et al [36] proposed the YOLO-Extract algorithm based on YOLOv5, which enhances the feature extraction capability of wheat heads by introducing a coordinated attention mechanism. Zhou et al [37] presented the multi-window swine transformer network, which combines self-attention mechanisms and feature pyramid networks to extract multi-scale features and effectively improve the accuracy of detecting complex wheat heads in the field.…”
Section: Introductionmentioning
confidence: 99%
“…The objective of this research is to explore a novel model capable of real-time and accurate detection of insulator defects. Noting the rapid development of YOLO models [11] in the field of general object detection task and their wide application in various industries [12,13], this paper introduces and enhances YOLOv5 model to make it adapt to the task of insulator defect detection in complex environments. Different from Faster R-CNN, YOLO innovatively proposes a single-stage object detection method, which achieves a qualitative breakthrough in computing speed and is often used for various real-time object detection tasks.…”
Section: Introductionmentioning
confidence: 99%
“…This facilitates the extraction of key features from relevant areas within the image space and mitigates the issue of small object leakage. Liu et al [10] proposed the YOLO-extract algorithm, which is based on YOLOv5. The algorithm integrates coordinated attention into the network by adopting the concept of residual networks.…”
Section: Introductionmentioning
confidence: 99%