Pomelo Tree Detection Method Based on Attention Mechanism and Cross-Layer Feature Fusion

Yuan, Haotian; Huang, Ke-Kun; Ren, Chuan-Xian; Xiong, Yan; Duan, Jieli; Zhang, Yang

doi:10.3390/rs14163902

Cited by 10 publications

(9 citation statements)

References 58 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…As mentioned in Section 1, in our previous study, we proposed a YOLOx-nano pomelo tree detection method based on an attention mechanism and cross-layer feature fusion and showed that this method was more suitable for pomelo tree detection than was other state-of-the-art object detection algorithms. The present study showed that YOLOv5s and its attention-optimized models can detect IPTs with high accuracy, in line with the results of Yuan et al [49]. Although the structure of the network proposed by Yuan et al [49] was lightweight, the AP value reached 93.74%.…”

Section: Comparison With Other Related Worksupporting

confidence: 90%

“…The present study showed that YOLOv5s and its attention-optimized models can detect IPTs with high accuracy, in line with the results of Yuan et al [49]. Although the structure of the network proposed by Yuan et al [49] was lightweight, the AP value reached 93.74%. Our optimized YOLOv5s models fully outperformed this network in terms of AP, which were all higher than 94.00%, with the highest AP value of 94.50%.…”

Section: Comparison With Other Related Worksupporting

confidence: 90%

“…However, most of those previous works needed to improve the architecture of CNN networks to increase the performance of deep learning models. Yuan et al [49] proposed an improved YOLOx-nano algorithm to detect pomelo trees and compared it with several state-of-the-art object detection algorithms, such as Faster R-CNN, SSD, YOLOv3, and YOLOv4-tiny. Their method showed better suitability for pomelo tree detection in UAV images with better performance and fewer parameters.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Detecting and Mapping Individual Fruit Trees in Complex Natural Environments via UAV Remote Sensing and Optimized YOLOv5

Xiong,

Zeng,

Lai

et al. 2024

IEEE J. Sel. Top. Appl. Earth Observations Remote Sensing

Self Cite

View full text Add to dashboard Cite

The location and number of individual fruit trees (IFTs) are critical for investigations on planting areas, fruit yield predictions, and smart orchard planning and management. These data are conventionally obtained through manual and statistical investigations that require long, laborious, and costly efforts. Object detection models of deep learning could provide an opportunity to detect IFTs accurately, which is essential for rapidly obtaining these data and reducing human operation errors. This study proposed an approach for detecting IFTs and mapping their spatial distributions by integrating deep learning with unmanned aerial vehicle (UAV) remote sensing. UAV remote sensing was used to collect high-resolution images of fruit trees in pomelo orchards in Meizhou, South China. Based on these images, a new individual pomelo tree image sample (IPTIS) dataset was created through manual interpretation and field investigation. The evaluation results revealed that YOLOv5s was the best model among the five YOLOv5 models (i.e., YOLOv5n, YOLOv5s, YOLOv5m, YOLOv5l, and YOLOv5x, whose layers, parameters, and floating-point operations all increased with the depth and width of layers.) of different scales considered for optimization. Moreover, the coordinate attention (CA)-optimized YOLOv5 model (YOLOv5s-CA) is the best model (named Manuscript

show abstract

Section: Comparison With Other Related Worksupporting

confidence: 90%

Section: Comparison With Other Related Worksupporting

confidence: 90%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Detecting and Mapping Individual Fruit Trees in Complex Natural Environments via UAV Remote Sensing and Optimized YOLOv5

Xiong,

Zeng,

Lai

et al. 2024

IEEE J. Sel. Top. Appl. Earth Observations Remote Sensing

Self Cite

View full text Add to dashboard Cite

show abstract

“…In the training process of large-scale remote sensing image object extraction algorithms, CNNs are usually used. Due to the limited size of shared convolutional kernels involved in network operations and not changing based on the size of the extracted object in the task, its global modeling ability is limited, which weakens the connection between the object to be extracted and its background in the image, resulting in the loss of some implicit spatial relationship information 52 , 53 . In the process of purifying and fusing feature information using feature extraction networks, the number of pixels per unit area decreases exponentially with the size of the feature map, resulting in an increase in the object information represented by a single pixel.…”

Section: Methodsmentioning

confidence: 99%

“…Due to the limited size of shared convolutional kernels involved in network operations and not changing based on the size of the extracted object in the task, its global modeling ability is limited, which weakens the connection between the object to be extracted and its background in the image, resulting in the loss of some implicit spatial relationship information. 52,53 In the process of purifying and fusing feature information using feature extraction networks, the number of pixels per unit area decreases exponentially with the size of the feature map, resulting in an increase in the object information represented by a single pixel. When the objects are densely arranged, due to the continuous feature information abstraction, interference and mosaic phenomena will appear between the feature information of adjacent objects, resulting in dense object features being difficult to distinguish and interfering with each other.…”

Section: Dual Attention Mechanism Modulementioning

confidence: 99%

Optimized anchor-free network for dense rotating object detection in remote sensing images

Yan,

Zhang,

Hong

et al. 2023

J. Electron. Imag.

View full text Add to dashboard Cite

Extracting dense rotating objects accurately from remote sensing images is an emerging task in object detection. To increase the applicability of existing algorithms in the above tasks, an optimized anchor-free network optimized by a dual attention mechanism (DAM) and gate multiscale feature fusion (GMFF) is designed. The DAM module is composed of two attention mechanisms with different functions. This part can enhance the backbone network's ability to extract and model information at different levels and reduce the accuracy loss caused by object density changes in the image. The GMFF module uses the gating structure to realize adaptive transmission and integration of multiscale information. Through this module, the useless information in features will be filtered, and the key information will be retained. Several experiments are designed to verify the feasibility of the algorithm. Compared with the baseline model, adding DAM and GMFF to the dense rotating object extraction task in remote sensing images improves the model accuracy by 3.5% and 2.1%, respectively, while adding two modules simultaneously, and the accuracy increases from 79.1% to 84.3%. In conventional object extraction tasks, such as dataset for object detection in aerial images and HRSC2016, our method has the highest accuracy compared to other similar algorithms, with 76.5% and 90.3%, respectively.

show abstract

Detection network for multi-size and multi-target tea bud leaves in the field of view via improved YOLOv7

Chen,

Li,

Chen

et al. 2024

Computers and Electronics in Agriculture

View full text Add to dashboard Cite

Pomelo Tree Detection Method Based on Attention Mechanism and Cross-Layer Feature Fusion

Cited by 10 publications

References 58 publications

Detecting and Mapping Individual Fruit Trees in Complex Natural Environments via UAV Remote Sensing and Optimized YOLOv5

Detecting and Mapping Individual Fruit Trees in Complex Natural Environments via UAV Remote Sensing and Optimized YOLOv5

Optimized anchor-free network for dense rotating object detection in remote sensing images

Detection network for multi-size and multi-target tea bud leaves in the field of view via improved YOLOv7

Contact Info

Product

Resources

About