TranSDet: Toward Effective Transfer Learning for Small-Object Detection

Xu, Xinkai; Zhang, Hailan; Ma, Yong; Liu, Kang; Bao, Hong; Xu, Qian

doi:10.3390/rs15143525

Cited by 14 publications

(2 citation statements)

References 61 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…To mitigate these challenges, various innovative solutions have emerged. One such solution [12] involves per-pixel prediction, while another approach [13,14] utilizes key points to replace anchor boxes, with enhancements made through the incorporation of a central point. Additionally, some methods [15] leverage global context information between detected instances and images to eliminate the reliance on anchor boxes and non-maximum suppression (NMS).…”

Section: Related Workmentioning

confidence: 99%

Enhancing Small Object Detection in Aerial Images: A Novel Approach with PCSG Model

An,

Duanmu,

et al. 2024

Aerospace

View full text Add to dashboard Cite

Generalized target detection algorithms perform well for large- and medium-sized targets but struggle with small ones. However, with the growing importance of aerial images in urban transportation and environmental monitoring, detecting small targets in such imagery has been a promising research hotspot. The challenge in small object detection lies in the limited pixel proportion and the complexity of feature extraction. Moreover, current mainstream detection algorithms tend to be overly complex, leading to structural redundancy for small objects. To cope with these challenges, this paper recommends the PCSG model based on yolov5, which optimizes both the detection head and backbone networks. (1) An enhanced detection header is introduced, featuring a new structure that enhances the feature pyramid network and the path aggregation network. This enhancement bolsters the model’s shallow feature reuse capability and introduces a dedicated detection layer for smaller objects. Additionally, redundant structures in the network are pruned, and the lightweight and versatile upsampling operator CARAFE is used to optimize the upsampling algorithm. (2) The paper proposes the module named SPD-Conv to replace the strided convolution operation and pooling structures in yolov5, thereby enhancing the backbone’s feature extraction capability. Furthermore, Ghost convolution is utilized to optimize the parameter count, ensuring that the backbone meets the real-time needs of aerial image detection. The experimental results from the RSOD dataset show that the PCSG model exhibits superior detection performance. The value of mAP increases from 97.1% to 97.8%, while the number of model parameters decreases by 22.3%, from 1,761,871 to 1,368,823. These findings unequivocally highlight the effectiveness of this approach.

show abstract

Section: Related Workmentioning

confidence: 99%

Enhancing Small Object Detection in Aerial Images: A Novel Approach with PCSG Model

An,

Duanmu,

et al. 2024

Aerospace

View full text Add to dashboard Cite

show abstract

“…From the previous section, detecting small fire images has some limitations in our model. To solve the accuracy factor, we drove forward the concept of the TranSDet [67] model. This model proposes a meta-learning-based dynamic resolution adaption transfer learning (DRAT) schema to adapt the pre-trained general model to detect small objects.…”

Section: Detect Small-size Imagementioning

confidence: 99%

Robust Forest Fire Detection Method for Surveillance Systems Based on You Only Look Once Version 8 and Transfer Learning Approaches

Yunusov,

Islam,

Abdusalomov

et al. 2024

Processes

View full text Add to dashboard Cite

Forest fires have emerged as a significant global concern, exacerbated by both global warming and the expanding human population. Several adverse outcomes can result from this, including climatic shifts and greenhouse effects. The ramifications of fire incidents extend widely, impacting human communities, financial resources, the natural environment, and global warming. Therefore, timely fire detection is essential for quick and effective response and not to endanger forest resources, animal life, and the human economy. This study introduces a forest fire detection approach utilizing transfer learning with the YOLOv8 (You Only Look Once version 8) pretraining model and the TranSDet model, which integrates an improved deep learning algorithm. Transfer Learning based on pre-trained YoloV8 enhances a fast and accurate object detection aggregate with the TranSDet structure to detect small fires. Furthermore, to train the model, we collected 5200 images and performed augmentation techniques for data, such as rotation, scaling, and changing due and saturation. Small fires can be detected from a distance by our suggested model both during the day and at night. Objects with similarities can lead to false predictions. However, the dataset augmentation technique reduces the feasibility. The experimental results prove that our proposed model can successfully achieve 98% accuracy to minimize catastrophic incidents. In recent years, the advancement of deep learning techniques has enhanced safety and secure environments. Lastly, we conducted a comparative analysis of our method’s performance based on widely used evaluation metrics to validate the achieved results.

show abstract

Bi-AFN++CA: Bi-directional adaptive fusion network combining context augmentation for small object detection

Zhang,

Chen

2023

Appl Intell

View full text Add to dashboard Cite

TranSDet: Toward Effective Transfer Learning for Small-Object Detection

Cited by 14 publications

References 61 publications

Enhancing Small Object Detection in Aerial Images: A Novel Approach with PCSG Model

Enhancing Small Object Detection in Aerial Images: A Novel Approach with PCSG Model

Robust Forest Fire Detection Method for Surveillance Systems Based on You Only Look Once Version 8 and Transfer Learning Approaches

Bi-AFN++CA: Bi-directional adaptive fusion network combining context augmentation for small object detection

Contact Info

Product

Resources

About