Dual-Resolution and Deformable Multihead Network for Oriented Object Detection in Remote Sensing Images

Yu, Donghang; Xu, Qing; Liu, Xiangyun; Guo, Hao; Lu, Jun; Lin, Yun; Lyu, Liang

doi:10.1109/jstars.2022.3230797

Cited by 3 publications

(2 citation statements)

References 67 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Meanwhile, attention mechanism, which focuses on important features and suppresses unnecessary ones, has been widely integrated in CNNs, especially in U-Net like or other variants of the encoder-decoder architecture, for improving the representation of interests and the segmentation results. For example, Cui et al [23] created a reverse attention module that suppresses seawater features, enabling the learning characteristics for both apparent and inapparent aquaculture sites. Qin et al [24] embedded the convolutional block attention module (CBAM) [25] into the decoder of the network they proposed to gain accurate feature maps for offshore farm extraction, etc.…”

Section: Introductionmentioning

confidence: 99%

An Attention-Fused Deep Learning Model for Accurately Monitoring Cage and Raft Aquaculture at Large-Scale Using Sentinel-2 Data

Xu,

2024

IEEE J. Sel. Top. Appl. Earth Observations Remote Sensing

View full text Add to dashboard Cite

Cage and raft aquaculture (CRA) is vital for coastal economy and provides high-quality aquatic products. Accurately monitoring large-scale CRA lays the foundation for predicting CRA product yield and mitigating environmental impacts. This study, focusing on the challenges of detecting large-scale CRA from freely-downloaded, multi-spectral remote sensing imagery due to the complexity of both CRA and marine environment, proposed an attention-fused deep learning model for accurately retrieving large-scale CRA in China's offshore sea using open-source Sentinel-2 (S2) satellite data. We first downloaded the cloud-free preprocessed S2 images in selected study areas. Manual labeling of cage, raft and background areas was performed using high-resolution remote sensing images, with labeled images clipped into 32×32 patches. To enhance the perception ability of feature, the convolutional block attention module was integrated into the well-performing UNet++ by incorporating both channel and spatial attention in each convolutional block of encoder as well as the Level 1 convolutional blocks of decoder. Using the sample dataset in 2021, the proposed AF-UNet++ was trained, and compared to four mainstream convolutional neural networks, and then adopted to map CRA in both 2021 and 2018 in the study areas, as well as four additional sites. Experimental results demonstrate: 1) Our model has the highest OA, F1 and mIoU, with IoU for raft 4.15% higher than other models. 2) Visual comparisons illustrate that AF-UNet++ best excels in extracting CRA. 3) Extraction results both in 2021 and 2018 confirm the proposed model can effectively monitoring large-scale CRA and has the spatio-temporal stability.

show abstract

Section: Introductionmentioning

confidence: 99%

An Attention-Fused Deep Learning Model for Accurately Monitoring Cage and Raft Aquaculture at Large-Scale Using Sentinel-2 Data

Xu,

2024

IEEE J. Sel. Top. Appl. Earth Observations Remote Sensing

View full text Add to dashboard Cite

show abstract

“…Different methods have been developed to address these problems. Yu et al [15] employed deformable convolution to align feature maps of different scales, and designed a feature fusion module using dilated convolution to enhance the perception of object shape and direction. Hou et al [16] designed an asymmetric feature pyramid network to enrich the spatial representation of features and improve the detection of objects with extreme aspect ratios.…”

Section: Introductionmentioning

confidence: 99%

Adaptive Adjacent Layer Feature Fusion for Object Detection in Remote Sensing Images

Zhang,

Gong,

Guo

et al. 2023

Remote Sensing

Self Cite

View full text Add to dashboard Cite

Object detection in remote sensing images faces the challenges of a complex background, large object size variations, and high inter-class similarity. To address these problems, we propose an adaptive adjacent layer feature fusion (AALFF) method, which is developed on the basis of RTMDet. Specifically, the AALFF method incorporates an adjacent layer feature fusion enhancement (ALFFE) module, designed to capture high-level semantic information and accurately locate object spatial positions. ALFFE also effectively preserves small objects by fusing adjacent layer features and employs involution to aggregate contextual information in a wide spatial range for object essential features extraction in complex backgrounds. Additionally, the adaptive spatial feature fusion (ASFF) module is introduced to guide the network to select and fuse the crucial features to improve the adaptability to objects with different sizes. The proposed method achieves mean average precision (mAP) values of 77.1%, 88.9%, and 95.7% on the DIOR, HRRSD, and NWPU VHR-10 datasets, respectively. Notably, our approach achieves mAP75 values of 60.8% and 79.0% on the DIOR and HRRSD datasets, respectively, surpassing the state-of-the-art performance on the DIOR dataset.

show abstract

Ship Detection with Deep Learning in Optical Remote-Sensing Images: A Survey of Challenges and Advances

Zhao,

Wang,

et al. 2024

Remote Sensing

View full text Add to dashboard Cite

Ship detection aims to automatically identify whether there are ships in the images, precisely classifies and localizes them. Regardless of whether utilizing early manually designed methods or deep learning technology, ship detection is dedicated to exploring the inherent characteristics of ships to enhance recall. Nowadays, high-precision ship detection plays a crucial role in civilian and military applications. In order to provide a comprehensive review of ship detection in optical remote-sensing images (SDORSIs), this paper summarizes the challenges as a guide. These challenges include complex marine environments, insufficient discriminative features, large scale variations, dense and rotated distributions, large aspect ratios, and imbalances between positive and negative samples. We meticulously review the improvement methods and conduct a detailed analysis of the strengths and weaknesses of these methods. We compile ship information from common optical remote sensing image datasets and compare algorithm performance. Simultaneously, we compare and analyze the feature extraction capabilities of backbones based on CNNs and Transformer, seeking new directions for the development in SDORSIs. Promising prospects are provided to facilitate further research in the future.

show abstract

Dual-Resolution and Deformable Multihead Network for Oriented Object Detection in Remote Sensing Images

Cited by 3 publications

References 67 publications

An Attention-Fused Deep Learning Model for Accurately Monitoring Cage and Raft Aquaculture at Large-Scale Using Sentinel-2 Data

An Attention-Fused Deep Learning Model for Accurately Monitoring Cage and Raft Aquaculture at Large-Scale Using Sentinel-2 Data

Adaptive Adjacent Layer Feature Fusion for Object Detection in Remote Sensing Images

Ship Detection with Deep Learning in Optical Remote-Sensing Images: A Survey of Challenges and Advances

Contact Info

Product

Resources

About