A Multi-Branch Feature Fusion Strategy Based on an Attention Mechanism for Remote Sensing Image Scene Classification

Shi, Cuiping; Zhao, Xiaonan; Wang, Liguo

doi:10.3390/rs13101950

Cited by 34 publications

(22 citation statements)

References 46 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…To demonstrate the superiority of our proposed method, it is compared with other methods on UCM, including Bidirectional adaptive feature fusion method (BDFF method) [25], Multiscale CNN (MCNN) [37], ResNet with weighted spatial pyramid matching collaborative representation-based classification (ResNet with WSPM-CRC) [38], VGG16 with multi-layer stacked covariance pooling (VGG16 with MSCP) [26], Gated bidirectional network (GBNet) [29], Feature aggregation CNN (FACNN) [39], Scale-free CNN (SF-CNN) [40], Deep discriminative representation learning with attention map method (DDRL-AM method) [41], and CNN based on attention-oriented multi-branch feature fusion (AMB-CNN) [42]. The training ratio of 80% is used on this dataset, and OA is taken as the evaluation index.…”

Section: Results On Ucmmentioning

confidence: 99%

A Deformable Convolutional Neural Network with Spatial-Channel Attention for Remote Sensing Scene Classification

Wang

Lan

2021

Remote Sensing

View full text Add to dashboard Cite

Remote sensing scene classification converts remote sensing images into classification information to support high-level applications, so it is a fundamental problem in the field of remote sensing. In recent years, many convolutional neural network (CNN)-based methods have achieved impressive results in remote sensing scene classification, but they have two problems in extracting remote sensing scene features: (1) fixed-shape convolutional kernels cannot effectively extract features from remote sensing scenes with complex shapes and diverse distributions; (2) the features extracted by CNN contain a large number of redundant and invalid information. To solve these problems, this paper constructs a deformable convolutional neural network to adapt the convolutional sampling positions to the shape of objects in the remote sensing scene. Meanwhile, the spatial and channel attention mechanisms are used to focus on the effective features while suppressing the invalid ones. The experimental results indicate that the proposed method is competitive to the state-of-the-art methods on three remote sensing scene classification datasets (UCM, NWPU, and AID).

show abstract

Section: Results On Ucmmentioning

confidence: 99%

A Deformable Convolutional Neural Network with Spatial-Channel Attention for Remote Sensing Scene Classification

Wang

Lan

2021

Remote Sensing

View full text Add to dashboard Cite

show abstract

“…In contrast, our global information extraction considers the linkage of various locations on the image, and the accuracy is 1.14% higher than GLANet when the training ratio is 10% and 0.55% higher than GLANet when the training ratio is 20%. [64] 91.03 ± 0.18 93.45 ± 0.17 SCCov [58] 89.30 ± 0.35 92.10 ± 0.25 DDRL-AM [40] 92.17 ± 0.08 92.46 ± 0.09 AMB-CNN [59] 88.99 ± 0.14 92.42 ± 0.14 ResNet-50+EAM [60] 90.87 ± 0.15 93.51 ± 0.12 Attention based Residual Network [61] -92.10 ± 0.30 ACNet [62] 91.09 ± 0.13 92.42 ± 0.16 Our Method 92.11 ± 0.06 94.00 ± 0.13…”

Section: Accuracy Evaluationmentioning

confidence: 99%

“…Compared with other methods, it can describe the content in the scene more effectively and has better accuracy. GLANet [64] 95.02 ± 0.28 96.66 ± 0.19 SCCov [58] 93.12 ± 0.25 96.10 ± 0.16 DDRL-AM [40] 92.36 ± 0.10 -AMB-CNN [59] 93.27 ± 0.22 95.54 ± 0.13 ResNet-50+EAM [60] 93.64 ± 0.25 96.62 ± 0.13 ACNet [62] 93. 33…”

Section: Accuracy Evaluationmentioning

confidence: 99%

Remote Sensing Image Scene Classification Based on Global Self-Attention Module

Yan

2021

Remote Sensing

View full text Add to dashboard Cite

The complexity of scene images makes the research on remote-sensing image scene classification challenging. With the wide application of deep learning in recent years, many remote-sensing scene classification methods using a convolutional neural network (CNN) have emerged. Current CNN usually output global information by integrating the depth features extricated from the convolutional layer through the fully connected layer; however, the global information extracted is not comprehensive. This paper proposes an improved remote-sensing image scene classification method based on a global self-attention module to address this problem. The global information is derived from the depth characteristics extracted by the CNN. In order to better express the semantic information of the remote-sensing image, the multi-head self-attention module is introduced for global information augmentation. Meanwhile, the local perception unit is utilized to improve the self-attention module’s representation capabilities for local objects. The proposed method’s effectiveness is validated through comparative experiments with various training ratios and different scales on public datasets (UC Merced, AID, and NWPU-NESISC45). The precision of our proposed model is significantly improved compared to other methods for remote-sensing image scene classification.

show abstract

“…Xie et al [43] developed a remote sensing image scene classification model with label augmentation, in which Kullback-Leibler divergence is utilized as the intra-class constraint to restrict the distribution of training data. Shi et al [44] proposed a lightweight CNN based on attention-oriented multi-branch feature fusion for remote sensing image scene classification.…”

Section: Remote Sensing Image Scene Classificationmentioning

confidence: 99%

Prototype Calibration with Feature Generation for Few-Shot Remote Sensing Image Scene Classification

et al. 2021

View full text Add to dashboard Cite

Few-shot classification of remote sensing images has attracted attention due to its important applications in various fields. The major challenge in few-shot remote sensing image scene classification is that limited labeled samples can be utilized for training. This may lead to the deviation of prototype feature expression, and thus the classification performance will be impacted. To solve these issues, a prototype calibration with a feature-generating model is proposed for few-shot remote sensing image scene classification. In the proposed framework, a feature encoder with self-attention is developed to reduce the influence of irrelevant information. Then, the feature-generating module is utilized to expand the support set of the testing set based on prototypes of the training set, and prototype calibration is proposed to optimize features of support images that can enhance the representativeness of each category features. Experiments on NWPU-RESISC45 and WHU-RS19 datasets demonstrate that the proposed method can yield superior classification accuracies for few-shot remote sensing image scene classification.

show abstract

A Multi-Branch Feature Fusion Strategy Based on an Attention Mechanism for Remote Sensing Image Scene Classification

Cited by 34 publications

References 46 publications

A Deformable Convolutional Neural Network with Spatial-Channel Attention for Remote Sensing Scene Classification

A Deformable Convolutional Neural Network with Spatial-Channel Attention for Remote Sensing Scene Classification

Remote Sensing Image Scene Classification Based on Global Self-Attention Module

Prototype Calibration with Feature Generation for Few-Shot Remote Sensing Image Scene Classification

Contact Info

Product

Resources

About