Pairwise Comparison Network for Remote-Sensing Scene Classification

Zhang, Yue; Zheng, Xiangtao; Lu, Xiaoqiang

doi:10.1109/lgrs.2021.3139695

Cited by 11 publications

(6 citation statements)

References 24 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The experiment was split into two parts, using 50% of the training samples or 20% of the test samples. Table 3 [40] 89.64 ± 0.36 86.59 ± 0.29 CRAN [42] 96.65 ± 0.20 95.24 ± 0.16 MobileNet V2 [43] 95.96 ± 0.27 94.13 ± 0.28 SE-MDPMNet [44] 97.14 ± 0.15 94.68 ± 0.07 Two-Stream Fusion [45] 94.58 ± 0.25 92.32 ± 0.41 ViT [4] 96.88 ± 0.19 95.58 ± 0.18 CFDNN [46] 96.56 ± 0.24 94.56 ± 0.24 Inception-v3-CapsNet [18] 96.32 ± 0.12 93.79 ± 0.13 GSSF [47] 97.65 ± 0.80 95.71 ± 0.22 PCNet [48] 96.76 ± 0.25 95.53 ± 0.16 GAN [26] 96. 45 Table 3 shows that FCIHMRT produced the greatest outcomes at both 50% and 20% training rates.…”

Section: Results Using Aidmentioning

confidence: 99%

“…Meanwhile, it was demonstrated that the OA of palace scenes was reduced to 80%, in which some palace scenes were categorized into the church intersection and island classes, indicating that the classification capacity of FCIHMRT still requires improvement for similar scenes but in general, it can distinguish different scenes with rich spatial information. [40] 79.79 ± 0.15 76.47 ± 0.18 CRAN [42] 94.07 ± 0.08 91.28 ± 0.19 MobileNet V2 [43] 83.26 ± 0.17 80.32 ± 0.16 SE-MDPMNet [44] 94.11 ± 0.03 91.80 ± 0.07 Two-Stream Fusion [45] 83.16 ± 0.18 80.22 ± 0.22 ViT [4] 94.50 ± 0.18 91.17 ± 0.13 CFDNN [46] 93.83 ± 0.09 91.17 ± 0.13 Inception-v3-CapsNet [18] 92.6 ± 0.11 89.03 ± 0.21 GSSF [47] 94.48 ± 0.26 91.98 ± 0.19 PCNet [48] 94.59 ± 0.07 92.64 ± 0.13 GAN [26] 93.63 ± 0.12 91.06 ± 0. classification accuracy of the golf course and mobile home park classes reached 99%, which indicates that FCIHMRT has a good classification performance for scenes with a small feature complexity. Meanwhile, it was demonstrated that the OA of palace scenes was reduced to 80%, in which some palace scenes were categorized into the church intersection and island classes, indicating that the classification capacity of FCIHMRT still requires improvement for similar scenes but in general, it can distinguish different scenes with rich spatial information.…”

Section: Results Using Nwpumentioning

confidence: 99%

“…GoogLeNet [40] 94.31 ± 0.89 92.70 ± 0.60 VGG-16 [40] 95.21 ± 1.20 94.14 ± 0.69 CRAN [42] 95.75 ± 0.80 94.21 ± 0.75 MobileNet V2 [43] 99.01 ± 0.21 97.88 ± 0.31 SE-MDPMNet [44] 98.95 ± 0.12 98.36 ± 0.14 Two-Stream Fusion [45] 98.02 ± 1.03 96.97 ± 0.75 ViT [4] 99.29 ± 0.34 98.75 ± 0.21 CFDNN [46] 98.62 ± 0.27 97.65 ± 0.18 Inception-v3-CapsNet [18] 99.05 ± 0.24 97.59 ± 0.16 GSSF [47] 99.24 ± 0.47 97.86 ± 0.56 PCNet [48] 99.25 ± 0.37 98.71 ± 0.22 GAN [26] 98.58 ± 0.33 97.54 ± 0. As shown in Figure 7, the confusion matrix with all 21 classes was also created to further examine the performance of FCIHMRT with an 80% training ratio.…”

Section: Methods 80% Training Ratio (Oa) 50% Training Ratio (Oa)mentioning

confidence: 99%

See 2 more Smart Citations

FCIHMRT: Feature Cross-Layer Interaction Hybrid Method Based on Res2Net and Transformer for Remote Sensing Scene Classification

Huo,

Gang,

Guan

2023

Electronics

View full text Add to dashboard Cite

Scene classification is one of the areas of remote sensing image processing that is gaining much attention. Aiming to solve the problem of the limited precision of optical scene classification caused by complex spatial patterns, a high similarity between classes, and a high diversity of classes, a feature cross-layer interaction hybrid algorithm for optical remote sensing scene classification is proposed in this paper. Firstly, a number of features are extracted from two branches, a vision transformer branch and a Res2Net branch, to strengthen the feature extraction capability of the strategy. A novel interactive attention technique is proposed, with the goal of focusing on the strong correlation between the two-branch features, to fully use the complementing advantages of the feature information. The retrieved feature data are further refined and merged. The combined characteristics are then employed for classification. The experiments were conducted by using three open-source remote sensing datasets to validate the feasibility of the proposed method, which performed better in scene classification tasks than other methods.

show abstract

Section: Results Using Aidmentioning

confidence: 99%

Section: Results Using Nwpumentioning

confidence: 99%

Section: Methods 80% Training Ratio (Oa) 50% Training Ratio (Oa)mentioning

confidence: 99%

See 1 more Smart Citation

FCIHMRT: Feature Cross-Layer Interaction Hybrid Method Based on Res2Net and Transformer for Remote Sensing Scene Classification

Huo,

Gang,

Guan

2023

Electronics

View full text Add to dashboard Cite

show abstract

“…Target dataset RESISC45 AID ResNet50+EAN (Zhao et al, 2020) 93.51 93.64 GLDBS (Xu et al, 2021) 94.46 95.45 PCNet (Zhang et al, 2021) 94.59 95.53 Million-AID (Long et al, 2022) 94.26 95.40 Domain-adaptive pre-training (ours)…”

Section: Methodsmentioning

confidence: 99%

“…In Table 10 the results on RESISC45 and AID obtained using DA pre-training are compared to recent state-of-the-art HRRS scene classification methods. Two of the methods, ResNet50+EAN (Zhao et al, 2020) and PCNet (Zhang et al, 2021) use the same ResNet-50 backbone as in our experiments, while GLDBS (Xu et al, 2021) uses ResNet-34. The best results obtained using pre-training on Million-AID used DenseNet-169 and ResNet-101 for classification of RESISC45 and AID, respectively.…”

Section: Feature Extractionmentioning

confidence: 99%

Do We Still Need Imagenet Pre-Training in Remote Sensing Scene Classification?

Risojević

Stojnić

2022

Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci.

View full text Add to dashboard Cite

Abstract. Due to the scarcity of labeled data, using supervised models pre-trained on ImageNet is a de facto standard in remote sensing scene classification. Recently, the availability of larger high resolution remote sensing (HRRS) image datasets and progress in self-supervised learning have brought up the questions of whether supervised ImageNet pre-training is still necessary for remote sensing scene classification and would supervised pre-training on HRRS image datasets or self-supervised pre-training on ImageNet achieve better results on target remote sensing scene classification tasks. To answer these questions, in this paper we both train models from scratch and fine-tune supervised and self-supervised ImageNet models on several HRRS image datasets. We also evaluate the transferability of learned representations to HRRS scene classification tasks and show that self-supervised pre-training outperforms the supervised one, while the performance of HRRS pre-training is similar to self-supervised pre-training or slightly lower. Finally, we propose using an ImageNet pre-trained model combined with a second round of pre-training using in-domain HRRS images, i.e. domain-adaptive pre-training. The experimental results show that domain-adaptive pre-training results in models that achieve state-of-the-art results on HRRS scene classification benchmarks. The source code and pre-trained models are available at https://github.com/risojevicv/RSSC-transfer.

show abstract