2022
DOI: 10.3389/fevo.2022.1083801
|View full text |Cite
|
Sign up to set email alerts
|

GSC-MIM: Global semantic integrated self-distilled complementary masked image model for remote sensing images scene classification

Abstract: Masked image modeling (MIM) is a learning method in which the unmasked components of the input are utilized to learn and predict the masked signal, enabling learning from large amounts of unannotated data. However, due to the scale diversity and complexity of features in remote sensing images (RSIs), existing MIMs face two challenges in the RSI scene classification task: (1) If the critical local patches of small-scale objects are randomly masked out, the model will be unable to learn its representation. (2) T… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 29 publications
0
1
0
Order By: Relevance
“…Based on this, Sun et al [103] proposed that RingMo uses the incomplete strategy method to improve the capture of small objects. Similar studies have been conducted [144], which have simultaneously reconstructed complementary masked-visible-region views and incorporated a global semantic distillation strategy (GSD) to ensure that salient areas of small objects are not lost. Muhtar et al [99] constructed a knowledge distillation network that inputs data-enhanced images to the teacher network and masked images to the student network for updated learning, and this method aims to accomplish the SSL task using the MIM.…”
Section: Masked Image Modeling Methods Based On Rsis For Semantic Seg...mentioning
confidence: 96%
“…Based on this, Sun et al [103] proposed that RingMo uses the incomplete strategy method to improve the capture of small objects. Similar studies have been conducted [144], which have simultaneously reconstructed complementary masked-visible-region views and incorporated a global semantic distillation strategy (GSD) to ensure that salient areas of small objects are not lost. Muhtar et al [99] constructed a knowledge distillation network that inputs data-enhanced images to the teacher network and masked images to the student network for updated learning, and this method aims to accomplish the SSL task using the MIM.…”
Section: Masked Image Modeling Methods Based On Rsis For Semantic Seg...mentioning
confidence: 96%