2018
DOI: 10.3390/rs10101602
|View full text |Cite
|
Sign up to set email alerts
|

Attention-Mechanism-Containing Neural Networks for High-Resolution Remote Sensing Image Classification

Abstract: A deep neural network is suitable for remote sensing image pixel-wise classification because it effectively extracts features from the raw data. However, remote sensing images with higher spatial resolution exhibit smaller inter-class differences and greater intra-class differences; thus, feature extraction becomes more difficult. The attention mechanism, as a method that simulates the manner in which humans comprehend and perceive images, is useful for the quick and accurate acquisition of key features. In th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
45
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 65 publications
(45 citation statements)
references
References 41 publications
0
45
0
Order By: Relevance
“…Since SE-Net [41] only adopts the global average pooling [43] (pp. [29][30][31][32][33][34][35][36][37][38][39], it can encode the entire spatial feature on a channel as a global feature, which is effectively used for standard image segmentation. However, the most important channels in an interactive segmentation task are mostly decided by the user-interactions.…”
Section: Attention-guided Convolution (Agc) Modulementioning
confidence: 99%
See 3 more Smart Citations
“…Since SE-Net [41] only adopts the global average pooling [43] (pp. [29][30][31][32][33][34][35][36][37][38][39], it can encode the entire spatial feature on a channel as a global feature, which is effectively used for standard image segmentation. However, the most important channels in an interactive segmentation task are mostly decided by the user-interactions.…”
Section: Attention-guided Convolution (Agc) Modulementioning
confidence: 99%
“…First, we applied a global average pooling [43] (pp. [29][30][31][32][33][34][35][36][37][38][39] and a global max pooling [43] (pp. [29][30][31][32][33][34][35][36][37][38][39], respectively, to squeeze global spatial information, and obtained the output squeeze gap ∈ R C,1,1 and squeeze gmp ∈ R C,1,1 .…”
Section: Attention-guided Convolution (Agc) Modulementioning
confidence: 99%
See 2 more Smart Citations
“…Futhermore, even online learning applications with non-stationary and/or imbalanced streaming data can be managed by OS-ELM-based algorithms [33][34][35]. Although, obviously, there are applications that cannot be implemented on FPGA since their limiting resources, as the implementation of attention mechanisms in remote sensing image pixel-wise classification [36], acoustic adaptation models addressing the presence of microphone mismatch in Automatic Speech Recognition (ASR) systems [37], or the prediction of infectious diseases [38].…”
mentioning
confidence: 99%