2022
DOI: 10.3390/rs14194732
|View full text |Cite
|
Sign up to set email alerts
|

Hyperspectral Image Classification via Spectral Pooling and Hybrid Transformer

Abstract: Hyperspectral images (HSIs) contain spatially structured information and pixel-level sequential spectral attributes. The continuous spectral features contain hundreds of wavelength bands and the differences between spectra are essential for achieving fine-grained classification. Due to the limited receptive field of backbone networks, convolutional neural networks (CNNs)-based HSI classification methods show limitations in modeling spectral-wise long-range dependencies with fixed kernel size and a limited numb… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 53 publications
0
2
0
Order By: Relevance
“…The core of the transformer lies in its multi-head self-attention mechanism. Figure 2 shows the structure of self-attention, which is actually the scaled dot-product attention mechanism (Ma et al, 2022). The self-attention head function can be represented as follows:…”
Section: Global Contextual Association Information Extraction Based O...mentioning
confidence: 99%
“…The core of the transformer lies in its multi-head self-attention mechanism. Figure 2 shows the structure of self-attention, which is actually the scaled dot-product attention mechanism (Ma et al, 2022). The self-attention head function can be represented as follows:…”
Section: Global Contextual Association Information Extraction Based O...mentioning
confidence: 99%
“…The novel backbone network rethinking HSI classification from a sequential perspective with transformers [51] introduces ViT to the field of HSI classification, but it largely focuses on mining spectral information and lacks the utilization of spatial information. To address this limitation, various methods were proposed to combine CNNs with ViT to extract both spatial and spectral features, including the convolutional transformer network [52], spatial-spectral transformer [53], and neighborhood enhancement hybrid transformer network [54]. Other approaches such as spectral-spatial feature tokenization transformer [55] and hyperspectral image transformer (HiT) [56] were developed to extract spectral-spatial features and semantic features.…”
Section: Introductionmentioning
confidence: 99%