2022
DOI: 10.1117/1.jrs.16.046509
|View full text |Cite
|
Sign up to set email alerts
|

Transformer-based multi-scale feature fusion network for remote sensing change detection

Abstract: Change detection (CD) is the operation of quantitatively analyzing the surface changes of a phenomenon or objects over two different times. Lately, CD based on deep learning has developed to become more and more powerful, and convolutional neural networks (CNNs) have dominated the field of remote sensing (RS) CD. In particular, in many fields of computer vision, neural networks based on U-Net network and skip connections have been generally used. However, despite the excellent performance achieved by CNN, it d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
3
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 60 publications
0
3
0
Order By: Relevance
“…Among these deep learning methods, supervised learning methods have performed successfully in remote sensing image CD due to their powerful modeling and learning capabilities 12 16 For example, Deng et al 15 . incorporated depthwise separable convolution and multi-headed self-attention in UNet++, which effectively enhances the network’s ability to extract local and global information.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Among these deep learning methods, supervised learning methods have performed successfully in remote sensing image CD due to their powerful modeling and learning capabilities 12 16 For example, Deng et al 15 . incorporated depthwise separable convolution and multi-headed self-attention in UNet++, which effectively enhances the network’s ability to extract local and global information.…”
Section: Introductionmentioning
confidence: 99%
“…incorporated depthwise separable convolution and multi-headed self-attention in UNet++, which effectively enhances the network’s ability to extract local and global information. Yan et al 16 . proposed a multilevel feature aggregation and enhancement network to sense information at different scales and capture their dependencies.…”
Section: Introductionmentioning
confidence: 99%
“…However, because the current segmentation methods based on brain tumors mostly rely on CNN and its variants, although CNN has achieved excellent performance, it cannot learn global and remote semantic information interaction well due to the locality of convolutional operation [13], [14], [15], lacks the ability to model long-term dependencies explicitly. later, although some researchers have introduced the brilliant transformer architecture in the field of Natural Language Processing(NLP) into the field of image segmentation, transformer and its variants, such as vision transformer, require a large number of data sets for pre-training.…”
mentioning
confidence: 99%