Multi-Scale Feature Fusion Attention Network for Building Extraction in Remote Sensing Images

Liu, Jia; Gu, Hang; Li, Zuhe; Chen, Hongyang; Chen, Hao

doi:10.3390/electronics13050923

Cited by 4 publications

(1 citation statement)

References 44 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Zeng et al [19] proposed a new cross-scale semantic feature network by using the multiscale convolution module to obtain multiscale context from different receptive fields. Liu et al [20] developed a multi-resolution attention model based on multiscale channel and spatial attention for exacting important features. Xu et al [21] proposed a multiscale fusion network with atrous spatial pyramid pooling and varisized convolutions to effectively extract and fuse the features from multi-modal images.…”

Section: Introductionmentioning

confidence: 99%

An Efficient Semantic Segmentation Method for Remote-Sensing Imagery Using Improved Coordinate Attention

Huo,

Gang,

Dong

et al. 2024

Applied Sciences

View full text Add to dashboard Cite

Semantic segmentation stands as a prominent domain within remote sensing that is currently garnering significant attention. This paper introduces a pioneering semantic segmentation model based on TransUNet architecture with improved coordinate attention for remote-sensing imagery. It is composed of an encoding stage and a decoding stage. Notably, an enhanced and improved coordinate attention module is employed by integrating two pooling methods to generate weights. Subsequently, the feature map undergoes reweighting to accentuate foreground information and suppress background information. To address the issue of time complexity, this paper introduces an improvement to the transformer model by sparsifying the attention matrix. This reduces the computing expense of calculating attention, making the model more efficient. Additionally, the paper uses a combined loss function that is designed to enhance the training performance of the model. The experimental results conducted on three public datasets manifest the efficiency of the proposed method. The results indicate that it excels in delivering outstanding performance for semantic segmentation tasks pertaining to remote-sensing images.

show abstract

Section: Introductionmentioning

confidence: 99%