2022
DOI: 10.3390/sym14050906
|View full text |Cite
|
Sign up to set email alerts
|

A Multi-Attention UNet for Semantic Segmentation in Remote Sensing Images

Abstract: In recent years, with the development of deep learning, semantic segmentation for remote sensing images has gradually become a hot issue in computer vision. However, segmentation for multicategory targets is still a difficult problem. To address the issues regarding poor precision and multiple scales in different categories, we propose a UNet, based on multi-attention (MA-UNet). Specifically, we propose a residual encoder, based on a simple attention module, to improve the extraction capability of the backbone… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
25
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 41 publications
(25 citation statements)
references
References 42 publications
0
25
0
Order By: Relevance
“…Sun et al and Liu et al applied an attention mechanism in the semantic segmentation task of remote sensing images. They proposed a multi-attention-based UNet [ 41 ] and an attention-based residual encoder [ 42 ], respectively. Through channel attention and spatial attention, the capability of fine-grained features was improved.…”
Section: Related Workmentioning
confidence: 99%
“…Sun et al and Liu et al applied an attention mechanism in the semantic segmentation task of remote sensing images. They proposed a multi-attention-based UNet [ 41 ] and an attention-based residual encoder [ 42 ], respectively. Through channel attention and spatial attention, the capability of fine-grained features was improved.…”
Section: Related Workmentioning
confidence: 99%
“…ResUnet++ is an extension of the u-net architecture that incorporates residual connections in the encoder and the decoder paths [ 29 ]. DoubleU-Net is another variant of the u-net architecture that includes Atrous Spatial Pyramid Pooling (ASPP) to selectively highlight contextual features [ 30 ]. Deep Residual u-net is a variant of the u-net architecture that contains residual connections and deep supervision to improve segmentation accuracy [ 31 ].…”
Section: Related Workmentioning
confidence: 99%
“…The past decade has witnessed the continuous development of neural network, which is also used in cloud segmentation [8-[13], power transmission system [14], and building segmentation [15][16][17][18][19][20], etc. In 2015, Long et al [21] proposed the Fully Convolutional Network (FCN), which was the first network successfully used for image segmentation.…”
Section: Introductionmentioning
confidence: 99%
“…Zhao et al [24] proposed the Pyramid Scene Parsing Network (PSPNet), which used a pyramid pooling module to extract contextual information and stacked the contextual information with the extracted feature information to complete image segmentation, urban buildings are obscured by shadows on high-resolution remote sensing images, and the network 's extraction of object information is incomplete. Sun et al [17] proposed a Multi-Attention UNet (MA-UNet) based on the UNet network, adding a residual encoder with an attention mechanism and a self-attention mechanism. However, due to the complex background of the ground features and irregular boundaries, this method will produce some misclassifications and omissions during extraction.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation