2024
DOI: 10.3390/electronics13050923
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Scale Feature Fusion Attention Network for Building Extraction in Remote Sensing Images

Jia Liu,
Hang Gu,
Zuhe Li
et al.

Abstract: The efficient semantic segmentation of buildings in high spatial resolution remote sensing images is a technical prerequisite for land resource management, high-precision mapping, construction planning and other applications. Current building extraction methods based on deep learning can obtain high-level abstract features of images. However, the extraction of some occluded buildings is inaccurate, and as the network deepens, small-volume buildings are lost and edges are blurred. Therefore, we introduce a mult… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 44 publications
0
1
0
Order By: Relevance
“…Zeng et al [19] proposed a new cross-scale semantic feature network by using the multiscale convolution module to obtain multiscale context from different receptive fields. Liu et al [20] developed a multi-resolution attention model based on multiscale channel and spatial attention for exacting important features. Xu et al [21] proposed a multiscale fusion network with atrous spatial pyramid pooling and varisized convolutions to effectively extract and fuse the features from multi-modal images.…”
Section: Introductionmentioning
confidence: 99%
“…Zeng et al [19] proposed a new cross-scale semantic feature network by using the multiscale convolution module to obtain multiscale context from different receptive fields. Liu et al [20] developed a multi-resolution attention model based on multiscale channel and spatial attention for exacting important features. Xu et al [21] proposed a multiscale fusion network with atrous spatial pyramid pooling and varisized convolutions to effectively extract and fuse the features from multi-modal images.…”
Section: Introductionmentioning
confidence: 99%