2022
DOI: 10.3390/rs14143382
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Category Segmentation of Sentinel-2 Images Based on the Swin UNet Method

Abstract: Medium-resolution remote sensing satellites have provided a large amount of long time series and full coverage data for Earth surface monitoring. However, the different objects may have similar spectral values and the same objects may have different spectral values, which makes it difficult to improve the classification accuracy. Semantic segmentation of remote sensing images is greatly facilitated via deep learning methods. For medium-resolution remote sensing images, the convolutional neural network-based mo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 19 publications
(9 citation statements)
references
References 40 publications
0
9
0
Order By: Relevance
“…Since transformer-based models can acquire long-range dependencies and convolutional neural networks can capture fine-grained local features, existing literature tends to construct U-shaped architecture based on transformer blocks and convolutional neural networks, which exhibit promising results on remote sensing datasets [ 29 , 30 , 31 , 32 ]. Inspired by these, we propose a novel dual encoder of two branches: Swin Transformer blocks and reslayers in reverse order, along with a decoder of only Swin Transformer blocks.…”
Section: Methodsmentioning
confidence: 99%
“…Since transformer-based models can acquire long-range dependencies and convolutional neural networks can capture fine-grained local features, existing literature tends to construct U-shaped architecture based on transformer blocks and convolutional neural networks, which exhibit promising results on remote sensing datasets [ 29 , 30 , 31 , 32 ]. Inspired by these, we propose a novel dual encoder of two branches: Swin Transformer blocks and reslayers in reverse order, along with a decoder of only Swin Transformer blocks.…”
Section: Methodsmentioning
confidence: 99%
“…End-to-end semantic segmentation networks are predominantly employed in deep learning-based remote sensing image classification to accomplish pixel-level classification 38 . However, for complex feature objects, these semantic segmentation methods exhibit a "pretzel effect," as accurately determining the appropriate class for each pixel can be quite difficult 20,21 .…”
Section: Image Classification Techniquesmentioning
confidence: 99%
“…This technique is extremely useful for classifi cation tasks in EO due to images rarely containing only one class, and where contex within an image is important [12]. Figure 2 presents an example of multiclass image seg mentation in the context of land with several land cover classification [27]. The convolutional layer performs the majority of the computation.…”
Section: Image Segmentation and Cnnsmentioning
confidence: 99%