Transformer models hold great promise for remote sensing super-resolution (SR) thanks to their self-attention mechanism. However, their large parameter counts make them prone to overfitting, especially on the typically small remote sensing datasets. In addition, transformer-based SR models usually rely on convolution-based upsampling, which often leads to mismatched semantic information. To address these challenges, we propose an efficient hybrid super-resolution network (EHNet), whose encoder is built from our lightweight convolution module and whose decoder is an improved Swin Transformer. The encoder features our novel Lightweight Feature Extraction Block (LFEB), which builds on depthwise convolution to provide a more efficient alternative to depthwise separable convolution and integrates a Cross Stage Partial (CSP) structure for enhanced feature extraction. For the decoder, we propose, to the best of our knowledge for the first time, a sequence-based upsample block (SUB) built on the Swin Transformer: it operates directly on the transformer's token sequence and captures semantic information through an MLP layer, which strengthens the model's feature representation and improves reconstruction accuracy. Experiments show that EHNet achieves state-of-the-art PSNR of 28.02 dB and 29.44 dB on the UCMerced and AID datasets, respectively, and produces visually better reconstructions than existing methods. With only 2.64 M parameters, EHNet effectively balances reconstruction quality and computational cost.
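The core idea of a sequence-based upsample, as described above, can be sketched as follows. The exact SUB design in EHNet is not detailed in this abstract, so the single linear (MLP) projection, the scale factor, and the reshape order below are illustrative assumptions: each token's channels are expanded by a factor of r², and the token sequence is then rearranged into an r-times-larger spatial grid, i.e. a pixel-shuffle performed in token space rather than on convolutional feature maps.

```python
import numpy as np

def sequence_upsample(tokens, h, w, r, weight, bias):
    """Illustrative sequence-based upsampling (not the exact SUB).

    tokens: (h*w, c) token sequence from the transformer decoder
    weight: (c, c*r*r) MLP projection matrix; bias: (c*r*r,)
    Returns an (h*r * w*r, c) upsampled token sequence.
    """
    n, c = tokens.shape
    assert n == h * w, "token count must match the spatial grid"
    # MLP layer expands each token's channels by r*r
    x = tokens @ weight + bias          # (h*w, c*r*r)
    # rearrange the expanded channels into an r-times-larger grid
    x = x.reshape(h, w, r, r, c)        # split out the r*r factor
    x = x.transpose(0, 2, 1, 3, 4)      # (h, r, w, r, c)
    x = x.reshape(h * r, w * r, c)      # upsampled spatial map
    return x.reshape(h * r * w * r, c)  # back to a token sequence

# toy usage: an 8x8 token grid with 16 channels, 2x upsampling
rng = np.random.default_rng(0)
h = w = 8; c = 16; r = 2
tokens = rng.standard_normal((h * w, c))
W = rng.standard_normal((c, c * r * r)) * 0.02
b = np.zeros(c * r * r)
out = sequence_upsample(tokens, h, w, r, W, b)
print(out.shape)  # (256, 16): a 16x16 token grid after 2x upsampling
```

In a real model the projection would be a learned layer (and the MLP may be deeper); the point of the sketch is that upsampling stays in the token domain, so semantic information carried by the tokens is not lost to a separate convolutional upsampler.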