ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
DOI: 10.1109/icassp40776.2020.9054071
|View full text |Cite
|
Sign up to set email alerts
|

ESRGAN+ : Further Improving Enhanced Super-Resolution Generative Adversarial Network

Abstract: Enhanced Super-Resolution Generative Adversarial Network (ESRGAN) is a perceptual-driven approach for single image super resolution that is able to produce photorealistic images. Despite the visual quality of these generated images, there is still room for improvement. In this fashion, the model is extended to further improve the perceptual quality of the images. We have designed a novel block to replace the one used by the original ESRGAN. Moreover, we introduce noise inputs to the generator network in order … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
90
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 151 publications
(103 citation statements)
references
References 29 publications
0
90
0
Order By: Relevance
“…GANs provide a state-of-the-art framework for producing high-quality and "photorealistic" SRR images. Recent GAN variations [92][93][94] have been focusing on optimisations of the original residual architecture of the generator network [85,91] and/or on better modelling of the perceptual loss, in order to improve the visual quality of the SRR results. In this work, we based our model on the ESRGAN architecture [92] due to its solid performance on real-world images.…”
Section: Marsgan Architecturementioning
confidence: 99%
See 3 more Smart Citations
“…GANs provide a state-of-the-art framework for producing high-quality and "photorealistic" SRR images. Recent GAN variations [92][93][94] have been focusing on optimisations of the original residual architecture of the generator network [85,91] and/or on better modelling of the perceptual loss, in order to improve the visual quality of the SRR results. In this work, we based our model on the ESRGAN architecture [92] due to its solid performance on real-world images.…”
Section: Marsgan Architecturementioning
confidence: 99%
“…In this work, we based our model on the ESRGAN architecture [92] due to its solid performance on real-world images. Inspired by the adaptive weighted learning process proposed in AWSRN [88] and the optimisations introduced in ESRGANplus [93], we propose an Adaptive Weighted RRDB with noise inputs (AWRRDB) to replace the original RRDB basic block in ESRGAN for more effective and efficient residual learning. Moreover, we use a multi-scale reconstruction scheme [88] based on subpixel-shuffling [80,85] to replace the up-sampling layers used in ESRGAN to make full use of both low-frequency and high-frequency residuals while avoiding the checkerboard patterned artefacts from using up-sampling layers [91].…”
Section: Marsgan Architecturementioning
confidence: 99%
See 2 more Smart Citations
“…Recently, deep learning and machine learning have shown the potential ability and achieved outstanding performance in many fields [5][6][7][8][9][10], especially some image-to-image tasks such as semantic segmentation [11,12], image superresolution [13,14]. Inspired by this, works most recently tend to train an end-to-end CNN [14][15][16][17][18] for single image deblurring and achieve more effective results than previously established methods.…”
Section: Introductionmentioning
confidence: 99%