2023
DOI: 10.3390/app13148110
|View full text |Cite
|
Sign up to set email alerts
|

A Multi-Scale Deep Back-Projection Backbone for Face Super-Resolution with Diffusion Models

Abstract: Face verification and recognition are important tasks that have made great progress in recent years. However, recognizing low-resolution faces from small images is still a difficult problem. In this paper, we advocate using diffusion models (DMs) to enhance face resolution and improve their quality for various downstream applications. Most existing DMs for super-resolution use U-Net as their backbone network, which only exploits multi-scale features along the spatial dimension. These approaches result in a slo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 56 publications
0
1
0
Order By: Relevance
“…Inspired by the great success of generative adversarial networks (GANs) in the image processing field [16][17][18], Yang et al [19] introduced a collaborative suppression and replenishment framework based on GANs. Gao et al [20] proposed a conditional generative model based on the diffusion model, which replaces the U-Net in super-resolution to capture complex details and fine textures. Addressing the fact that GAN-based methods require greater computational resources, PCA-SRGAN [21] uses Principal Component Analysis decomposition, while SPGAN [22] employs supervised pixel-wise loss to ease the GAN training process.…”
Section: Introductionmentioning
confidence: 99%
“…Inspired by the great success of generative adversarial networks (GANs) in the image processing field [16][17][18], Yang et al [19] introduced a collaborative suppression and replenishment framework based on GANs. Gao et al [20] proposed a conditional generative model based on the diffusion model, which replaces the U-Net in super-resolution to capture complex details and fine textures. Addressing the fact that GAN-based methods require greater computational resources, PCA-SRGAN [21] uses Principal Component Analysis decomposition, while SPGAN [22] employs supervised pixel-wise loss to ease the GAN training process.…”
Section: Introductionmentioning
confidence: 99%