2022
DOI: 10.48550/arxiv.2210.17287
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A Versatile Diffusion-based Generative Refiner for Speech Enhancement

Abstract: Although deep neural network (DNN)-based speech enhancement (SE) methods outperform the previous non-DNN-based ones, they often degrade the perceptual quality of generated outputs. To tackle this problem, We introduce a DNN-based generative refiner aiming to improve perceptual speech quality pre-processed by an SE method. As the refiner, we train a diffusion-based generative model by utilizing a dataset consisting of clean speech only. Then, the model replaces the degraded and distorted parts caused by a prece… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
3
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(3 citation statements)
references
References 25 publications
0
3
0
Order By: Relevance
“…Research on speech enhancement has achieved significant improvement in terms of signal-to-noise (SNR) ratio but sometimes degrades the speech quality (e.g., naturalness), leading to the degradation of downstream applications. To remove the distortions of speech enhancement outputs, Refiner [93] applies a diffusion model pretrained on clean speech data to detect the degraded part, and then replaces them with newly generated clean ones in the manner of denoising diffusion restoration models(DDRM) [35]. Experimental results show that Refiner [93] is versatile since it improves speech quality with regard to various speech enhancement methods.…”
Section: Unsupervised Restorationmentioning
confidence: 99%
See 2 more Smart Citations
“…Research on speech enhancement has achieved significant improvement in terms of signal-to-noise (SNR) ratio but sometimes degrades the speech quality (e.g., naturalness), leading to the degradation of downstream applications. To remove the distortions of speech enhancement outputs, Refiner [93] applies a diffusion model pretrained on clean speech data to detect the degraded part, and then replaces them with newly generated clean ones in the manner of denoising diffusion restoration models(DDRM) [35]. Experimental results show that Refiner [93] is versatile since it improves speech quality with regard to various speech enhancement methods.…”
Section: Unsupervised Restorationmentioning
confidence: 99%
“…To remove the distortions of speech enhancement outputs, Refiner [93] applies a diffusion model pretrained on clean speech data to detect the degraded part, and then replaces them with newly generated clean ones in the manner of denoising diffusion restoration models(DDRM) [35]. Experimental results show that Refiner [93] is versatile since it improves speech quality with regard to various speech enhancement methods. Moreover, the Refiner [93] can also be integrated into the speech enhancement model for joint optimization in the future.…”
Section: Unsupervised Restorationmentioning
confidence: 99%
See 1 more Smart Citation