Chengyun Deng scite author profile

Chengyun Deng

5Publications

40Citation Statements Received

86Citation Statements Given

How they've been cited

How they cite others

103

Affiliations

Anhui University of Science and Technology, Beijing University of Posts and Telecommunications

Publications

Order By: Most citations

On Loss Functions and Recurrency Training for GAN-Based Speech Enhancement Systems

Zhang¹,

Deng²,

Shen³

et al. 2020

View full text Add to dashboard Cite

Recent work has shown that it is feasible to use generative adversarial networks (GANs) for speech enhancement, however, these approaches have not been compared to state-of-the-art (SOTA) non GAN-based approaches. Additionally, many loss functions have been proposed for GAN-based approaches, but they have not been adequately compared. In this study, we propose novel convolutional recurrent GAN (CRGAN) architectures for speech enhancement. Multiple loss functions are adopted to enable direct comparisons to other GAN-based systems. The benefits of including recurrent layers are also explored. Our results show that the proposed CRGAN model outperforms the SOTA GAN-based models using the same loss functions and it outperforms other non-GAN based systems, indicating the benefits of using a GAN for speech enhancement. Overall, the CRGAN model that combines an objective metric loss function with the mean squared error (MSE) provides the best performance over comparison approaches across many evaluation metrics.

show abstract

On Loss Functions and Recurrency Training for GAN-based Speech Enhancement Systems

Zhang

Deng²,

Shen

et al. 2020

Preprint

View full text Add to dashboard Cite

Robust Speaker Extraction Network Based on Iterative Refined Adaptation

Deng¹,

Ma²,

Sha³

et al. 2021

View full text Add to dashboard Cite

Speaker extraction aims to extract target speech signal from a multi-talker environment with interference speakers and surrounding noise, given the target speaker's reference information. Most speaker extraction systems achieve satisfactory performance on the premise that the test speakers have been encountered during training time. Such systems suffer from performance degradation given unseen target speakers and/or mismatched reference voiceprint information. In this paper we propose a novel strategy named Iterative Refined Adaptation (IRA) to improve the robustness and generalization capability of speaker extraction systems in the aforementioned scenarios. Given an initial speaker embedding encoded by an auxiliary network, the extraction network can obtain a latent representation of the target speaker, which is fed back to the auxiliary network to get a refined embedding to provide more accurate guidance for the extraction network. Experiments on WSJ0-2mix-extr and WHAM! dataset confirm the superior performance of the proposed method over the network without IRA in terms of SI-SDR and PESQ improvement.

show abstract

Speech Enhancement Based on A New Architecture of Wasserstein Generative Adversarial Networks

Jiang

Qin

et al. 2018

View full text Add to dashboard Cite

Conv-TasSAN: Separative Adversarial Network Based on Conv-TasNet

Deng¹,

Zhang²,

Ma³

et al. 2020

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Chengyun Deng

On Loss Functions and Recurrency Training for GAN-Based Speech Enhancement Systems

On Loss Functions and Recurrency Training for GAN-based Speech Enhancement Systems

Robust Speaker Extraction Network Based on Iterative Refined Adaptation

Speech Enhancement Based on A New Architecture of Wasserstein Generative Adversarial Networks

Conv-TasSAN: Separative Adversarial Network Based on Conv-TasNet

Contact Info

Product

Resources

About