2021
DOI: 10.48550/arxiv.2103.16149
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Time-domain Speech Enhancement with Generative Adversarial Learning

Feiyang Xiao,
Jian Guan,
Qiuqiang Kong
et al.

Abstract: Speech enhancement aims to obtain speech signals with high intelligibility and quality from noisy speech. Recent work has demonstrated the excellent performance of time-domain deep learning methods, such as Conv-TasNet. However, these methods can be degraded by the arbitrary scales of the waveform induced by the scale-invariant signal-to-noise ratio (SI-SNR) loss. This paper proposes a new framework called Timedomain Speech Enhancement Generative Adversarial Network (TSEGAN), which is an extension of the gener… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 20 publications
0
2
0
Order By: Relevance
“…In addition, showed how the method may be used to regenerate whispered speech. The research [ 37 ] offers time-domain SE using GAN, an extension of the generative adversarial network in the time-domain with metric assessment to alleviate the scale issue and give model training stability, thereby improving performance. In addition, provides a novel approach based on objective function mapping to analyse Metric GAN’s performance and explain why it is superior to Wasserstein GAN.…”
Section: Related Studiesmentioning
confidence: 99%
“…In addition, showed how the method may be used to regenerate whispered speech. The research [ 37 ] offers time-domain SE using GAN, an extension of the generative adversarial network in the time-domain with metric assessment to alleviate the scale issue and give model training stability, thereby improving performance. In addition, provides a novel approach based on objective function mapping to analyse Metric GAN’s performance and explain why it is superior to Wasserstein GAN.…”
Section: Related Studiesmentioning
confidence: 99%
“…Xiao et al [13] developed time domain speech enhancement using generative adversarial network (GAN) to improve the performance of the generator and also compared difference GANs available for speech enhancement. Tan et al [14] proposed an end-to-end multi task model for VAD which increases the robustness of VAD system for low SNR conditions.…”
Section: Introductionmentioning
confidence: 99%