2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2018
DOI: 10.1109/icassp.2018.8462588
|View full text |Cite
|
Sign up to set email alerts
|

Speech Bandwidth Extension Using Generative Adversarial Networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
11
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 42 publications
(11 citation statements)
references
References 7 publications
0
11
0
Order By: Relevance
“…For BWE, its variable discrimination helps to refine details in the high frequencies. The previous GAN works in BWE [31,20,19] typically follow simple designs, using a discriminator of a few fully connected layers or convolutional layers on the spectral features, while the discrimination directly on waveform has rarely been employed.…”
Section: Related Workmentioning
confidence: 99%
“…For BWE, its variable discrimination helps to refine details in the high frequencies. The previous GAN works in BWE [31,20,19] typically follow simple designs, using a discriminator of a few fully connected layers or convolutional layers on the spectral features, while the discrimination directly on waveform has rarely been employed.…”
Section: Related Workmentioning
confidence: 99%
“…Generative Adversarial Networks (GANs) [1] are a class of approaches for learning generative models based on game theory. They find various applications in image processing [2,3,4], wireless communications [5,6] and signal processing [7,8]. The GANs framework can be also easily modified with other loss functions and learning dynamics, triggering numerous variants [9,10,11,12,13,14,15].…”
Section: Introductionmentioning
confidence: 99%
“…The value inside the bracket is default choice in our experiment.3 https://github.com/znxlwm/pytorch-generative-modelcollections4 http://yann.lecun.com/exdb/mnist/ 5 https://github.com/zalandoresearch/fashion-mnist/ 6 https://www.cs.toronto.edu/∼kriz/cifar.html7 We use a small version of imagenet publicly available online: http://image-net.org/small/train_64x64.tar…”
mentioning
confidence: 99%
“…GANs have predominantly been used in computer vision, including but not limited to image generation, face synthesis [8], image translation [9,10,11], texture synthesis [12,13], medical imaging, [14] and super-resolution [15]. Moreover, GANs can be applied in many other fields including but not limited to voice and speech signals [16,17,18], anomaly detection [19], power systems and smart grids [20,21,22], electronics [23,24], and fault diagnosis [25,26,27,28].…”
Section: Introductionmentioning
confidence: 99%