ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019
DOI: 10.1109/icassp.2019.8683154
|View full text |Cite
|
Sign up to set email alerts
|

Singing Voice Synthesis Based on Generative Adversarial Networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
39
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
3
3

Relationship

1
8

Authors

Journals

citations
Cited by 57 publications
(39 citation statements)
references
References 12 publications
0
39
0
Order By: Relevance
“…Non-Seq2Seq singing synthesizers include those based on autoregressive architectures [17,21,22], feed-forward CNN [23], and feed-forward GAN-based approaches [24,25].…”
Section: Relation To Prior Workmentioning
confidence: 99%
“…Non-Seq2Seq singing synthesizers include those based on autoregressive architectures [17,21,22], feed-forward CNN [23], and feed-forward GAN-based approaches [24,25].…”
Section: Relation To Prior Workmentioning
confidence: 99%
“…(6) are fed to the discriminator. This method is generally called conditional GAN [19] and its effectiveness has been shown in SPSS [15,16]. Figure 1: Outline of multi-task learning with speaker identification.…”
Section: Speaker Information Utilization Via Conditional Ganmentioning
confidence: 99%
“…Toda et al [10] focused on the significant decrease in the synthetic acoustic feature's global variance (GV) and gave a constraint at the parameter generation so as to generate acoustic features with appropriate GV. In contrast, generative adversarial networks (GANs) [13] have attempted to decrease the difference in acoustic features between synthesized and real speech [14,15,16]. They focus on the remarkable difference in acoustic feature distribution between natural and a synthetic speech [17].…”
Section: Introductionmentioning
confidence: 99%
“…Another approach for parallel waveform generation is to use generative adversarial networks (GANs) [12]. A GAN is a powerful generative model that has been successfully used in various research fields such as image generation [13], speech synthesis [14], and singing voice synthesis [15]. GAN-based models have also been proposed for waveform generation [16,17].…”
Section: Introductionmentioning
confidence: 99%