2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) 2019
DOI: 10.1109/apsipaasc47483.2019.9023162
|View full text |Cite
|
Sign up to set email alerts
|

SINGAN: Singing Voice Conversion with Generative Adversarial Networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
24
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
4
2

Relationship

0
10

Authors

Journals

citations
Cited by 38 publications
(24 citation statements)
references
References 31 publications
0
24
0
Order By: Relevance
“…These approaches model parallel training samples using statistical methods, such as Gaussian mixture model (GMM)-based many-tomany eigenvoice conversion [11], direct waveform modification based on spectrum difference [12,13]. GAN-based parallel approach has also been proposed to improve conversion performance [14]. Since parallel SVC approaches require parallel data, which is expensive to collect, for the training process, researchers have investigated many non-parallel SVC approaches.…”
Section: Singing Voice Conversionmentioning
confidence: 99%
“…These approaches model parallel training samples using statistical methods, such as Gaussian mixture model (GMM)-based many-tomany eigenvoice conversion [11], direct waveform modification based on spectrum difference [12,13]. GAN-based parallel approach has also been proposed to improve conversion performance [14]. Since parallel SVC approaches require parallel data, which is expensive to collect, for the training process, researchers have investigated many non-parallel SVC approaches.…”
Section: Singing Voice Conversionmentioning
confidence: 99%
“…The early studies for singing voice conversion generally follow the statistical generation architectures [1,2], which often use Gaussian mixture model (GMM) with parallel singing data. [3] updates the conversion framework with Generative Adversarial Network (GAN). However, it still requires the source and the target speakers to sing the same songs during the training phase.…”
Section: Introductionmentioning
confidence: 99%
“…statistical methods, such as Gaussian mixture model (GMM) based many-to-many eigenvoice conversion [1], direct waveform modification based on spectrum difference [2,3]. Artificial neural network (ANN) based approaches are also proposed to improve conversion performance [4,5].…”
Section: Introductionmentioning
confidence: 99%