2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) 2021
DOI: 10.1109/asru51503.2021.9687965
|View full text |Cite
|
Sign up to set email alerts
|

Reconstructing Dual Learning for Neural Voice Conversion Using Relatively Few Samples

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1

Relationship

3
5

Authors

Journals

citations
Cited by 10 publications
(3 citation statements)
references
References 28 publications
0
3
0
Order By: Relevance
“…The singing voice synthesis method has experienced the traditional concatenative method based on lyrics to singing alignment [15]- [17], and the synthesis method based on statistical parameters [18], [19], to the present end-to-end deep learning method [1], [20]- [23]. The concatenative based synthesis method has high sound quality but weak generalization ability and phonemes that are not covered in the training set cannot be generated [24].…”
Section: Introductionmentioning
confidence: 99%
“…The singing voice synthesis method has experienced the traditional concatenative method based on lyrics to singing alignment [15]- [17], and the synthesis method based on statistical parameters [18], [19], to the present end-to-end deep learning method [1], [20]- [23]. The concatenative based synthesis method has high sound quality but weak generalization ability and phonemes that are not covered in the training set cannot be generated [24].…”
Section: Introductionmentioning
confidence: 99%
“…Text-to-speech synthesis (TTS) aims to generate intelligible and natural speech from the input text or phoneme sequence [1]- [5]. It has a long history in the TTS research, from the method of concatenative synthesis, and statistical parametric synthesis to the recent method of deep neural TTS.…”
Section: Introductionmentioning
confidence: 99%
“…In addition to the feature representation, most research focus on the classifier. Different classifiers have been tried on singer identification, including SVM, GMM, HMM, and random forest [2], [15]- [17]. With the successful application of deep models in various tasks [5], [18], some studies are using deep models to improve performance on singer identification, such as CRNN [19] which is a state of the art method.…”
Section: Introductionmentioning
confidence: 99%