2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07 2007
DOI: 10.1109/icassp.2007.367303
|View full text |Cite
|
Sign up to set email alerts
|

One-to-Many and Many-to-One Voice Conversion Based on Eigenvoices

Abstract: This paper describes two flexible frameworks of voice conversion (VC), i.e., one-to-many VC and many-to-one VC. One-to-many VC realizes the conversion from a user's voice as a source to arbitrary target speakers' ones and many-to-one VC realizes the conversion vice versa. We apply eigenvoice conversion (EVC) to both VC frameworks. Using multiple parallel data sets consisting of utterancepairs of the user and multiple pre-stored speakers, an eigenvoice Gaussian mixture model (EV-GMM) is trained in advance. Unsu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
47
0

Year Published

2010
2010
2023
2023

Publication Types

Select...
6
2

Relationship

2
6

Authors

Journals

citations
Cited by 72 publications
(47 citation statements)
references
References 8 publications
0
47
0
Order By: Relevance
“…We also describe one-to-many EVC [5] as a technique for flexibly controlling voice quality of the converted speech. This method consists of a training process, an adaptation process, and a conversion process.…”
Section: One-to-many Eigenvoice Conversion (Evc)mentioning
confidence: 99%
See 2 more Smart Citations
“…We also describe one-to-many EVC [5] as a technique for flexibly controlling voice quality of the converted speech. This method consists of a training process, an adaptation process, and a conversion process.…”
Section: One-to-many Eigenvoice Conversion (Evc)mentioning
confidence: 99%
“…On the other hand, the EV-GMM for the aperiodic component estimation is trained using multiple joint feature vector sets consisting of the spectral segment features of the laryngectomee and the aperiodic components of the pre-stored non-laryngectomees. In this paper, we perform the PCA-based training method for the EV-GMM [5].…”
Section: Es-to-speech Based On One-to-many Evcmentioning
confidence: 99%
See 1 more Smart Citation
“…Following the same idea, eigenvoice-based conversion [11], and tensor representation of speaker space [12] are examples of similar successful attempts in voice conversion. However, these methods all require a Z. Wu and E. S. Chng are with the School of Computer Engineering, Nanyang Technological University, Singapore 639798, and also with Temasek Lab@NTU, Nanyang Technological University, Singapore.…”
Section: Introductionmentioning
confidence: 99%
“…The trained conversion model makes it possible to convert the acoustic features of the source singer's singing voice into those of the target singer's singing voice in any song, keeping the linguistic information of the lyrics unchanged. Furthermore, to develop a more flexible SVC system, eigenvoice conversion (EVC) techniques [6] have been applied to SVC [7]. In an SVC system based on many-to-many EVC [8], which is one particular variety of EVC, an initial conversion model called the canonical eigenvoice GMM (EV-GMM) is trained in advance using multiple parallel data sets including song pairs of a single reference singer and many other singers.…”
Section: Introductionmentioning
confidence: 99%