2011
DOI: 10.4236/jsip.2011.22017
|View full text |Cite
|
Sign up to set email alerts
|

A Study of Bilinear Models in Voice Conversion

Abstract: This paper presents a voice conversion technique based on bilinear models and introduces the concept of contextual modeling. The bilinear approach reformulates the spectral envelope representation from line spectral frequencies feature to a two-factor parameterization corresponding to speaker identity and phonetic information, the so-called style and content factors. This decomposition offers a flexible representation suitable for voice conversion and facilitates the use of efficient training algorithms based … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2012
2012
2017
2017

Publication Types

Select...
2
1
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 21 publications
0
3
0
Order By: Relevance
“…line spectral frequencies to a two-factor parameterization corresponding to speaker identity and phonetic information. The spectral vector y sc , uttered by speaker s and corresponding to the phonetic content class c, is represented as a product of a speaker-dependent matrix A s and a phonetic content vector b c using the asymmetric bilinear model (Popa et al, 2011):…”
Section: Bilinear Modelsmentioning
confidence: 99%
“…line spectral frequencies to a two-factor parameterization corresponding to speaker identity and phonetic information. The spectral vector y sc , uttered by speaker s and corresponding to the phonetic content class c, is represented as a product of a speaker-dependent matrix A s and a phonetic content vector b c using the asymmetric bilinear model (Popa et al, 2011):…”
Section: Bilinear Modelsmentioning
confidence: 99%
“…Content and style factors can be seen as two independent elements which influence the object and determine the observation [15], as mentioned before. The information contributed to recognition is defined as content factor and the negative information is style factor during pattern recognition.…”
Section: Factors Analysismentioning
confidence: 99%
“…Solutions are given through the optimization of both intuitive and pragmatic objectives, with the aim of extracting low-dimensional, simplified, and robust features from the high-dimensional, redundant, disturbed, and distorted original data. Content and style factors can be seen as the two components of an object [15]. During pattern recognition field, the information explored as base of recognition is defined as content factor and the information disturbed recognition is defined as style factor.…”
Section: Introductionmentioning
confidence: 99%