2012
DOI: 10.1007/978-3-642-35292-8_4
|View full text |Cite
|
Sign up to set email alerts
|

Improving the Quality of Standard GMM-Based Voice Conversion Systems by Considering Physically Motivated Linear Transformations

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 11 publications
(7 citation statements)
references
References 10 publications
0
7
0
Order By: Relevance
“…A post-filter based on modulation spectrum modification is found useful to address the inherent over-smoothing issue in statistical modeling [122], such as GMM approach, which effectively compensates the global variance. The GMM approach is a parametric solution [123]- [127]. It represents a successful statistical modeling technique that works well with parallel training data.…”
Section: A Gaussian Mixture Modelsmentioning
confidence: 99%
“…A post-filter based on modulation spectrum modification is found useful to address the inherent over-smoothing issue in statistical modeling [122], such as GMM approach, which effectively compensates the global variance. The GMM approach is a parametric solution [123]- [127]. It represents a successful statistical modeling technique that works well with parallel training data.…”
Section: A Gaussian Mixture Modelsmentioning
confidence: 99%
“…Hence, the technique proposed in [16] was used in this work. This approach is based on frequency warping (FW) combined with amplitude scaling (AS), which consists in applying a linear trans-form in the cepstral domain [20]:…”
Section: Gender Conversionmentioning
confidence: 99%
“…For every source-target speaker pair in the database, the FW + AS function was trained as follows. First, A was computed via (2) where W was calculated through the so-called Dynamic FW technique (see [23] for details); then, b was obtained as…”
Section: Fig 2 Parametric Definition Of An Fw Function (Left) and Anmentioning
confidence: 99%
“…In practice, fa, fb and k are set to suitable constant values and α is varied arbitrarily. Given W ( f ), the corresponding matrix A is calculated asA=CWSwhere S is a rectangular matrix that transforms a Mel‐cepstral vector into its corresponding discrete log‐amplitude spectrum, C is its pseudo‐inverse, and W is a sparse square matrix that captures the correspondence between original and warped spectral bins given W ( f ) [23]. As for the AS function, it can be defined using a set of weighted overlapping Hanning‐like bands, similarly as proposed in [21].…”
Section: Speaker De‐identificationmentioning
confidence: 99%