2014
DOI: 10.1186/s13636-014-0038-1
|View full text |Cite
|
Sign up to set email alerts
|

A uniform phase representation for the harmonic model in speech synthesis applications

Abstract: Feature-based vocoders, e.g., STRAIGHT, offer a way to manipulate the perceived characteristics of the speech signal in speech transformation and synthesis. For the harmonic model, which provide excellent perceived quality, features for the amplitude parameters already exist (e.g., Line Spectral Frequencies (LSF), Mel-Frequency Cepstral Coefficients (MFCC)). However, because of the wrapping of the phase parameters, phase features are more difficult to design. To randomize the phase of the harmonic model during… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
61
1

Year Published

2015
2015
2022
2022

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 50 publications
(63 citation statements)
references
References 59 publications
1
61
1
Order By: Relevance
“…4, the clean spectral phase exhibits a low variance with a visible harmonic structure compared to the large variance in the noisy speech. In particular, as reported in [25], at voiced frames, the shape of the glottal pulse changes smoothly hence a low variance of phase is observed.…”
Section: Harmonic Structure In Phase and Motivationsupporting
confidence: 66%
See 2 more Smart Citations
“…4, the clean spectral phase exhibits a low variance with a visible harmonic structure compared to the large variance in the noisy speech. In particular, as reported in [25], at voiced frames, the shape of the glottal pulse changes smoothly hence a low variance of phase is observed.…”
Section: Harmonic Structure In Phase and Motivationsupporting
confidence: 66%
“…The harmonic structure in the clean spectral phase across time or frequency as well as across harmonics at voiced frames captured by the low variance of phase [25], inspires us to propose a time-frequency smoothing filtering approach and apply it at speech harmonics at least for voiced speech segments in order to obtain enhanced phase estimates at harmonics. In this work, we propose a method to smooth the harmonic phase across time and frequency to reduce the variance of the noisy phase at the signal harmonics.…”
Section: Harmonic Structure In Phase and Motivationmentioning
confidence: 99%
See 1 more Smart Citation
“…One of the issues with current day speech synthesizers is generation of excitation source signal, which is mainly manifested in the phase spectrum of the speech signal. Attempts to incorporate phase of speech signals into synthesis systems include usage of complex cepstrum [66], adding instantaneous phase randomness features to HMM based synthesis [67] and compensating phase mismatches in concatenative synthesis [68]. Alternatively, the proposed AP filter, which models the phase spectral characteristics of speech signal, can be used for speech synthesis.…”
Section: Speech Synthesismentioning
confidence: 99%
“…relative phase shift [7], group delay [8], phase dispersion [9], phase distortion [10] and the complex cepstrum [6] for speech synthesis. For example, in [6] and [11], complex cepstra or a cepstrum-like representation calculated from the standard deviation of phase distortion have been modelled, respectively, using an additional independent stream in HMM-based statistical parametric speech synthesis (SPSS) to improve the quality of the vocoded speech.…”
Section: Introductionmentioning
confidence: 99%