1986
DOI: 10.1109/tassp.1986.1164910
|View full text |Cite
|
Sign up to set email alerts
|

Speech analysis/Synthesis based on a sinusoidal representation

Abstract: A sinusoidal model for the speech waveform is used to develop a new analysislsynthesis technique that is characterized by the amplitudes, frequencies, and phases of the component sine waves. These parameters are estimated from the short-time Fourier transform using a simple peak-picking algorithm. Rapid changes in the highly resolved spectral components are tracked using the concept of "birth" and "death" of the underlying sine waves. For a given frequency track a cubic function is used to unwrap and interpola… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

4
607
0
3

Year Published

1999
1999
2010
2010

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 1,183 publications
(614 citation statements)
references
References 8 publications
4
607
0
3
Order By: Relevance
“…The model approximates speech signals as the sum of sinusoidal components with instantaneous amplitudes and frequencies that continuously vary with time [4]. In the present study, we propose a novel analysis method to estimate VTTF and F 0 for nonstationary voiced speech on the basis of sinusoidal representation.…”
Section: Introductionmentioning
confidence: 99%
“…The model approximates speech signals as the sum of sinusoidal components with instantaneous amplitudes and frequencies that continuously vary with time [4]. In the present study, we propose a novel analysis method to estimate VTTF and F 0 for nonstationary voiced speech on the basis of sinusoidal representation.…”
Section: Introductionmentioning
confidence: 99%
“…The stochastic signal is calculated by a complex spectrum envelope of the residual and an inverse STFT. We then add the deterministic component with stochastic one using an overlap add method [4,7] in time domain for each frame to obtain the synthesized whale sound. (Fig.…”
Section: Proposed Methodsmentioning
confidence: 99%
“…In 1986, Robert McAulay and Thomas Quatieri proposed a new method for analysis/synthesis of continuous time speech signals which turned out to be a reconstruction process that provided a close approximation of the original signal [12]. EEG waves represent the combined activity of many neuronal cells which can generate sinusoidal-like oscillatory waves.…”
Section: E Mq Sinusoidal Analysismentioning
confidence: 99%
“…A magnitude condition is also imposed so that contiguous peaks at the same frequency which have large magnitude differences are proposed to belong to different tracks (the partials). The process of matching each frequency in frame t to some frequency in frame t + 1 is given in [12]. Fig.…”
Section: E Mq Sinusoidal Analysismentioning
confidence: 99%
See 1 more Smart Citation