Speech analysis/Synthesis based on a sinusoidal representation

McAulay, R.; Quatieri, Thomas F.

doi:10.1109/tassp.1986.1164910

Cited by 1,183 publications

(614 citation statements)

References 8 publications

Supporting

Mentioning

607

Contrasting

Unclassified

Order By: Relevance

“…The model approximates speech signals as the sum of sinusoidal components with instantaneous amplitudes and frequencies that continuously vary with time [4]. In the present study, we propose a novel analysis method to estimate VTTF and F 0 for nonstationary voiced speech on the basis of sinusoidal representation.…”

Section: Introductionmentioning

confidence: 99%

Source-filter separation for nonstationary voiced speech based on sinusoidal representation

Ito

Ohara

Ito

et al. 2010

Acoust. Sci. & Tech.

View full text Add to dashboard Cite

Section: Introductionmentioning

confidence: 99%

Source-filter separation for nonstationary voiced speech based on sinusoidal representation

Ito

Ohara

Ito

et al. 2010

Acoust. Sci. & Tech.

View full text Add to dashboard Cite

“…The stochastic signal is calculated by a complex spectrum envelope of the residual and an inverse STFT. We then add the deterministic component with stochastic one using an overlap add method [4,7] in time domain for each frame to obtain the synthesized whale sound. (Fig.…”

Section: Proposed Methodsmentioning

confidence: 99%

Baleen Whale Sound Synthesis using a Modified Spectral Modeling

Jun¹,

Dhar²,

Kim³

et al. 2010

The KIPS Transactions:PartB

View full text Add to dashboard Cite

Spectral modeling synthesis (SMS) has been used as a powerful tool for musical sound modeling. This technique considers a sound as a combination of a deterministic plus a stochastic component. The deterministic component is represented by the series of sinusoids that are described by amplitude, frequency, and phase functions and the stochastic component is represented by a series of magnitude spectrum envelopes that functions as a time varying filter excited by white noise. These representations make it possible for a synthesized sound to attain all the perceptual characteristics of the original sound. However, sometimes considerable phase variations occur in the deterministic component by using the conventional SMS for the complex sound such as whale sounds when the partial frequencies in successive frames differ. This is because it utilizes the calculated phase to synthesize deterministic component of the sound. As a result, it does not provide a good spectrum matching between original and synthesized spectrum in higher frequency region. To overcome this problem, we propose a modified SMS that provides good spectrum matching of original and synthesized sound by calculating complex residual spectrum in frequency domain and utilizing original phase information to synthesize the deterministic component of the sound. Analysis and simulation results for synthesizing whale sounds suggest that the proposed method is comparable to the conventional SMS in both time and frequency domain. However, the proposed method outperforms the SMS in better spectrum matching.

show abstract

“…In 1986, Robert McAulay and Thomas Quatieri proposed a new method for analysis/synthesis of continuous time speech signals which turned out to be a reconstruction process that provided a close approximation of the original signal [12]. EEG waves represent the combined activity of many neuronal cells which can generate sinusoidal-like oscillatory waves.…”

Section: E Mq Sinusoidal Analysismentioning

confidence: 99%

“…A magnitude condition is also imposed so that contiguous peaks at the same frequency which have large magnitude differences are proposed to belong to different tracks (the partials). The process of matching each frequency in frame t to some frequency in frame t + 1 is given in [12]. Fig.…”

Section: E Mq Sinusoidal Analysismentioning

confidence: 99%

See 1 more Smart Citation

New approach in features extraction for EEG signal detection

Guerrero-Mosquera

Vázquez

2009

2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society

View full text Add to dashboard Cite

Abstract-This paper describes a new approach in features extraction using time-frequency distributions (TFDs) for detecting epileptic seizures to identify abnormalities in electroencephalogram (EEG). Particularly, the method extracts features using the Smoothed Pseudo Wigner-Ville distribution combined with the McAulay-Quatieri sinusoidal model and identifies abnormal neural discharges. We propose a new feature based on the length of the track that, combined with energy and frequency features, allows to isolate a continuous energy trace from another oscillations when an epileptic seizure is beginning. We evaluate our approach using data consisting of 16 different seizures from 6 epileptic patients. The results show that our extraction method is a suitable approach for automatic seizure detection, and opens the possibility of formulating new criteria to detect and analyze abnormal EEGs.

show abstract

Speech analysis/Synthesis based on a sinusoidal representation

Cited by 1,183 publications

References 8 publications

Source-filter separation for nonstationary voiced speech based on sinusoidal representation

Source-filter separation for nonstationary voiced speech based on sinusoidal representation

Baleen Whale Sound Synthesis using a Modified Spectral Modeling

New approach in features extraction for EEG signal detection

Contact Info

Product

Resources

About