R. McAulay scite author profile

A sinusoidal model for the speech waveform is used to develop a new analysis/synthesis technique that is characterized by the amplitudes, frequencies, and phases of the component sine waves. These parameters are estimated from the short-time Fourier transform using a simple peak-picking algorithm. Rapid changes in the highly resolved spectral components are tracked using the concept of "birth" and "death" of the underlying sine waves. For a given frequency track, a cubic function is used to unwrap and interpolate the phase such that the phase track is maximally smooth. This phase function is applied to a sine-wave generator, which is amplitude modulated and added to the other sine waves to give the final speech output. The resulting synthetic waveform preserves the general waveform shape and is essentially perceptually indistinguishable from the original speech. Furthermore, in the presence of noise the perceptual characteristics of the speech as well as the noise are maintained. In addition, it was found that the representation was sufficiently general that high-quality reproduction was obtained for a larger class of inputs including: two overlapping, superposed speech waveforms; music waveforms; speech in musical backgrounds; and certain marine biologic sounds. Finally, the analysis/synthesis system forms the basis for new approaches to the problems of speech transformations including time-scale and pitch-scale modification, and midrate speech coding [8], [9].
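Two of the steps named in the abstract, peak picking on a short-time spectrum and cubic phase interpolation along a frequency track, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names `pick_peaks` and `cubic_phase_track`, the Hamming window, the FFT size, and the `max_peaks` limit are all assumptions made for illustration, and the "birth"/"death" track matching is omitted. The cubic coefficients follow the commonly cited form of the maximally smooth interpolation and are not taken from the text on this page.

```python
import numpy as np

def pick_peaks(frame, sample_rate, n_fft=4096, max_peaks=10):
    """Estimate (amplitudes, frequencies in Hz, phases) for one frame.

    Assumed approach: Hamming window, zero-padded FFT, strongest local maxima.
    """
    window = np.hamming(len(frame))
    spectrum = np.fft.rfft(frame * window, n_fft)
    mag = np.abs(spectrum)
    # Interior bins that are local maxima of the magnitude spectrum.
    is_peak = (mag[1:-1] > mag[:-2]) & (mag[1:-1] >= mag[2:])
    bins = np.flatnonzero(is_peak) + 1
    # Keep the strongest peaks, then restore ascending frequency order.
    bins = bins[np.argsort(mag[bins])[::-1][:max_peaks]]
    bins = np.sort(bins)
    amps = 2.0 * mag[bins] / window.sum()   # undo window gain
    freqs = bins * sample_rate / n_fft
    phases = np.angle(spectrum[bins])       # referenced to the frame start
    return amps, freqs, phases

def cubic_phase_track(theta0, omega0, theta1, omega1, T):
    """Cubic phase from frame start to frame end (omega in rad/sample, T in samples).

    The integer M unwraps the end phase so that the track is as smooth as possible.
    """
    d_omega = omega1 - omega0
    M = np.round(((theta0 + omega0 * T - theta1) + d_omega * T / 2) / (2 * np.pi))
    d = theta1 - theta0 - omega0 * T + 2 * np.pi * M
    alpha = 3 * d / T**2 - d_omega / T
    beta = -2 * d / T**3 + d_omega / T**2
    n = np.arange(T + 1)
    return theta0 + omega0 * n + alpha * n**2 + beta * n**3
```

In a full system each track's interpolated phase would drive a sine-wave generator, `amplitude * np.cos(phase)`, and the outputs of all tracks would be summed to form the synthetic frame.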


Speech enhancement using a soft-decision noise suppression filter

McAulay

Malpass

1980

IEEE Trans. Acoust., Speech, Signal Process.

660

269


Shape invariant time-scale and pitch modification of speech

Quatieri

McAulay

1992

IEEE Trans. Signal Process.

164


Barankin Bounds on Parameter Estimation

McAulay

Hofstetter

1971

IEEE Trans. Inform. Theory

122



Speech analysis/Synthesis based on a sinusoidal representation
