Gwo-Hwa Ju scite author profile

In a previous work, we have successfully integrated the transformation-based signal subspace technique with the generalized singular value decomposition (GSVD) algorithm to develop an improved speech enhancement framework [1]. In this paper, we further incorporate the perceptual masking effect of the psychoacoustics model as extra constraints of the previously proposed GSVD-based algorithm to obtain improved sound feature, and furthermore make sure the undesired residual noise to be nearly unperceivable. Both subjective listening tests and spectrogram-plot comparison showed that the closed-form solution developed here can offer significantly better speech quality than either the conventional spectral subtraction algorithm or the previously proposed GSVD-based technique, regardless of whether the additive noise is white or not.

show abstract

Speech enhancement based on generalized singular value decomposition approach

Ju¹,

Lee²

2002

View full text Add to dashboard Cite

A Perceptually Constrained GSVD-Based Approach for Enhancing Speech Corrupted by Colored Noise

Lee

2007

IEEE Trans. Audio Speech Lang. Process.

View full text Add to dashboard Cite

The singular value decomposition (SVD)-based method for single-channel speech enhancement has been shown to be very useful when the additive noise is white. For colored noise, with this approach, one needs to whiten the noise spectrum prior to SVD-based approach and perform the inverse whitening processing afterwards. A truncated quotient SVD (QSVD)-based approach has been proposed to handle this problem and found very useful. In this paper, a generalized SVD (GSVD)-based subspace approach for speech enhancement is first extended from the concept of the truncated QSVD-based approach, in which the dimension of the signal subspace can be precisely and automatically determined for each frame of the noisy signal. But with this new approach some residual noise is still perceivable under lower signal-to-noise ratio conditions. Therefore a perceptually constrained GSVD (PCGSVD)-based approach is further proposed to incorporate the masking properties of human auditory system to make sure the undesired residual noise to be nearly un-perceivable. Closed-form solutions are obtained for both the GSVD-and PCGSVD-based enhancement approaches. Very carefully performed objective evaluations and subjective listening tests show that the PCGSVD-based approach proposed here can offer improved speech quality, intelligibility and recognition accuracy, whether the noise is stationary or nonstationary, especially when the additive noise is nonwhite.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Gwo-Hwa Ju

Liveness detection using frequency entropy of image sequences

Perceptually-constrained generalized singular value decomposition-based approach for enhancing speech corrupted by colored noise

Speech enhancement based on generalized singular value decomposition approach

A Perceptually Constrained GSVD-Based Approach for Enhancing Speech Corrupted by Colored Noise

Contact Info

Product

Resources

About