2003
DOI: 10.1049/el:20030480
|View full text |Cite
|
Sign up to set email alerts
|

Adaptive noise estimation algorithm for speech enhancement

Abstract: A fast and robust speech noise estimation technique is proposed. The noisy speech is decomposed using a critical-band-rate filterbank so that a perceptual modification of Wiener filtering can be applied in speech denoising. The subband noise estimate is updated adaptively using a smoothing parameter that depends on the estimated signal-to-noise ratio (SNR). This noise estimation technique can give accurate results even at very low signal-tonoise ratios. Speech denoising using perceptually modified Wiener filte… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
18
0

Year Published

2009
2009
2023
2023

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 42 publications
(18 citation statements)
references
References 7 publications
0
18
0
Order By: Relevance
“…Our MMSE solution is similar, in essence, to what was reported in [7] and [8] and is based on solving an overdetermined system of equations using GMMs of speech and different noise source candidates. In fact the mean vectors of power spectra models of noise and clean speech are formed as follows: ∑ ∑ (6) in which refers to the frequency bins in the FFT domain and varies from 0 to 256 in our case for each noise source candidate whose mixture model is available. As there are as many equations as frequency bins but only unknowns, the MMSE solution, provided by a standard algorithm, returns not only the parameters and (negative values are excluded and the remaining are normalized to attain the same energies of speech and noise signals involved) but also the minimum squared error between the actual and its estimate averaged over all frequencies .…”
Section: Mmse Solution Using Over-determined System Ofmentioning
confidence: 99%
See 1 more Smart Citation
“…Our MMSE solution is similar, in essence, to what was reported in [7] and [8] and is based on solving an overdetermined system of equations using GMMs of speech and different noise source candidates. In fact the mean vectors of power spectra models of noise and clean speech are formed as follows: ∑ ∑ (6) in which refers to the frequency bins in the FFT domain and varies from 0 to 256 in our case for each noise source candidate whose mixture model is available. As there are as many equations as frequency bins but only unknowns, the MMSE solution, provided by a standard algorithm, returns not only the parameters and (negative values are excluded and the remaining are normalized to attain the same energies of speech and noise signals involved) but also the minimum squared error between the actual and its estimate averaged over all frequencies .…”
Section: Mmse Solution Using Over-determined System Ofmentioning
confidence: 99%
“…However, in many practical applications the noise is time-varying and hence leads to sub-optimum results. Several techniques, found in the literature, address this problem [5]- [6]. Most of them concentrate on avoiding explicit speech/non-speech classification and resort to measures of recursively estimating the noise Power Spectral Density (PSD).…”
Section: Introductionmentioning
confidence: 99%
“…The value of VAS parameter obtained from VAD approach is used to calculate the subband noise power. The noise estimation for each subband is computed using the adaptive noise estimation algorithm proposed by Lin et al [24].…”
Section: Proposed Speech Enhancement Algorithmmentioning
confidence: 99%
“…However, in many practical applications the noise is time-varying and hence leads to sub-optimum results. Several techniques, found in literature, address this problem [5]- [6]. Most of them concentrate on avoiding explicit speech/non-speech classification and resort to measures of recursively estimating the noise PSD.…”
Section: Introductionmentioning
confidence: 99%