2006
DOI: 10.1016/j.sigpro.2005.06.005
|View full text |Cite
|
Sign up to set email alerts
|

Speech spectral modeling and enhancement based on autoregressive conditional heteroscedasticity models

Abstract: In this paper, we develop and evaluate speech enhancement algorithms, which are based on supergaussian generalized autoregressive conditional heteroscedasticity (GARCH) models in the short-time Fourier transform (STFT) domain. We consider three different statistical models, two fidelity criteria, and two approaches for the estimation of the variances of the STFT coefficients. The statistical model is either Gaussian, Gamma or Laplacian; the fidelity criteria include minimum mean-squared error (MMSE) of the STF… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
37
0

Year Published

2007
2007
2019
2019

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 53 publications
(37 citation statements)
references
References 25 publications
0
37
0
Order By: Relevance
“…The independency assumption in the complex domain is also inconsistent with the data. We think further improvements in speech enhancement performance are still possible by considering more sophisticated pdf models and better spectral variance estimators [15], [31], [32] simultaneously. For certain types of distortions, other methods may be more appropriate.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations
“…The independency assumption in the complex domain is also inconsistent with the data. We think further improvements in speech enhancement performance are still possible by considering more sophisticated pdf models and better spectral variance estimators [15], [31], [32] simultaneously. For certain types of distortions, other methods may be more appropriate.…”
Section: Discussionmentioning
confidence: 99%
“…We see that for small arguments, is approximated well by only a few terms. Substituting (15) into (14) (15) converges and because changing the order of integration and summation as is used in the derivation of (16) is allowed for according to Fubini's theorem [22].…”
Section: ) Change Of Variablementioning
confidence: 99%
See 2 more Smart Citations
“…The a priori SNR, which is the ratio of the speech and noise power, is widely used in speech enhancement algorithms and is typically estimated using the decision-directed approach of Ephraim and Malah [6]. Alternative techniques are based on GARCH models [4] and cepstro-temporal smoothing [3].…”
Section: Prior Workmentioning
confidence: 99%