2013
DOI: 10.1109/tasl.2013.2258013
|View full text |Cite
|
Sign up to set email alerts
|

Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition

Abstract: Abstract-In this contribution we extend a previously proposed BAYESIAN approach for the enhancement of reverberant logarithmic mel power spectral coefficients for robust automatic speech recognition to the additional compensation of background noise. A recently proposed observation model is employed whose time-variant observation error statistics are obtained as a side product of the inference of the a posteriori probability density function of the clean speech feature vectors. Further a reduction of the compu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 23 publications
0
2
0
Order By: Relevance
“…It can be divided into linear filtering, spectrum enhancement, and feature enhancement. The linear filtering dereverberates time-domain signals or STFT coefficients, e.g., [3,4], the spectrum enhancement dereverberates the corrupted power spectra of signal, e.g., [5][6][7], and the feature enhancement dereverberates the corrupted feature vectors, e.g., [8][9][10]. Meanwhile, the back-end-based approaches attempt to modify the acoustic model and/or decoder so that they are suitable for reverberant environment, e.g., [11,12].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…It can be divided into linear filtering, spectrum enhancement, and feature enhancement. The linear filtering dereverberates time-domain signals or STFT coefficients, e.g., [3,4], the spectrum enhancement dereverberates the corrupted power spectra of signal, e.g., [5][6][7], and the feature enhancement dereverberates the corrupted feature vectors, e.g., [8][9][10]. Meanwhile, the back-end-based approaches attempt to modify the acoustic model and/or decoder so that they are suitable for reverberant environment, e.g., [11,12].…”
Section: Introductionmentioning
confidence: 99%
“…Several single-channel feature enhancement methods have been proposed. Some of them do not need stereo data at all, e.g., cepstral mean normalization (CMN) [16,17], long-term feature normalization [18], vector Taylor series (VTS) [19], particle filter [8,20], and extended Kalman filter [9,21]. Meanwhile, some of them assume that stereo training data can be acquired.…”
Section: Introductionmentioning
confidence: 99%