2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2012
DOI: 10.1109/icassp.2012.6288821
|View full text |Cite
|
Sign up to set email alerts
|

A novel approach to soft-mask estimation and Log-Spectral enhancement for robust speech recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
4
4
1

Relationship

4
5

Authors

Journals

citations
Cited by 14 publications
(6 citation statements)
references
References 12 publications
0
6
0
Order By: Relevance
“…Recently, van Hout and Alwan propose to estimate a smoothed ratio mask using noise power estimators and a median filter, which they use to perform feature enhancement in the log Mel spectral domain before cepstral transformation [17].…”
Section: Prior Workmentioning
confidence: 99%
“…Recently, van Hout and Alwan propose to estimate a smoothed ratio mask using noise power estimators and a median filter, which they use to perform feature enhancement in the log Mel spectral domain before cepstral transformation [17].…”
Section: Prior Workmentioning
confidence: 99%
“…Finally, the transform gate of our architecture is similar to adaptive soft-masking filtering [13] in speech enhancement. Hence, it is expected that knowledge can be shared between voice conversion and speech enhancement.…”
Section: Discussionmentioning
confidence: 99%
“…The LSEN feature was initially introduced in [25] for enhancement of the mel-spectrum with applications to noiserobust speech recognition. It was adapted in [26] to enhance the gammatone power-normalized spectra of noisy speech obtained with the power-normalized cepstral coefficients (PNCC) features pipeline and renamed LSEN-PNCC.…”
Section: Log-spectrally Enhanced Power Normalized Cepstral Coefficienmentioning
confidence: 99%