2012
DOI: 10.1186/1687-4722-2012-29
|View full text |Cite
|
Sign up to set email alerts
|

A perceptual masking approach for noise robust speech recognition

Abstract: This article describes a modified technique for enhancing noisy speech to improve automatic speech recognition (ASR) performance. The proposed approach improves the widely used spectral subtraction which inherently suffers from the associated musical noise effects. Through a psychoacoustic masking and critical band variance normalization technique, the artifacts produced by spectral subtraction are minimized for improving the ASR accuracy. The popular advanced ETSI-2 front end is tested for comparison purposes… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
12
0

Year Published

2013
2013
2022
2022

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(13 citation statements)
references
References 10 publications
1
12
0
Order By: Relevance
“…Type 4: II requires three HEQ operations, the same as S-HEQ, demonstrating that S-HEQ and WS-HEQ (1) II are similar in computational complexity.…”
Section: Proposed Approach: Ws-heqmentioning
confidence: 96%
See 2 more Smart Citations
“…Type 4: II requires three HEQ operations, the same as S-HEQ, demonstrating that S-HEQ and WS-HEQ (1) II are similar in computational complexity.…”
Section: Proposed Approach: Ws-heqmentioning
confidence: 96%
“…Equations 12 to 15, by assigning α as less than 1.0, WS-HEQ (1) , which requires three HEQ operations, displays the best behavior, regardless of the selected structure. However, the two types that require only two HEQ operations (i.e., WS-HEQ (2) and …”
Section: Among the Four Types Of Ws-heq Listed Inmentioning
confidence: 99%
See 1 more Smart Citation
“…To reduce the effect of any present tones which caused by increased variance at random frequencies, M anti and Matassoni [19] performed variance normalizat across the critical bands for spectral subtraction speech hancement algorithm. The variance is computed as in (…”
Section: Varinace Normalizationmentioning
confidence: 99%
“…Lu [18], employed an optimal smoothing factor, adapted by the variation of signal to spectral deviation ratio in successive frames. Variance normalization was used in spectral subtraction speech enhancement algorithm by Maganti and Matassoni [19] across the critical bands to smoothen the output signal, removing the spikes in the output which reduced the effect of increased variance at random frequencies.…”
Section: Introductionmentioning
confidence: 99%