IEEE International Conference on Acoustics Speech and Signal Processing 2002
DOI: 10.1109/icassp.2002.5743884
|View full text |Cite
|
Sign up to set email alerts
|

Speech recognizer-based microphone array processing for robust hands-free speech recognition

Abstract: We present a new array processing algorithm for microphone array speech recognition. Conventionally, the goal of array processing is to take distorted signals captured by the array and generate a cleaner output waveform. However, speech recognition systems operate on a set of features derived from the waveform, rather than the waveform itself. The goal of an array processor used in conjunction with a recognition system is to generate a waveform which produces a set of recognition features which maximize the li… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
26
0

Year Published

2004
2004
2012
2012

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 33 publications
(26 citation statements)
references
References 5 publications
0
26
0
Order By: Relevance
“…Unfortunalty, it requires knowing the clean speech which is not avalaible in most practical applications. Further improvements in recognition accuracy can be obtained at lower signal-to-noise ratios by the use of multiple microphones (Silverman et al, 1997) (Seltzer, 2003).…”
Section: Classical Proposed Solutions In Speech Recognition Robustnessmentioning
confidence: 99%
“…Unfortunalty, it requires knowing the clean speech which is not avalaible in most practical applications. Further improvements in recognition accuracy can be obtained at lower signal-to-noise ratios by the use of multiple microphones (Silverman et al, 1997) (Seltzer, 2003).…”
Section: Classical Proposed Solutions In Speech Recognition Robustnessmentioning
confidence: 99%
“…It can be shown [11] that when the HMM state distributions are modeled as mixtures of Gaussians, the gradient expression can be expressed as (13) where represents the a posteriori probability of the th mixture component of state , given . Comparing (11) and (13), it is clear that the gradient expression in the Gaussian mixture case is simply a weighted sum of the gradients of each of the Gaussian components in the mixture, where the weight on each mixture component represents its a posteriori probability of generating the observed feature vector.…”
Section: ) Gaussian State Output Distributionsmentioning
confidence: 99%
“…The full derivation of the Jacobian matrix for log mel spectral or cepstral features can be found in [11].…”
Section: ) Gaussian State Output Distributionsmentioning
confidence: 99%
“…The beamforming algorithms presented in this paper have been studied in great detail and have found to be effective on varied data [8]. Nevertheless, they can only be considered preliminary -they are computationally expensive, and in the case of speaker separation make the rather serious assumption that word sequences uttered by the speakers are known.…”
Section: Discussionmentioning
confidence: 99%