1993
DOI: 10.1007/978-1-4615-3122-7
|View full text |Cite
|
Sign up to set email alerts
|

Acoustical and Environmental Robustness in Automatic Speech Recognition

Abstract: This dissertation describes a number of algorithms developed to increase the robustness of automatic speech recognition systems with respect to changes in the environment. These algorithms attempt to improve the recognition accuracy of speech recognition systems when they are trained and tested in different acoustical environments, and when a desk-top microphone (rather than a close-talking microphone) is used for speech input. Without such processing, mismatches between training and testing conditions produce… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
174
0
6

Year Published

1999
1999
2012
2012

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 232 publications
(180 citation statements)
references
References 83 publications
(108 reference statements)
0
174
0
6
Order By: Relevance
“…Another stage of bilinear transform can be cascaded with the existing one to accommodate the VTLN warping. It has been shown [10] that the combination of two bilinear transforms with warping factors α1 and α2 is equivalent to a bilinear transform with single warping factor given by:…”
Section: Estimating Warping Parametersmentioning
confidence: 99%
“…Another stage of bilinear transform can be cascaded with the existing one to accommodate the VTLN warping. It has been shown [10] that the combination of two bilinear transforms with warping factors α1 and α2 is equivalent to a bilinear transform with single warping factor given by:…”
Section: Estimating Warping Parametersmentioning
confidence: 99%
“…A derivation of (24) is provided in [29]. To get a flat transfer function, we now apply the inverted weighting function…”
Section: No Warping Warping In Time Domainmentioning
confidence: 99%
“…fŜ R g = argmax k l P (Y S k R l ) (6) where S k = fs 1 k s 2 k : : : s M k g is a particular speech state sequence and R l is a particular noise state sequence.…”
Section: Hidden Markov Model Decompositionmentioning
confidence: 99%
“…These techniques are called feature-based methods and model-based methods, respectively. Cepstral Mean Normalization [8] and Spectral Subtraction [9], and their extensions [6] are examples of feature-based methods.…”
Section: Introductionmentioning
confidence: 99%