[Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing 1991
DOI: 10.1109/icassp.1991.150483

Robust speech recognition by normalization of the acoustic space

Cited by 47 publications (22 citation statements)
References 9 publications
“…The selection of warping function is sometimes accomplished by choosing from a set of candidate functions in a fashion that maximizes the likelihood of the observations, and sometimes directly on the basis of speaker-specific speech parameters. In a relatively early study, Acero blindly estimated the optimal frequency-distortion parameter for the bilinear transform to accomplish frequency warping for LPC-derived cepstra (Acero 1993; Acero and Stern 1991). This technique produced 12% decrease in the relative error rate on the CMU speaker-independent alphanumeric census task.…”
Section: Estimation Of Warping Factor (mentioning)
confidence: 98%
“…One of the early attempts to obtain a LT was by Acero et al. (Acero 1990; Acero & Stern 1991). They proposed the use of bilinear warping for achieving variable frequency warping for speaker normalization.…”
Section: Review Of Existing Approaches To Obtain LT (mentioning)
confidence: 99%
“…Motivated by the work of Acero (1990) and Acero & Stern (1991) and based on the observation that frequency warping functions used in most VTLN methods can be approximated to a reasonable degree by the bilinear transform, McDonough et al (1998) suggested the use of conformal maps such as bilinear transform and its generalizations for speaker normalization. Since the unit circle is mapped back onto the unit circle, McDonough refers to these conformal maps as all-pass systems; such systems have uniform frequency response and thus pass signals of all frequencies with neither attenuation nor amplification.…”
Section: Review Of Existing Approaches To Obtain LT (mentioning)
confidence: 99%
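
The all-pass property described in the statement above can be checked numerically. The following is a minimal sketch, not the method of any of the cited papers: it assumes the standard first-order all-pass form A(z) = (z^-1 - alpha) / (1 - alpha z^-1), and the function names and the value alpha = 0.3 are illustrative choices only.

```python
import numpy as np

def allpass_response(omega, alpha):
    """Frequency response of A(z) = (z^-1 - alpha) / (1 - alpha * z^-1) on the unit circle."""
    z_inv = np.exp(-1j * omega)
    return (z_inv - alpha) / (1.0 - alpha * z_inv)

def bilinear_warp(omega, alpha):
    """Warped frequency induced by the bilinear transform (negative of the all-pass phase)."""
    return omega + 2.0 * np.arctan2(alpha * np.sin(omega), 1.0 - alpha * np.cos(omega))

# Normalized frequencies from 0 to pi (radians per sample).
omega = np.linspace(0.0, np.pi, 9)
alpha = 0.3  # illustrative warping parameter (assumption), must satisfy |alpha| < 1

H = allpass_response(omega, alpha)
warped = bilinear_warp(omega, alpha)

# Unit magnitude at every frequency: no attenuation and no amplification.
print(np.abs(H))
# The warped axis still runs from 0 to pi, but is stretched or compressed in between.
print(warped)
```

With a positive alpha the low-frequency region is stretched and the high-frequency region compressed; a negative alpha does the opposite. The endpoints 0 and pi stay fixed in either case.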
“…Accordingly, using different microphones in the training mode and the recognition mode causes performance degradation (32,34). Several methods have been proposed to cope with this problem (35,36).…”
Section: Robust Algorithms (mentioning)
confidence: 99%