2002
DOI: 10.1016/s0167-6393(01)00039-5
|View full text |Cite
|
Sign up to set email alerts
|

An integrated study of speaker normalisation and HMM adaptation for noise robust speaker-independent speech recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2010
2010
2010
2010

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 14 publications
0
4
0
Order By: Relevance
“…A transformation is used to minimize the mismatch between new utterances and average utterances. This study shows simple and commonly used speaker adaptation techniques (MLLR (Goronzy et al, 2004;Giuliani et al, 2006), CMLLR (Hariharan et al, 2002;Sundermann et al, 2003;Legetter and Woodland, 1995;Shen and Reynolds, 2008) and VTLN) to compensate for speaker-specific differences caused by non-native language influence for isolated words. Table 2 shows the results of the baseline system adapted with speaker adaptation techniques (MLLR, CMLLR and VTLN).…”
Section: Acoustic Model Adaptationmentioning
confidence: 99%
See 1 more Smart Citation
“…A transformation is used to minimize the mismatch between new utterances and average utterances. This study shows simple and commonly used speaker adaptation techniques (MLLR (Goronzy et al, 2004;Giuliani et al, 2006), CMLLR (Hariharan et al, 2002;Sundermann et al, 2003;Legetter and Woodland, 1995;Shen and Reynolds, 2008) and VTLN) to compensate for speaker-specific differences caused by non-native language influence for isolated words. Table 2 shows the results of the baseline system adapted with speaker adaptation techniques (MLLR, CMLLR and VTLN).…”
Section: Acoustic Model Adaptationmentioning
confidence: 99%
“…Perceptual-based evaluation of human raters is not only to simply value non-native utterances as accepted/rejected but also to analyze and locate specific errors on segmental aspects. Further, the acoustic model adaptation is combined with three speaker adaptation techniques Maximum Likelihood Linear Regression (MLLR) as proposed in (Goronzy et al, 2004;Giuliani et al, 2006;Haraty and El Ariss, 2007), Constrained MLLR (CMLLR) and Vocal Track Length ormalization (VTLN) as proposed in (Hariharan et al, 2002;Sundermann et al, 2003;Legetter and Woodland, 1995;Shen and Reynolds, 2008;Al-Haddad et al, 2009;Gales and Young, 2008) in order to eliminate interspeaker variability. Performance of the proposed acoustic model adaptation is evaluated in five measures of alignment analysis between recognition results and perceptual based evaluation: Hit, False Alarm (FA), Miss, Rejection and Hit + Rejection.…”
Section: Introductionmentioning
confidence: 99%
“…Especially when the handling of speech sound field, HMM has gotten thorough research and specific application. Now, it has gotten fairly good identification effect in isolating word identification, joins word identification and continuous speech recognition [2]. HMM has the very strong ability and sequential pattern classification of building mould for development process time sequence, can solve the random uncertain problem and suits the analysis of unsteady repetitive signal that has not good repeatability especially.…”
Section: Hmm Theorymentioning
confidence: 99%
“…HMM, as a dynamic time-series statistical model, has the reliable computing performance and rigorous data structure, and has been widely used for speech recognition [1,2]. While SVM with a solid mathematical theoretical basis is a kind of high-performance method to express the dependence of complex functions in the high-dimensional space.…”
Section: Introductionmentioning
confidence: 99%