2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221)
DOI: 10.1109/icassp.2001.940845
|View full text |Cite
|
Sign up to set email alerts
|

Very fast adaptation with a compact context-dependent eigenvoice model

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
17
0

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 15 publications
(17 citation statements)
references
References 8 publications
0
17
0
Order By: Relevance
“…We call this a Subspace GMM (SGMM). This method has similarities to Joint Factor Analysis as used in speaker recognition (Kenny et al, 2008) and to Eigenvoices (Kuhn et al, 2001) and Cluster Adaptive Training (CAT) (Gales, 2001) as proposed for speech recognition.…”
Section: Introductionmentioning
confidence: 99%
“…We call this a Subspace GMM (SGMM). This method has similarities to Joint Factor Analysis as used in speaker recognition (Kenny et al, 2008) and to Eigenvoices (Kuhn et al, 2001) and Cluster Adaptive Training (CAT) (Gales, 2001) as proposed for speech recognition.…”
Section: Introductionmentioning
confidence: 99%
“…There are 163 speakers (of both (sr) can be computed from Eqn. (9). As a result, the Q(w) function is quadratic and its derivative is linear, and the optimal weights can be found by solving a system of linear equation as expected.…”
Section: A1 Tidigits Corpusmentioning
confidence: 99%
“…Since the number of estimation parameters is greatly reduced, fast speaker adaptation using EV adaptation is possible with a few seconds of speech. The simple algorithm was later extended to work for large-vocabulary continuous speech recognition [9], [10], eigenspace-based MLLR [11], [12], and to approximate the model prior in MAP adaptation [13], [14], [15]. In addition, the eigenspace may be learned automatically by MLES [16], or during model training as in CAT [17].…”
Section: Introductionmentioning
confidence: 99%
“…), and each individual speaker is then a point in the eigenspace. The simple algorithm was later extended to work for large-vocabulary continuous speech recognition [5,6,7]. In practice, a few to a few tens of eigenvoices are found adequate for fast speaker adaptation.…”
Section: Introductionmentioning
confidence: 99%