IEEE International Conference on Acoustics Speech and Signal Processing 1993
DOI: 10.1109/icassp.1993.319371
|View full text |Cite
|
Sign up to set email alerts
|

Rapid speaker adaptation using speaker-mixture allophone models applied to speaker-independent speech recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

1996
1996
1998
1998

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(4 citation statements)
references
References 8 publications
0
4
0
Order By: Relevance
“…We have already proposed a speaker-mixture method (Kosaka et al, 1993) that can yield highly accurate speaker-independent phone models. The same method is also used for recognition in this paper.…”
Section: Recognition Using a Speaker-mixture Modelmentioning
confidence: 99%
See 1 more Smart Citation
“…We have already proposed a speaker-mixture method (Kosaka et al, 1993) that can yield highly accurate speaker-independent phone models. The same method is also used for recognition in this paper.…”
Section: Recognition Using a Speaker-mixture Modelmentioning
confidence: 99%
“…HMnets belonging to the selected node are merged to form a K-mixture HMnet by using a speaker-mixture method (Kosaka, Takami & Sagayama, 1993). This HMnet is used for recognition.…”
Section: Recognition Modementioning
confidence: 99%
“…In contrast to the above schemes, the adaptation scheme described here is based on the fact that the training data contains a number of training speakers, some of whom are closer, acoustically, to the test speaker, than the others [10]. 1 If the model parameters are reestimated from the subset of training speakers who are acoustically close to the test speaker, they should be reasonably close to the speakerdependent parameters that would be obtained by training on large amounts of data from the test speaker (if such data were available) [13], [14]. 2 1 Some similar ideas have recently been reported in [11] and [12].…”
Section: B Review Of Adaptation Techniquesmentioning
confidence: 99%
“…The SSS-LR achieved good performance in the 1, OOOword recognition experiments, probably due to the high accuracy of the m e t -b a s e d representation. Speaker-independent experiments based on the speaker-mixture HMnet [15] have also been tried. Each mixture component is derived from a particular speaker, and training speech data are used to determine the speaker-mixing weights.…”
Section: Discussionmentioning
confidence: 99%