Proceedings of the Workshop on Human Language Technology - HLT '94 1994
DOI: 10.3115/1075812.1075888
|View full text |Cite
|
Sign up to set email alerts
|

Adaptation to new microphones using tied-mixture normalization

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

1996
1996
2013
2013

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 7 publications
0
3
0
Order By: Relevance
“…Speaker normalization strategies aim at reducing inter-speaker variability and thus reducing the fragmentation of the acoustic models learned by an ASR system. In general, normalization can be achieved by either explicitly removing some acoustic peculiarities of the speaker (e.g., through Cepstral mean removal, Anastasakos et al, 1994, or vocal tract length normalization, Eide and Gish, 1996), by explicitly mapping the speech acoustics of some speakers into the acoustic domain of a reference speaker (e.g., Huang, 1992), or by creating compact models (i.e., models that are robust across inter-speaker variations) that can then be adapted to the different speakers (this is usually referred to as adaptive training strategy, Anastasakos et al, 1996). …”
Section: Introductionmentioning
confidence: 99%
“…Speaker normalization strategies aim at reducing inter-speaker variability and thus reducing the fragmentation of the acoustic models learned by an ASR system. In general, normalization can be achieved by either explicitly removing some acoustic peculiarities of the speaker (e.g., through Cepstral mean removal, Anastasakos et al, 1994, or vocal tract length normalization, Eide and Gish, 1996), by explicitly mapping the speech acoustics of some speakers into the acoustic domain of a reference speaker (e.g., Huang, 1992), or by creating compact models (i.e., models that are robust across inter-speaker variations) that can then be adapted to the different speakers (this is usually referred to as adaptive training strategy, Anastasakos et al, 1996). …”
Section: Introductionmentioning
confidence: 99%
“…In the dry-run system, we made several simplifying assumptions for expediency's sake which should not be construed as desirable for a broadcast news transcription system. For example, we made no attempt to deal with telephone bandwidth data nor did we try unsupervised adaptation on the test, even though our past experience indicates these will enhance performance signicantly [4,5].…”
Section: Dry-run Systemmentioning
confidence: 99%
“…It is well known that subtracting the cepstrum mean from the input is a simple and eective procedure for removing linear dierences between channels [4]. We had unintentionally omitted this step in our dry-run system in which we adapted the seed model, estimated from WSJ acoustics, directly to the Marketplace data for each of the three prior classes.…”
Section: Adaptation To Channelmentioning
confidence: 99%