Modeling long distance dependence in language: topic mixtures vs. dynamic cache models

Iyer, Rishabh; Ostendorf, Mari

doi:10.1109/icslp.1996.607085

Cited by 47 publications

(47 citation statements)

References 21 publications

“…Individual data sources will be more appropriate depending on the task, for example, broadcast news or conversational telephone speech. To reduce the mismatch between the interpolated model and the target domain of interest, interpolation weights may be tuned by minimizing the perplexity on some held-out data similar to the target domain (Jelinek and Mercer, 1980; Kneser and Steinbiss, 1993; Iyer et al, 1994; Bahl et al, 1995; Rosenfeld, 1996, 2000; Jelinek, 1997; Clarkson and Robinson, 1997; Kneser and Peters, 1997; Seymore and Rosenfeld, 1997; Iyer and Ostendorf, 1999). These weights indicate the "usefulness" of each source for a particular task.…”

Section: Introduction (mentioning)

confidence: 99%

“…To further improve robustness to varying styles or tasks, unsupervised test-set adaptation, for example, to a particular broadcast show, may be used (Della Pietra et al, 1992; Bulyko et al, 2012, 2007; Federico, 1999, 2003; Gildea and Hofmann, 1999; Chen et al, 2001; Mrva and Woodland, 2004, 2006; Chien et al, 2005; Tam and Schultz, 2005; Liu et al, 2007, 2008, 2009, 2010). As directly adapting n-gram word probabilities is impractical on limited amounts of data, standard adaptation schemes only involve updating one single, context independent interpolation weight for the component models (Iyer et al, 1994; Rosenfeld, 1996; Clarkson and Robinson, 1997; Seymore and Rosenfeld, 1997; Iyer and Ostendorf, 1999; Mrva and Woodland, 2006). …”

Section: Introduction (mentioning)

confidence: 99%
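
The two statements above describe tuning linear interpolation weights by minimizing perplexity on held-out data. The following is a minimal sketch of one common way to do that, using expectation-maximization over a single context-independent weight per component model. The function names, the toy probabilities, and the iteration count are illustrative assumptions and are not taken from the cited papers.

```python
import math

def em_interpolation_weights(component_probs, iterations=50):
    """Estimate one context-independent weight per component model.

    component_probs: one row per held-out word; row[i] is the probability
    that component model i assigns to that word given its history.
    All probabilities are assumed to be strictly positive.
    """
    k = len(component_probs[0])
    weights = [1.0 / k] * k  # start from uniform weights
    for _ in range(iterations):
        expected = [0.0] * k
        for row in component_probs:
            mix = sum(w * p for w, p in zip(weights, row))
            # E-step: posterior responsibility of each component for this word
            for i in range(k):
                expected[i] += weights[i] * row[i] / mix
        # M-step: new weight is the average responsibility
        weights = [e / len(component_probs) for e in expected]
    return weights

def perplexity(component_probs, weights):
    """Perplexity of the interpolated model on the held-out words."""
    log_sum = sum(
        math.log(sum(w * p for w, p in zip(weights, row)))
        for row in component_probs
    )
    return math.exp(-log_sum / len(component_probs))

# Hypothetical held-out data: two component models, four words.
held_out = [[0.20, 0.01], [0.10, 0.02], [0.05, 0.30], [0.30, 0.05]]
uniform = [0.5, 0.5]
tuned = em_interpolation_weights(held_out)
ppl_uniform = perplexity(held_out, uniform)
ppl_tuned = perplexity(held_out, tuned)
```

Because each EM iteration cannot decrease the held-out likelihood, the tuned weights give a perplexity no higher than the uniform starting point on the same data. The resulting weights can be read as the relative "usefulness" of each source for the held-out domain.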

Use of contexts in language model interpolation and adaptation

Liu, Gales, Woodland (2013). Computer Speech & Language

“…There are several references showing effectiveness of monolingual topic-dependent language models [cf. e.g., Iyer and Ostendorf 1999], and our approach may be regarded as similar to the monolingual topic-dependent language model. This motivates us to construct topic-dependent LMs and contrast their performance with our models.…”

Section: Topic-dependent Language Models (mentioning)

confidence: 99%

Lexical triggers and latent semantic analysis for cross-lingual language model adaptation

Kim, Khudanpur (2004). ACM Transactions on Asian Language Information Processing

In-domain texts for estimating statistical language models are not easily found for most languages of the world. We present two techniques to take advantage of in-domain text resources in other languages. First, we extend the notion of lexical triggers, which have been used monolingually for language model adaptation, to the cross-lingual problem, permitting the construction of sharper language models for a target-language document by drawing statistics from related documents in a resource-rich language. Next, we show that cross-lingual latent semantic analysis is similarly capable of extracting useful statistics for language modeling. Neither technique requires explicit translation capabilities between the two languages! We demonstrate significant reductions in both perplexity and word error rate on a Mandarin speech recognition task by using these techniques.

“…We can place this type of modeling within our adaptation framework by viewing the first-pass hypothesis transcription of an article to be another topic adaptation text. We can adapt our … [Footnote 3: This procedure is a crude but quick approximation to maximum entropy training with this feature set. It would be more sound (but vastly more expensive) to set the parameters λ using a true maximum entropy training algorithm.]…”

Section: n-Gram Probabilities (mentioning)

confidence: 99%

“…Numerous efforts have demonstrated large improvements in the measure of perplexity [2,4,9]; however, perplexity has been shown to correlate poorly with speech recognition performance. Several papers have reported modest speech recognition word-error rate (WER) improvements of about 0.5% absolute: Sekine and Grishman [14] add ad hoc topic and cache scores to their language model score in log probability space, and Iyer and Ostendorf [3] … [Footnote: This work was supported by the National Security Agency under grants MDA904-96-1-0113 and MDA904-97-1-0006. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the U.S. government.]…”

Section: Introduction (mentioning)

confidence: 99%

Topic adaptation for language modeling using unnormalized exponential models

Chen, Seymore, Rosenfeld. Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181)

In this paper, we present novel techniques for performing topic adaptation on an n-gram language model. Given training text labeled with topic information, we automatically identify the most relevant topics for new text. We adapt our language model toward these topics using an exponential model, by adjusting probabilities in our model to agree with those found in the topical subset of the training data. For efficiency, we do not normalize the model; that is, we do not require that the "probabilities" in the language model sum to 1. With these techniques, we were able to achieve a modest reduction in speech recognition word-error rate in the Broadcast News domain.
