Language model adaptation through topic decomposition and MDI estimation

Federico,

doi:10.1109/icassp.2002.1005854

Cited by 12 publications

(10 citation statements)

References 5 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In many works [5,7,10], α(w) is exponentially smoothed by a coefficient lower than 1, optimized on heldout data. However, in our experiments, we chose to use (10) as it is, since this paper does not seek to perfectly tune a LM adaptation but rather aims at better understanding mechanisms that are useful for topic adaptation.…”

Section: Minimum Discriminant Information Language Model Adaptationmentioning

confidence: 99%

“…where n is an empirically set parameter. As it can be shown from (5) any topic-specific word reduces to 1, i.e., their probability is directly reported from the baseline LM except the normalization factor. Figure 1 presents word error rate (WER) and perplexity variations measured on our development set using either topic terminologies of different sizes or using the whole vocabulary.…”

Section: Feature Selectionmentioning

confidence: 99%

“…In many topic adaptation works using MDI, constraints are derived from n-gram probabilities trained on topic-specific corpora. However, since these corpora are rather small to estimate reliable statistics, unigram probabilities are frequently used as the sole information source [5]. To circumvent this problem, [6] proposes to also consider reliable higher order n-grams by computing confidence intervals from which inequality constraints are derived.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Constraint selection for topic-based MDI adaptation of language models

Lecorvé¹,

Gravier²,

Sébillot³

2009

Interspeech 2009

View full text Add to dashboard Cite

Section: Minimum Discriminant Information Language Model Adaptationmentioning

confidence: 99%

Section: Feature Selectionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Constraint selection for topic-based MDI adaptation of language models

Lecorvé¹,

Gravier²,

Sébillot³

2009

Interspeech 2009

View full text Add to dashboard Cite

“…For example, in [3], it is used for efficiently calculating the relative entropy when n-gram parameter is pruned. In [4], it is used to calculate the normalization parameters for MDI estimation. In this paper, it is used for efficient LMLA probabilities generation.…”

Section: The Data Sparseness Of N-gram Modelmentioning

confidence: 99%

Efficient language model look-ahead probabilities generation using lower order LM look-ahead information

Chen

Chin

2008

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

View full text Add to dashboard Cite

In this paper, an efficient method for language model lookahead probability generation is presented. Traditional methods generate language model look-ahead (LMLA) probabilities for each node in the LMLA tree recursively in a bottom to up manner. The new method presented in this paper makes use of the sparseness of the n-gram model and starts the process of generating an n-gram LMLA tree from a backoff LMLA tree. Only a small number of nodes are updated with explicitly estimated LM probabilities. This speeds up the bigram and trigram LMLA tree generation by a factor of 3 and 12 respectively.

show abstract

“…One of the focuses of future work is integrating fast marginal adaptation directly into the decoder. An efficient implementation has been described in [20]. Also, we wish to replace current manual sentence and story segmentation with an automatic segmentation system.…”

Section: Discussionmentioning

confidence: 99%

Comparison of Different Modeling Units for Language Model Adaptation for Inflected Languages

Alumäe

Computational Linguistics and Intelligent Text Processing

View full text Add to dashboard Cite

This paper presents a language model adaptation framework for highly inflected languages that use sub-word units as basic units in a language model for large vocabulary speech recognition. The proposed adaptation method uses latent semantic analysis based information retrieval to find documents similar to a tiny adaptation corpus. The approach enables to use different language units for modeling document similarity. The method is tested on an Estonian broadcast news transcription task. We compare words, lemmas and morphemes as basic units for similarity modeling. We observe a drop in speech recognition error rate after building adapted language model for each news story. Morpheme-based adaptation is found to give significantly larger improvement than word and lemma-based adaptation.

show abstract

Language model adaptation through topic decomposition and MDI estimation

Cited by 12 publications

References 5 publications

Constraint selection for topic-based MDI adaptation of language models

Constraint selection for topic-based MDI adaptation of language models

Efficient language model look-ahead probabilities generation using lower order LM look-ahead information

Comparison of Different Modeling Units for Language Model Adaptation for Inflected Languages

Contact Info

Product

Resources

About