1999
DOI: 10.21236/ada360974
|View full text |Cite
|
Sign up to set email alerts
|

A Gaussian Prior for Smoothing Maximum Entropy Models

Abstract: In certain contexts, maximum entropy (ME) modeling can be viewed as maximum likelihood training for exponential models, and like other maximum likelihood methods is prone to overfitting of training data. Several smoothing methods for maximum entropy models have been proposed to address this problem, but previous results do not make it clear how these smoothing methods compare with smoothing methods for other types of related models. In this work, we survey previous work in maximum entropy smoothing and compare… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
141
0

Year Published

2005
2005
2013
2013

Publication Types

Select...
6
3
1

Relationship

0
10

Authors

Journals

citations
Cited by 229 publications
(143 citation statements)
references
References 21 publications
2
141
0
Order By: Relevance
“…Then, the benefit of using our framework will be even greater than what is suggested in Section 5.2. The principle of maximum entropy [14] has been successfully applied in different domains, including linguistics [13] [9] and databases [16]. Faloutsos et al apply maximum entropy, in addition to other techniques, for one-dimensional data reconstruction [16].…”
Section: Discussionmentioning
confidence: 99%
“…Then, the benefit of using our framework will be even greater than what is suggested in Section 5.2. The principle of maximum entropy [14] has been successfully applied in different domains, including linguistics [13] [9] and databases [16]. Faloutsos et al apply maximum entropy, in addition to other techniques, for one-dimensional data reconstruction [16].…”
Section: Discussionmentioning
confidence: 99%
“…To perform the convolution on discrete data, a convolution matrix is generated, typically with dimensions 6σ x × 6σ y , eventually leading to a compactly supported kernel. Gaussian filters have been used in language modelling to address data sparseness (Chen and Rosenfeld, 1999). We introduce this compactly supported two-dimensional Gaussian filter over the two-way seriated sparse data.…”
Section: Mapping Evolving Semantic Structuresmentioning
confidence: 99%
“…The optimal model weights are determined by a posteriori estimation using a Gaussian and a Laplace prior [10]. The resulting training criterion is also known as maximum mutual information (MMI) with 2 -and 1 -regularization.…”
Section: Combination and Optimizationmentioning
confidence: 99%