2011
DOI: 10.1109/tit.2011.2137210
|View full text |Cite
|
Sign up to set email alerts
|

Probability Estimation in the Rare-Events Regime

Abstract: We address the problem of estimating the probability of an observed string that is drawn i.i.d. from an unknown distribution. Motivated by models of natural language, we consider the regime in which the length of the observed string and the size of the underlying alphabet are comparably large. In this regime, the maximum likelihood distribution tends to overestimate the probability of the observed letters, so the Good-Turing probability estimator is typically used instead. We show that when used to estimate th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
21
0

Year Published

2011
2011
2014
2014

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 27 publications
(21 citation statements)
references
References 39 publications
0
21
0
Order By: Relevance
“…In the rare events regime, WVK show in [3] that both M n,r and G n,r satisfy the same Poisson limit, in a strong sense:…”
Section: B Rare Probability Estimationmentioning
confidence: 98%
See 1 more Smart Citation
“…In the rare events regime, WVK show in [3] that both M n,r and G n,r satisfy the same Poisson limit, in a strong sense:…”
Section: B Rare Probability Estimationmentioning
confidence: 98%
“…This is done notably by Wagner, Viswanath, and Kulkarni, whom we refer to as WVK henceforth, in [3]. Let P n be the law of np n (X n,1 ), i.e.…”
Section: A Rare-events Sourcesmentioning
confidence: 99%
“…In the above definition, the constraints in (15) and (16) ensure that v VV ′ is a (potentially improper) distribution histogram, whereas the constraints in (17) are based on computations what the statistics of the profile ϕ ϕ(µ) of the multiplicity vector µ µ(ψ) of a pattern ψ generated by a source with distribution histogram v ′ looks like. The size and the elements of the set Q are chosen such that a feasible vector v ′ can be found efficiently, yet so that one can guarantee nice properties of the solution.…”
Section: Definition 19 (Simplified Vv Estimate)mentioning
confidence: 99%
“…The composite-versus-composite case with growing alphabets is addressed in limited form by Wagner et al [17], who develop a probability estimator for the "rare-events" regime where underlying probabilities are all order Θ(n −1 ) and therefore alphabet size is order Θ(n). Other practical approaches may also be taken, see for example Orlitsky-Santhanam-Zhang (OSZ) [18], [19], support vector machines [20], and techniques from pattern recognition and machine learning [21].…”
Section: B Related Workmentioning
confidence: 99%
“…Let {p n , q n } be a sequence of pairs of distributions and denote by µ 2 n (x, y) the shadow (see [17]), i.e. distribution of the random vector np n (X n ),…”
Section: Appendix B Proofs: Section IVmentioning
confidence: 99%