On the MDL principle for i.i.d. sources with large alphabets

Shamir, G.I.

doi:10.1109/tit.2006.872846

Cited by 26 publications

(62 citation statements)

References 22 publications

“…This yields the well known bound, for which the cost of each unknown probability parameter is 0.5 log n bits. Recently, we showed [30], [33] that in the case of large alphabets, the simple grid used to achieve the fixed k bound is not sufficient. In the minimax case, a non-uniform grid with increasing spacing in each dimension was created, and resulted in a cost of 0.5 log(n/k) bits for each unknown probability parameter.…”

Section: Average Case - Background (mentioning)

confidence: 99%
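
The two per-parameter costs quoted above can be compared numerically. The following is a minimal sketch, not taken from the cited papers: the function names are hypothetical, logarithms are assumed to be base 2 (since the costs are stated in bits), and lower-order terms of the actual bounds are ignored.

```python
import math

def fixed_k_cost_bits(n):
    """Classical cost per unknown parameter: 0.5 * log2(n) bits."""
    return 0.5 * math.log2(n)

def large_alphabet_cost_bits(n, k):
    """Quoted large-alphabet minimax cost per parameter: 0.5 * log2(n / k) bits."""
    return 0.5 * math.log2(n / k)

# Illustrative values only (assumed here, not from the source).
n, k = 2**20, 2**10
print(fixed_k_cost_bits(n))            # 0.5 * 20 = 10.0
print(large_alphabet_cost_bits(n, k))  # 0.5 * 10 = 5.0
```

Under these assumed values the per-parameter cost halves, which illustrates why the k-dependence matters once k is no longer negligible relative to n.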

“…In [30], [32]-[33], it was established that for a large known alphabet of size k, choosing a set Ω of M sources θ whose k − 1 free components are placed only at points on a non-uniform grid of increased spacing in each dimension yields a set of distinguishable sources if the grid spacing is properly chosen. The k − 1 components of grid points take values only from the grid vector…”

Section: A Maximin and Minimax Lower Bound (mentioning)

confidence: 99%

“…case (see, e.g., [30]). In particular, we must include A_k in the error event, although we can use the assumption that θ_k ≥ θ_i, for all i; 1 ≤ i ≤ k − 1.…”

Section: Appendix B - Proof Of Lemma 52 (mentioning)

confidence: 99%

“…The upper bounds are obtained by a constructive approach. For small k's it combines Rissanen's approach [24] with our recent approach from [30], [33] and with the more demanding conditions in coding patterns.…”

Section: Introduction (mentioning)

confidence: 99%

Universal Lossless Compression With Unknown Alphabets—The Average Case

Shamir

2006

IEEE Trans. Inform. Theory

Self Cite

“…It was first introduced by Åberg in [8] as a solution to the multi-alphabet coding problem, where the message x contains only a small subset of the known alphabet A. It was further studied and motivated in a series of articles by Shamir [9][10][11][12] and by Jevtić, Orlitsky, Santhanam and Zhang [13][14][15][16] for practical applications: the alphabet is unknown and has to be transmitted separately anyway (for instance, transmission of a text in an unknown language), or the alphabet is very large in comparison to the message (consider the case of images with k = 2^24 colors, or texts when taking words as the alphabet units).…”

Section: Dictionary and Pattern (mentioning)

confidence: 99%

A Lower-Bound for the Maximin Redundancy in Pattern Coding

Garivier

2009

Entropy

We show that the maximin average redundancy in pattern coding is eventually larger than 1.84 for messages of length n. This improves recent results on pattern redundancy, although it does not fill the gap between known lower and upper bounds. The pattern of a string is obtained by replacing each symbol by the index of its first occurrence. The problem of pattern coding is of interest because strongly universal codes have been proved to exist for patterns while universal message coding is impossible for memoryless sources on an infinite alphabet. The proof uses fine combinatorial results on partitions with small summands.
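
The definition of a pattern given in this abstract can be illustrated with a short sketch. It assumes the usual reading in which "index of its first occurrence" means the rank of the symbol among distinct symbols in order of first appearance, starting at 1; the function name `pattern` and the sample string are hypothetical choices for illustration.

```python
def pattern(message):
    """Return the pattern of a sequence: each symbol is replaced by the
    1-based order in which that symbol first appeared."""
    first_seen = {}  # symbol -> index assigned at first occurrence
    result = []
    for symbol in message:
        if symbol not in first_seen:
            first_seen[symbol] = len(first_seen) + 1
        result.append(first_seen[symbol])
    return result

print(pattern("abracadabra"))  # [1, 2, 3, 1, 4, 1, 5, 1, 2, 3, 1]
```

The pattern discards the identity of the symbols and keeps only their repetition structure, so two messages over different alphabets can share one pattern.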

Mask operations in discrete fractional Fourier transform domains with nearly white real valued wide sense stationary output signals

Ling, Yang, et al.

2014

Digital Signal Processing
