2015
DOI: 10.1109/tit.2015.2452252
Efficient and Compact Representations of Prefix Codes

Abstract: Most of the attention in statistical compression is given to the space used by the compressed sequence, a problem completely solved with optimal prefix codes. However, in many applications, the storage space used to represent the prefix code itself can be an issue. In this paper we introduce and compare several techniques to store prefix codes. Let N be the sequence length and n be the alphabet size. Then a naive storage of an optimal prefix code uses O(n log n) bits. Our first technique shows how to use O(n l…

Cited by 10 publications (9 citation statements)
References 55 publications
“…Some technicalities about possible coders and decoders for P-codes can be further found in [18]. Information on effective decoding algorithms can be found in [19, 20] and on memory-efficient representation of prefix codes can be found in [21].…”
Section: Discussion
confidence: 99%
“…We have run experiments to compare the solution of Theorem 2 (referred to as WMM in the sequel, for Wavelet Matrix Model) with the only previous encoding, that is, the one used by Claude et al. [1] (denoted by TABLE). Note that our codes are not canonical, so other solutions [5] do not apply. Claude et al. [1] use for encoding a single table of σL bits storing the code of each symbol, and thus they easily encode in constant time.…”
Section: Methods
confidence: 99%
“…If the alphabet consists of σ characters and the maximum codeword length is L, then we can build an O(σ log L)-bit data structure with O(log L) query time that, given a character, returns its codeword's length and rank among codewords of that length, or vice versa. If L is at most a constant times the size of a machine word (which it is when we are considering, e.g., Huffman codes for strings in the RAM model) then in theory we can make the predecessor search and the data structure's queries constant-time, meaning we can encode and decode in constant time [5].…”
Section: Introduction
confidence: 99%
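The (length, rank) queries described in the quote above are exactly what makes canonical codes compact: from the number of codewords of each length, the numeric value of the first codeword of every length can be recomputed, and a symbol's codeword is then simply first_code[length] + rank. A minimal sketch of this canonical-code rule, with an illustrative length distribution:

```python
def first_codes(count_by_len):
    """first[l] = numeric value of the smallest codeword of length l,
    derived from the number of codewords of each length (canonical rule)."""
    first = [0] * len(count_by_len)
    code = 0
    for l in range(1, len(count_by_len)):
        code = (code + count_by_len[l - 1]) << 1
        first[l] = code
    return first

def encode(length, rank, first):
    """Codeword of the rank-th symbol among those of the given length."""
    return format(first[length] + rank, f"0{length}b")
```

With `count_by_len = [0, 1, 2]` (one codeword of length 1, two of length 2), `first_codes` gives `first[1] = 0` and `first[2] = 2`, so the three codewords are "0", "10", and "11": a prefix-free canonical code recovered from lengths alone.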
“…The same idea is presented perhaps more lucidly in [25, Section 2.6.3], with an explicit claim that this representation requires σ lg σ + O(lg² n) bits and achieves O(lg lg n) time per codeword. Given that ℓ_max is the maximum length of the codewords, an improvement, both in space and time, has been achieved by Gagie et al. [13], who gave a representation of the canonical Huffman tree within…”
Section: Introduction
confidence: 99%