2015
DOI: 10.1016/j.is.2014.06.002
|View full text |Cite
|
Sign up to set email alerts
|

The wavelet matrix: An efficient wavelet tree for large alphabets

Abstract: The wavelet tree is a flexible data structure that permits representing sequences S[1, n] of symbols over an alphabet of size σ, within compressed space and supporting a wide range of operations on S. When σ is significant compared to n, current wavelet tree representations incur in noticeable space or time overheads. In this article we introduce the wavelet matrix, an alternative representation for large alphabets that retains all the properties of wavelet trees but is significantly faster. We also show how t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
75
0

Year Published

2016
2016
2020
2020

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 59 publications
(75 citation statements)
references
References 62 publications
0
75
0
Order By: Relevance
“…Zero-order compression is also obtained, with faster time in practice, by retaining the CM representation but using a tree with Huffman [19] shape instead of a balanced one, which gives n(H 0 (S) + 1)(1 + o(1)) + O(σ log n) bits. The results are called WTH (Huffmanshaped WT) or WMH (Huffman-shaped WM [13]). …”
Section: Sequence Representationsmentioning
confidence: 99%
“…Zero-order compression is also obtained, with faster time in practice, by retaining the CM representation but using a tree with Huffman [19] shape instead of a balanced one, which gives n(H 0 (S) + 1)(1 + o(1)) + O(σ log n) bits. The results are called WTH (Huffmanshaped WT) or WMH (Huffman-shaped WM [13]). …”
Section: Sequence Representationsmentioning
confidence: 99%
“…There is an alternative implementation of the wavelet tree called wavelet matrix (Claude et al, 2015) that was specifically proposed in the literature to account for big alphabets. Given an alphabet its size can be extended to match the next power of two, yielding a complete binary tree for the wavelet tree representation.…”
Section: Big Alphabets and The Wavelet Matrixmentioning
confidence: 99%
“…Existing experimental analyses about wavelet trees focus mostly on compression characteristics (Claude et al, 2015). Moreover, they do not consider the time required to build the structure because from the compression point of view the preprocessing time is not the most relevant parameter.…”
Section: Performance Testsmentioning
confidence: 99%
“…We implemented the kd-tree and the z-order-based KDW-tree in Java. The KDW-tree was built on a practical variant of the wavelet tree called the wavelet matrix [12]. The coordinates that occur in each dataset are sorted in ascending order for each dimension and compressed in sdarray [22], a compressed indexable dictionary, which was used to convert points and query regions from general space to rank space.…”
Section: Settingsmentioning
confidence: 99%