2014
DOI: 10.1007/978-3-319-07959-2_36
|View full text |Cite
|
Sign up to set email alerts
|

Faster Compressed Suffix Trees for Repetitive Text Collections

Abstract: Abstract. Recent compressed suffix trees targeted to highly repetitive text collections reach excellent compression performance, but operation times in the order of milliseconds. We design a new suffix tree representation for this scenario that still achieves very low space usage, only slightly larger than the best previous one, but supports the operations within microseconds. This puts the data structure in the same performance level of compressed suffix trees designed for standard text collections, which on … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
21
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
6
2

Relationship

5
3

Authors

Journals

citations
Cited by 19 publications
(22 citation statements)
references
References 38 publications
1
21
0
Order By: Relevance
“…An extension of the RLFM-index [80] still needs O(n/s) space to carry out most of the suffix tree operations in time O(s log n). Some variants that are designed for repetitive text collections [1,92] are heuristic and do not offer worst-case guarantees. Only recently a compressed suffix tree was presented [8] that uses O(e) space and carries out operations in O(log n) time.…”
Section: Compressed Suffix Treesmentioning
confidence: 99%
See 1 more Smart Citation
“…An extension of the RLFM-index [80] still needs O(n/s) space to carry out most of the suffix tree operations in time O(s log n). Some variants that are designed for repetitive text collections [1,92] are heuristic and do not offer worst-case guarantees. Only recently a compressed suffix tree was presented [8] that uses O(e) space and carries out operations in O(log n) time.…”
Section: Compressed Suffix Treesmentioning
confidence: 99%
“…The first compressed suffix tree for repetitive collections was built on runs [80], but just like the self-index, it needed Θ(n/s) space to obtain O(s log n) time in key operations like accessing SA. Other compressed suffix trees for repetitive collections appeared later [1,92,29], but they do not offer formal space guarantees (see later). A recent one, instead, uses O(e) words and supports a number of operations in time typically O(log n) [8].…”
Section: A Run-length Compressed Suffix Treementioning
confidence: 99%
“…An extension of the RLFM-index [65] still needs O(n/s) space to carry out most of the suffix tree operations in time O(s log n). Some variants that are designed for repetitive text collections [1,76] are heuristic and do not offer worstcase guarantees. Only recently a compressed suffix tree was presented [5] that uses O(e) space and carries out operations in O(log n) time.…”
Section: Compressed Suffix Treesmentioning
confidence: 99%
“…Components (2) and (3), which are usually less relevant in terms of space, may become dominant if they are represented without exploiting repetitiveness. For (2), we compare GCT, a tree representation aimed at repetitive topologies [27], with a classical representation (FF [1]). For (3), we will use our new repetitionaware sequence representations, comparing them with the alternative proposed in SXSI (MATRIX, using one compressed bitmap per tag) and a WTH representation.…”
Section: Application: Xpath Queries On Highly Repetitive Collectionsmentioning
confidence: 99%