2015
DOI: 10.1145/2594408
|View full text |Cite
|
Sign up to set email alerts
|

Locally Compressed Suffix Arrays

Abstract: Compressed text (self-)indexes have matured up to a point where they can replace a text by a data structure that requires less space and, in addition to giving access to arbitrary text passages, support indexed text searches. At this point those indexes are competitive with traditional text indexes (which are very large) for counting the number of occurrences of a pattern in the text. Yet, they are still hundreds to thousands of times slower when it comes to locating those occurrences in the text. In this pape… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
22
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
4
4

Relationship

3
5

Authors

Journals

citations
Cited by 20 publications
(22 citation statements)
references
References 49 publications
0
22
0
Order By: Relevance
“…Abeliuk et al [1], and also Fischer et al [37], claim that the grammar produced by RePair [75] is of size O(r log(n/r)). This is an incorrect result borrowed from González et al [51,52], where it was claimed for DSA. The proof fails for a reason we describe in our technical report [44,Sec.…”
Section: Genericmentioning
confidence: 88%
“…Abeliuk et al [1], and also Fischer et al [37], claim that the grammar produced by RePair [75] is of size O(r log(n/r)). This is an incorrect result borrowed from González et al [51,52], where it was claimed for DSA. The proof fails for a reason we describe in our technical report [44,Sec.…”
Section: Genericmentioning
confidence: 88%
“…Abeliuk et al [1], and also Fischer et al [33], claim that RePair compression [60] reaches size O(r log(n/r)). This is an incorrect result borrowed from González et al [43,44], where it was claimed for DSA. The proof fails for a reason we describe in Appendix A.…”
Section: Exploiting the Runsmentioning
confidence: 88%
“…They [44] also propose another method, whose proof is also subtly incorrect. They follow the cycle of π = LF (or its inverse, Ψ ).…”
Section: A2 Following πmentioning
confidence: 99%
“…Unfortunately, MakCSA [6] cannot (directly) access single SA cells and we were unable to run LCSA [23] in this kind of queries (despite discussing this issue with the LCSA authors). From the comparison with the results presented in [23] we conclude that FBCSA is a few times faster in single cell access than the other related algorithms, MakCSA [6] (augmented with a compressed bitmap from [30] to extract arbitrary ranges of the suffix array) and LCSA [23]. Extracting c consecutive cells is not however an efficient operation for FBCSA (as opposed to MakCSA and LCSA, see ), yet for small ss the time growth is slower than linear, due to a few sampled (and thus written explicitly) SA offsets in a typical block (Fig.…”
Section: Compact and Hash Based Variants Of The Suffix Arraymentioning
confidence: 99%