6th International Symposium on String Processing and Information Retrieval. 5th International Workshop on Groupware (Cat. No.PR
DOI: 10.1109/spire.1999.796581

An efficient method for in memory construction of suffix arrays

Abstract: The suffix array is a string-indexing structure and a memory efficient alternative to the suffix tree. It has many advantages for text processing. Here we propose an efficient algorithm for sorting suffixes. We call this algorithm the two-stage suffix sort. One of our ideas is to exploit the specific relationships between adjacent suffixes. Our algorithm makes it possible to use the suffix array for much larger texts and suggests new areas of application. Our experiments on several text data sets including 514-MB Japanese …
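To make the structure named in the abstract concrete, here is a minimal sketch of what a suffix array contains. It builds the array by directly sorting suffix start positions, which is a naive baseline and not the paper's two-stage method; the function name `naive_suffix_array` is hypothetical and chosen only for illustration.

```python
def naive_suffix_array(text):
    """Return the start positions of all suffixes of `text`, in lexicographic order.

    Naive baseline: each comparison may scan a whole suffix, so this is
    only suitable for small inputs.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


if __name__ == "__main__":
    # Suffixes of "banana" in sorted order: a, ana, anana, banana, na, nana
    print(naive_suffix_array("banana"))  # [5, 3, 1, 0, 4, 2]
```

The array stores only integer positions, which is why it needs less memory than a suffix tree over the same text.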

Cited by 44 publications
(44 citation statements)
References 10 publications
“…• DivSufSort by Yuta Mori is an optimized implementation of the two-stage algorithm [IT99]. DivSufSort is one of the fastest SACA for byte alphabets.…”
Section: Notable Implementations (mentioning)
confidence: 99%
“…Using parallelRange we can parallelize the DivSufSort implementation by Mori of the two-stage algorithm [IT99]. In this section we will first give an overview over our parallel DivSufSort algorithm.…”
Section: Parallel Divsufsort (mentioning)
confidence: 99%
“…Induced copying SACAs are currently the fastest CPU implementations and we compare our results with the best one, libdivsufsort. To illustrate this approach, we describe the method of Itoh and Tanaka [25], two stage induced copying.…”
Section: Induced Copying (mentioning)
confidence: 99%
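The two-stage induced-copying idea mentioned in this statement can be sketched roughly as follows. This is a simplified illustration under stated assumptions, not the authors' implementation and not libdivsufsort: the suffix at position i is labeled type A here when its first character is strictly greater than the next one (the last suffix is treated as type A, as if followed by a sentinel smaller than every character), and type B otherwise. Stage 1 sorts only the type B suffixes, using Python's built-in sort as a stand-in for a proper string sort; stage 2 derives the order of the type A suffixes in one left-to-right scan. The function name `two_stage_suffix_array` is hypothetical.

```python
from collections import Counter


def two_stage_suffix_array(text):
    """Build a suffix array by sorting type B suffixes and inducing type A ones."""
    n = len(text)
    if n == 0:
        return []

    # Type A: first character strictly greater than the next one.
    # The last suffix is treated as type A (implicit smallest sentinel).
    is_a = [i == n - 1 or text[i] > text[i + 1] for i in range(n)]

    # Bucket boundaries by first character: [head[c], tail[c]).
    counts = Counter(text)
    head, tail, pos = {}, {}, 0
    for c in sorted(counts):
        head[c] = pos
        pos += counts[c]
        tail[c] = pos

    sa = [-1] * n

    # Stage 1: sort the type B suffixes (naively, by slice comparison)
    # and place them at the tail of their buckets, largest first.
    type_b = sorted((i for i in range(n) if not is_a[i]), key=lambda i: text[i:])
    for i in reversed(type_b):
        tail[text[i]] -= 1
        sa[tail[text[i]]] = i

    # The last suffix is the smallest in its bucket, so it seeds the bucket head.
    last = n - 1
    sa[head[text[last]]] = last
    head[text[last]] += 1

    # Stage 2: scan left to right; whenever the suffix just before the
    # current one is type A, append it to the head region of its bucket.
    for j in range(n):
        i = sa[j]
        if i > 0 and is_a[i - 1]:
            c = text[i - 1]
            sa[head[c]] = i - 1
            head[c] += 1
    return sa


if __name__ == "__main__":
    print(two_stage_suffix_array("banana"))
```

Within a bucket, every type A suffix precedes every type B suffix, which is why the scan can fill the head of each bucket without comparing strings. The naive stage 1 sort keeps the sketch short; a practical implementation would replace it with a dedicated string sorting routine.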
“…In 2003, three papers were published [58,75,26] that describe a worst-case linear time construction of a suffix array without the need of an initial construction of the respective suffix tree. Other algorithms for constructing suffix arrays can be found in [17,36,47,55,68]. Moreover, a recent line of research concerns compressed suffix arrays [46,38,39,35].…”
Section: Lcp(t S a T [L]···n T S A T [M]···n ) And Lcp(t S A T [M]· (mentioning)
confidence: 99%