2002
DOI: 10.7146/brics.v9i13.21731
|View full text |Cite
|
Sign up to set email alerts
|

Solving the String Statistics Problem in Time O(n log n)

Abstract: The string statistics problem consists of preprocessing a string of length n such that given a query pattern of length m, the maximum number of non-overlapping occurrences of the query pattern in the string can be reported efficiently. Apostolico and Preparata introduced the minimal augmented suffix tree (MAST) as a data structure for the string statistics problem, and showed how to construct the MAST in time O(n log 2 n) and how it supports queries in time O(m) for constant sized alphabets. A subsequent theor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2002
2002
2020
2020

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(9 citation statements)
references
References 16 publications
0
9
0
Order By: Relevance
“…Squares play a role in an augmentation of the suffix tree suitable to report, for any query pattern, the number of its non-overlapping occurrences. 6,10 There are multiple uses of suffix trees in setting up some kind of signature for text strings, as well as measures of similarity or difference.…”
Section: Fallout Extensions and Challengesmentioning
confidence: 99%
“…Squares play a role in an augmentation of the suffix tree suitable to report, for any query pattern, the number of its non-overlapping occurrences. 6,10 There are multiple uses of suffix trees in setting up some kind of signature for text strings, as well as measures of similarity or difference.…”
Section: Fallout Extensions and Challengesmentioning
confidence: 99%
“…To pick the one with maximum SavedCost, we need the count of non-overlapping occurrences of these substrings. A Minimal Augmented Suffix Tree [5] over IT ∪M can be constructed and used to count the number of non-overlapping occurrences of all right-maximal repeats in overall O(L log L) time, where L is the total length of target strings. Using a regular suffix tree instead, this can be achieved in only O(L) time; but suffix tree may count overlapping occurrences.…”
Section: The Greedy Lexis Algorithmmentioning
confidence: 99%
“…They are labeled with the starting positions of the suffixes of w. We introduce the Cover Suffix Tree of w, denoted by CST (w), as an augmented -new nodes are added -suffix tree in which the nodes are annotated with information relevant to covers. CST (w) is similar to the data structure named Minimal Augmented Suffix Tree (see [3,5]).…”
Section: Augmented and Annotated Suffix Treesmentioning
confidence: 99%