Proceedings of the Thirty-Sixth Annual ACM Symposium on Theory of Computing 2004
DOI: 10.1145/1007352.1007374

Dictionary matching and indexing with errors and don't cares

Abstract: This paper considers various flavors of the following online problem: preprocess a text or collection of strings, so that given a query string p, all matches of p with the text can be reported quickly. In this paper we consider matches in which a bounded number of mismatches are allowed, or in which a bounded number of "don't care" characters are allowed. The specific problems we look at are: indexing, in which there is a single text t, and we seek locations where p matches a substring of t; dictionary queries, …
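To make the indexing problem in the abstract concrete, the sketch below reports every position where p matches a substring of t with at most k mismatches. It is a brute-force baseline with no preprocessing, not the data structure from the paper; the function name `k_mismatch_occurrences` and its parameters are illustrative assumptions.

```python
def k_mismatch_occurrences(t: str, p: str, k: int) -> list[int]:
    """Return start positions i where t[i:i+len(p)] differs from p
    in at most k positions (Hamming distance <= k).

    Brute-force baseline: O(len(t) * len(p)) time, no index is built.
    This is NOT the preprocessing-based method of the paper.
    """
    m = len(p)
    hits = []
    for i in range(len(t) - m + 1):
        mismatches = 0
        for a, b in zip(t[i:i + m], p):
            if a != b:
                mismatches += 1
                if mismatches > k:
                    break  # too many mismatches, give up on this window
        else:
            # Loop finished without break: at most k mismatches here.
            hits.append(i)
    return hits


if __name__ == "__main__":
    print(k_mismatch_occurrences("abcabd", "abd", 1))
```

An index in the sense of the abstract would answer the same query after preprocessing t once, so that each query costs far less than a full scan of the text.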

Citations: cited by 224 publications (250 citation statements)
References: 42 publications
“…The improvement can be also embedded in the extensions. Searching Substrings Internally in the Suffix Tree Cole et al [7], when proposing data structures for indexing a text w[1..n] with k mismatches, k errors and k wildcards, suggested the LCP data structure. The LCP data structure comes in two variants, rooted LCP and unrooted LCP.…”
Section: arXiv:1406.7716v1 [cs.DS] 30 Jun 2014
Citation type: mentioning
confidence: 99%
“…can be supported for a pattern p plus a time complexity equal to the size of the output. Using techniques presented in [22], the structure can be modified to solve the problem in O(nm log(nm) + n (c_1 log n)^{k+1} / k!) preprocessing time, and O(m + (c_2 log n)^k log log n) query time (c_1 and c_2 are constants); this approach is worse than the trie approach for small values of .…”
Section: Index Structures For Indeterminate Strings
Citation type: mentioning
confidence: 99%
“…There exist some solutions avoiding the convolution method as well [18][19][20]. A number of solutions exist in the literature that consider the problem of text indexing with don't cares [21][22][23][24]. Notably, in the literature, the don't cares are also referred to as wildcards.…”
Section: Introduction
Citation type: mentioning
confidence: 99%
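
As a small illustration of matching with don't cares (wildcards), as mentioned in the statement above, the sketch below treats one reserved character as matching any symbol, whether it occurs in the text or in the pattern. It is a naive scan rather than an index; the name `wildcard_occurrences` and the choice of `?` as the wildcard symbol are assumptions made for this example.

```python
WILDCARD = "?"  # assumed don't-care symbol; any character outside the alphabet works


def wildcard_occurrences(t: str, p: str, wildcard: str = WILDCARD) -> list[int]:
    """Return start positions where p matches a substring of t,
    with the wildcard character matching any single symbol.

    Naive O(len(t) * len(p)) scan; it builds no index and is not
    one of the cited solutions.
    """
    m = len(p)
    return [
        i
        for i in range(len(t) - m + 1)
        if all(
            a == b or a == wildcard or b == wildcard
            for a, b in zip(t[i:i + m], p)
        )
    ]


if __name__ == "__main__":
    print(wildcard_occurrences("abcabd", "ab?"))
```

Allowing the wildcard on either side covers both variants in one function: don't cares in the query pattern and don't cares stored in the indexed text.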