2023
DOI: 10.14778/3598581.3598586
|View full text |Cite
|
Sign up to set email alerts
|

Text Indexing for Long Patterns: Anchors are All you Need

Abstract: In many real-world database systems, a large fraction of the data is represented by strings: sequences of letters over some alphabet. This is because strings can easily encode data arising from different sources. It is often crucial to represent such string datasets in a compact form but also to simultaneously enable fast pattern matching queries. This is the classic text indexing problem. The four absolute measures anyone should pay attention to when designing or implementing a text in… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1
1
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 97 publications
0
1
0
Order By: Relevance
“…We note that the smallest cyclic rotation appears in the literature outside this paper. It was recently used as a k -mer sampling technique (33, 34). But we are not aware of other usage of this technique in the context of exact k -mer set representation, and the most widely used k -mer transform for sequence data-structures remains the Burrows Wheeler transform (BWT).…”
Section: Methodsmentioning
confidence: 99%
“…We note that the smallest cyclic rotation appears in the literature outside this paper. It was recently used as a k -mer sampling technique (33, 34). But we are not aware of other usage of this technique in the context of exact k -mer set representation, and the most widely used k -mer transform for sequence data-structures remains the Burrows Wheeler transform (BWT).…”
Section: Methodsmentioning
confidence: 99%