2022
DOI: 10.1101/2022.10.18.512682
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Efficient minimizer orders for large values ofkusing minimum decycling sets

Abstract: Minimizers are ubiquitously used in data structures and algorithms for efficient searching, mapping, and indexing of high-throughput DNA sequencing data. Minimizer schemes select a minimumk-mer in everyL-long sub-sequence of the target sequence, where minimality is with respect to a predefinedk-mer order. Commonly used minimizer orders select morek-mers overall than necessary and therefore provide limited improvement to runtime and memory usage of downstream analysis tasks. The recently introduced universalk-m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 44 publications
0
4
0
Order By: Relevance
“…Through the application of “random minimizers”, which employ a hashing function, it is estimated that the number of selected minimizers required is twice the minimal theoretical number. Nevertheless, by adopting advanced minimizer selection algorithms [41, 31], onecan surpass these expectations and further reduce the number of selected minimizers in practice.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Through the application of “random minimizers”, which employ a hashing function, it is estimated that the number of selected minimizers required is twice the minimal theoretical number. Nevertheless, by adopting advanced minimizer selection algorithms [41, 31], onecan surpass these expectations and further reduce the number of selected minimizers in practice.…”
Section: Methodsmentioning
confidence: 99%
“…Recent advancements in minimizer selection techniques aim to closely approach this theoretical lower bound, thus reducing the quantity of necessary minimizers. We employ decycling set minimizers [31], which minimize the count of selected minimizers, albeit at the cost of increased computational overhead.…”
Section: Minimizer Schemementioning
confidence: 99%
“…This union function is not entirely new. Although motivated by a different goal, it is suggested in [18] to use an order for minimizers which is based on the set φ u , where φ is the Mykkeltveit set.…”
Section: The Union Set and Sparse Canonicalizationmentioning
confidence: 99%
“…The Mykkeltveit and Champarnaud sets are two known construction methods for decycling sets of minimum size. Although these sets are not used on their own as sketching methods, the Mykkeltveit set in particular has been used as a starting point to define sketching methods [17, 16, 5, 18]. By construction, these sets are decycling.…”
Section: Decycling In K-nonical Spacementioning
confidence: 99%