2023
DOI: 10.1101/2023.05.31.543043
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

kmindex and ORA: indexing and real-time user-friendly queries in terabyte-sized complex genomic datasets

Téo Lemane,
Nolan Lezzoche,
Julien Lecubin
et al.

Abstract: Despite their wealth of biological information, public sequencing databases are largely underutilized. One cannot efficiently search for a sequence of interest in these immense resources. Sophisticated computational methods such as approximate membership query data structures allow searching for fixed-length words (k-mers) in large datasets. Yet they face scalability challenges when applied to thousands of complex sequencing experiments. In this context we propose kmindex, a new approach that uses inverted ind… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 32 publications
(36 reference statements)
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?