2012
DOI: 10.1007/978-3-642-31265-6_17
|View full text |Cite
|
Sign up to set email alerts
|

FEMTO: Fast Search of Large Sequence Collections

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2013
2013
2016
2016

Publication Types

Select...
2
1
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 16 publications
0
4
0
Order By: Relevance
“…This paper has initiated the study of a new class of pattern matching problems over “Stringomes,” and proposed several algorithmic solutions, which are instantiations of a basic data-structural scheme. The resulting algorithms are shown to be rather simple and yet efficient in space and time, so they are amenable to be implemented by using known geometric and string-matching libraries (such as LEDA and PizzaChili, just to name a few) or as an extension of the FEMTO software package [8]. The solutions proposed here have immediate applications to next-generation sequencing technologies, base-calling, variant-calling, expression analysis, population studies and onco-genomics.…”
Section: Discussionmentioning
confidence: 99%
“…This paper has initiated the study of a new class of pattern matching problems over “Stringomes,” and proposed several algorithmic solutions, which are instantiations of a basic data-structural scheme. The resulting algorithms are shown to be rather simple and yet efficient in space and time, so they are amenable to be implemented by using known geometric and string-matching libraries (such as LEDA and PizzaChili, just to name a few) or as an extension of the FEMTO software package [8]. The solutions proposed here have immediate applications to next-generation sequencing technologies, base-calling, variant-calling, expression analysis, population studies and onco-genomics.…”
Section: Discussionmentioning
confidence: 99%
“…Ferguson shows that it is even possible to execute regular expressions on data stored using an fm-index [8]. In his paper, he describes a system called femto, which can index large datasets while still maintaining adequate performance.…”
Section: Burrows-wheeler Transform and Fm-indexmentioning
confidence: 99%
“…Other recent work is by Ferguson [26], who describes a search structure called FEMTO, and provides experiments on 43 GB of English text (Project Gutenberg files), and on 182 GB of genomic data. The FEMTO system uses a partitioned FM-INDEX, with the search for each pattern proceeding through (at least) one disk block per symbol.…”
Section: B Other Recent Workmentioning
confidence: 99%