Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics 2017
DOI: 10.1145/3107411.3107482
|View full text |Cite
|
Sign up to set email alerts
|

Scalable Genomic Assembly through Parallel de Bruijn Graph Construction for Multiple K-mers

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
3
2
1

Relationship

3
3

Authors

Journals

citations
Cited by 8 publications
(6 citation statements)
references
References 11 publications
0
6
0
Order By: Relevance
“…However, a reference genome is typically needed to serve as ground truth for such evaluations, making this approach infeasible for de novo sequencing tasks. To the best of our knowledge, existing tools leave the best parameter choice to the end user [9,1], although Kmergenie provides intuitive abundance histograms or heuristics to guide k-value selection, albeit for genome assembly. Further, Kmergenie does not provide the optimal k-value for different tools, only accounting for the dataset for computing the abundance histograms.…”
Section: During Genomic Error Correctionmentioning
confidence: 99%
See 3 more Smart Citations
“…However, a reference genome is typically needed to serve as ground truth for such evaluations, making this approach infeasible for de novo sequencing tasks. To the best of our knowledge, existing tools leave the best parameter choice to the end user [9,1], although Kmergenie provides intuitive abundance histograms or heuristics to guide k-value selection, albeit for genome assembly. Further, Kmergenie does not provide the optimal k-value for different tools, only accounting for the dataset for computing the abundance histograms.…”
Section: During Genomic Error Correctionmentioning
confidence: 99%
“…The search space can be large and since the cost of searching shows up as a runtime delay, it is important to reduce the time that it takes to evaluate that metric of each search point. In today's state-of-the-art, to find the best value of a configuration parameter [26,1], e.g., k-value, the method would be to pick a k (a single point in the space), run the EC tool with that value, then perform alignment (with one of several available tools such as Bowtie2), and finally compute the metric (alignment rate or EC gain) for that value. In contrast, with Athena, to explore one point in the search space, we run the EC algorithm with the k-value, and then compute the perplexity metric, which does not involve the time consuming step of alignment.…”
Section: Impact On Assembly Qualitymentioning
confidence: 99%
See 2 more Smart Citations
“…However, there are limitations to this vertical scaling up to the point where the application itself becomes the bottleneck. With the "genomical" data sizes, often noSQL backends are deployed to scale with the increasing sizes and diversity of queries, such as in genomics [101] and IoT-related domains [102]. Further, we have to keep in mind guarantees of safety, availability, and timeliness, either deterministically or stochastically in a complex environment, with humans-inthe-loop.…”
Section: Planet-scale Iotmentioning
confidence: 99%