2016
DOI: 10.1093/bioinformatics/btw279
|View full text |Cite
|
Sign up to set email alerts
|

Compacting de Bruijn graphs from sequencing data quickly and in low memory

Abstract: Motivation: As the quantity of data per sequencing experiment increases, the challenges of fragment assembly are becoming increasingly computational. The de Bruijn graph is a widely used data structure in fragment assembly algorithms, used to represent the information from a set of reads. Compaction is an important data reduction step in most de Bruijn graph based algorithms where long simple paths are compacted into single vertices. Compaction has recently become the bottleneck in assembly pipelines, and impr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
260
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
4
4
2

Relationship

2
8

Authors

Journals

citations
Cited by 209 publications
(262 citation statements)
references
References 40 publications
2
260
0
Order By: Relevance
“…Figure 1 shows the pipeline. We first self-correct the Illumina reads using Lighter [35], then build the de Bruijn graph using BCalm2 [36], align the long reads using GraphAligner with default parameters and finally extract the path as the corrected read. Due to fluctuations and biases of Illumina coverage, some genomic areas are impossible to correct with short reads even in principle.…”
Section: Error Correctionmentioning
confidence: 99%
“…Figure 1 shows the pipeline. We first self-correct the Illumina reads using Lighter [35], then build the de Bruijn graph using BCalm2 [36], align the long reads using GraphAligner with default parameters and finally extract the path as the corrected read. Due to fluctuations and biases of Illumina coverage, some genomic areas are impossible to correct with short reads even in principle.…”
Section: Error Correctionmentioning
confidence: 99%
“…Ntcard [24] is used to select the best-suited k-mer size. A compacted DBG is then constructed using Bcalm2 [25]. The Btrim [26] module cleans the graph, and the reads are finally mapped back on the de Bruijn graph using Bgreat2 [27].…”
Section: Dbg-based Reads Correctionmentioning
confidence: 99%
“…Here we only consider local schemes, and exclude global schemes, such as the counting-based orderings used in Chikhi et al (2015, 2016). …”
Section: Introductionmentioning
confidence: 99%