2017
DOI: 10.1101/gr.215087.116
|View full text |Cite
|
Sign up to set email alerts
|

Canu: scalable and accurate long-read assembly via adaptivek-mer weighting and repeat separation

Abstract: Long-read single-molecule sequencing has revolutionized de novo genome assembly and enabled the automated reconstruction of reference-quality genomes. However, given the relatively high error rates of such technologies, efficient and accurate assembly of large repeats and closely related haplotypes remains challenging. We address these issues with Canu, a successor of Celera Assembler that is specifically designed for noisy single-molecule sequences. Canu introduces support for nanopore sequencing, halves dept… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

15
4,185
0
2

Year Published

2017
2017
2024
2024

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 6,105 publications
(4,447 citation statements)
references
References 78 publications
15
4,185
0
2
Order By: Relevance
“…The data were assembled using Canu (Koren et al, 2017) and SMARTdenovo, which represent state-of-the-art assemblers known to support Oxford nanopore sequencing technology (Istace et al, 2017). Furthermore, data were assembled with miniasm (Li, 2016), which is a fast assembler without a consensus step, thus necessitating a postassembly polishing and/or consensus step.…”
Section: Genome Assembly Strategies and Metricsmentioning
confidence: 99%
“…The data were assembled using Canu (Koren et al, 2017) and SMARTdenovo, which represent state-of-the-art assemblers known to support Oxford nanopore sequencing technology (Istace et al, 2017). Furthermore, data were assembled with miniasm (Li, 2016), which is a fast assembler without a consensus step, thus necessitating a postassembly polishing and/or consensus step.…”
Section: Genome Assembly Strategies and Metricsmentioning
confidence: 99%
“…Yet, we obtained complete and single-contig assemblies for both the PacBio and ONT E. coli long read datasets, using KD-tree together with the Canu assembler (v1.5; Koren et al, 2017). This indicates that exhaustive overlap detection is not necessary for genome assembly, especially if the overlapping is precise and sequencing depth high.…”
Section: Sensitivity and Precisionmentioning
confidence: 99%
“…They are reported in the MHAP output format, a tabular format compatible with e.g. the Canu assembler (Koren et al, 2017).…”
Section: Filtering Of Read Overlapsmentioning
confidence: 99%
“…The dataset is available from (R. Wick 2017a) and detailed in Supplementary Table 3. *or comparison Canu (Koren et al 2017) (version 1.5) assemblies and Unicycler (Wick et al 2016) (version 0.4.0) assemblies, utilising Miniasm (Li 2016) and Racon (Vaser et al 2017), post Nanopolish (https://github.com/jts/nanopolish) (version 0.7.0), created using only the long read data (R. Wick 2017b) were used. There are currently no large publicly available ONT datasets.…”
Section: Ont Samplesmentioning
confidence: 99%