2021
DOI: 10.1038/s41467-020-20236-7
|View full text |Cite
|
Sign up to set email alerts
|

Efficient assembly of nanopore reads via highly accurate and intact error correction

Abstract: Long nanopore reads are advantageous in de novo genome assembly. However, nanopore reads usually have broad error distribution and high-error-rate subsequences. Existing error correction tools cannot correct nanopore reads efficiently and effectively. Most methods trim high-error-rate subsequences during error correction, which reduces both the length of the reads and contiguity of the final assembly. Here, we develop an error correction, and de novo assembly tool designed to overcome complex errors in nanopor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
188
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 259 publications
(212 citation statements)
references
References 34 publications
0
188
0
Order By: Relevance
“…To construct chromosomal assemblies for the two type-B individuals, we employed the strategy shown in supplementary figure 1 A , Supplementary Material online. First, for each specimen, we built contigs from long Nanopore reads with the NECAT assembler ( Chen et al 2021 ) and polished obtained contigs with Illumina reads using the Nextpolish software ( Hu et al 2020 ). The total lengths of the contigs greatly exceeded the estimated genome size.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…To construct chromosomal assemblies for the two type-B individuals, we employed the strategy shown in supplementary figure 1 A , Supplementary Material online. First, for each specimen, we built contigs from long Nanopore reads with the NECAT assembler ( Chen et al 2021 ) and polished obtained contigs with Illumina reads using the Nextpolish software ( Hu et al 2020 ). The total lengths of the contigs greatly exceeded the estimated genome size.…”
Section: Resultsmentioning
confidence: 99%
“…Contig assembly was performed using the NECAT program with default parameters ( Chen et al 2021 ). Then contigs were polished using Nextpolish ( Hu et al 2020 ) with Illumina reads obtained from the same specimens.…”
Section: Methodsmentioning
confidence: 99%
“…Notably, we did not find any multicopy, rearrangements, the presence of backbone sequences, or truncations in the maize genome. Although the nanopore-based sequencing technology help us acquire long sequencing reads, the error rate of nanopore reads is about 5–15% (Chen et al, 2021 ), which remains a formidable challenge for bioinformatics and for accurately determining the insertion position. However, when coupled to a linkage analysis, this may help us reliably infer the region of a given insertion position, by eliminating interference from other regions of the genome.…”
Section: Discussionmentioning
confidence: 99%
“…The first stage of our assembly approach involved comparing three long-read assemblers using the ONT data as input: NECAT v0.01 [24], Flye (Flye, RRID:SCR_017016) v2.6 [25] and Canu (Canu, RRID:SCR_015880) v1.9 [26]. The genome size parameter used for the assemblers was 1134 Mb, as previously reported for Telopea truncata [27].…”
Section: Draft Long-read and 10x Assembliesmentioning
confidence: 99%