2020
DOI: 10.1371/journal.pone.0226234
|View full text |Cite|
|
Sign up to set email alerts
|

Can we use it? On the utility of de novo and reference-based assembly of Nanopore data for plant plastome sequencing

Abstract: The chloroplast genome harbors plenty of valuable information for phylogenetic research. Illumina short-read data is generally used for de novo assembly of whole plastomes. PacBio or Oxford Nanopore long reads are additionally employed in hybrid approaches to enable assembly across the highly similar inverted repeats of a chloroplast genome. Unlike for Pac-Bio, plastome assemblies based solely on Nanopore reads are rarely found, due to their high error rate and non-random error profile. However, the actual qua… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

6
47
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
1
1

Relationship

1
8

Authors

Journals

citations
Cited by 40 publications
(53 citation statements)
references
References 89 publications
6
47
0
Order By: Relevance
“…Here, using only long reads and Canu, we produced the first complete circular mitochondrial genome for D. coniospora and were able to generate chromosome-scale assemblies for the nuclear genome. The rare misassembled contigs, formed by Canu because of single very long chimeric reads, as previously described [ 16 ], could be detected by read coverage anomalies and comparisons with unitigs, suggesting that solutions to avoid their creation could be implemented within Canu. The majority of reads that were flagged as chimeric arose from sequencing or polishing errors.…”
Section: Discussionmentioning
confidence: 86%
“…Here, using only long reads and Canu, we produced the first complete circular mitochondrial genome for D. coniospora and were able to generate chromosome-scale assemblies for the nuclear genome. The rare misassembled contigs, formed by Canu because of single very long chimeric reads, as previously described [ 16 ], could be detected by read coverage anomalies and comparisons with unitigs, suggesting that solutions to avoid their creation could be implemented within Canu. The majority of reads that were flagged as chimeric arose from sequencing or polishing errors.…”
Section: Discussionmentioning
confidence: 86%
“…Using low-coverage whole genome shot-gun long-read sequencing, most of the pipelines used herein retrieved a complete mitochondrial genome, as shown when these long-read assemblies were compared to a high-coverage ‘reference’ assembly generated from the same individual but with Illumina short reads. Canu was the only pipeline that failed to assemble and circularize the studied mitochondrial chromosome even when parameters were customized to optimize the assembly of short circular molecules ([ 39 ], see also [ 40 ]). Earlier versions of Canu were known to ‘have trouble’ assembling short circular molecules i.e., plasmids, in datasets with uneven coverage [ 39 ].…”
Section: Discussionmentioning
confidence: 99%
“…This difference is a missing base in a homopolymer region at approximately 669 base pairs in the MinION sequence (Additional file 1 : Figure S1). It is known that Nanopore sequencing has trouble with homopolymer sequences [ 33 , 47 49 ]. These issues can be observed in the unpolished sequence in Additional file 1 : Figure S1.…”
Section: Discussionmentioning
confidence: 99%