2017
DOI: 10.3389/fpls.2017.01271
|View full text |Cite
|
Sign up to set email alerts
|

De Novo Assembly of Complete Chloroplast Genomes from Non-model Species Based on a K-mer Frequency-Based Selection of Chloroplast Reads from Total DNA Sequences

Abstract: Whole Genome Shotgun (WGS) sequences of plant species often contain an abundance of reads that are derived from the chloroplast genome. Up to now these reads have generally been identified and assembled into chloroplast genomes based on homology to chloroplasts from related species. This re-sequencing approach may select against structural differences between the genomes especially in non-model species for which no close relatives have been sequenced before. The alternative approach is to de novo assemble the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
31
0
1

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 20 publications
(32 citation statements)
references
References 50 publications
0
31
0
1
Order By: Relevance
“…First and foremost, it must be decided if full chloroplast genomes are necessary or if protein-coding gene space will be adequate. Acquiring full chloroplast genomes for each sample can add extra time to data generation and analysis; however, multiple assembly pipelines (e.g., ACRE [Wysocki et al, 2014], IOGA [Bakker et al, 2016], NOVOPlasty [Dierckxsens et al, 2017], Fast-Plast [https:// github.com/mrmckain/Fast-Plast], and a k-mer-based approach [Izan et al, 2017]) have been developed that are capable of assembling complete chloroplast genomes from short-read data. Complete chloroplast genomes can potentially provide more phylogenetic signal from intergenic regions for reconstructing relationships among closely related species (Carbonell-Caballero et al, 2015).…”
Section: Chloroplast Genomes Are Readily and Inexpensively Obtainedmentioning
confidence: 99%
“…First and foremost, it must be decided if full chloroplast genomes are necessary or if protein-coding gene space will be adequate. Acquiring full chloroplast genomes for each sample can add extra time to data generation and analysis; however, multiple assembly pipelines (e.g., ACRE [Wysocki et al, 2014], IOGA [Bakker et al, 2016], NOVOPlasty [Dierckxsens et al, 2017], Fast-Plast [https:// github.com/mrmckain/Fast-Plast], and a k-mer-based approach [Izan et al, 2017]) have been developed that are capable of assembling complete chloroplast genomes from short-read data. Complete chloroplast genomes can potentially provide more phylogenetic signal from intergenic regions for reconstructing relationships among closely related species (Carbonell-Caballero et al, 2015).…”
Section: Chloroplast Genomes Are Readily and Inexpensively Obtainedmentioning
confidence: 99%
“…However, Geneious is closed source and GUI based, which is not in the scope of this study. There is also another publication describing a method for assembling chloroplasts [54]. However, the link to the software is not active anymore.…”
Section: Tool Selectionmentioning
confidence: 99%
“…The reason for that is that a mapping consensus will always adopt characteristics of the reference from which it was inferred. Structural rearrangements or additional transferred genes might easily be missed, especially when mapping short reads; the structure of repeat regions cannot be distinguished by mapping based approaches at all [86]. For the assembly of the globe artichoke (Cynara L.) cp genome with Illumina reads, [76] compared mapping and de novo approaches and found the results to be almost identical.…”
Section: Reference-based Assembly Using Nanopore Datamentioning
confidence: 99%
“…recognized an effect of too low or high coverage on assembly quality, stating that "there is a complex relationship between assembly accuracy and input read coverage when assembling chloroplast genomes with ONT data using Canu" [21]. [6] mention that de novo assembly performance could be improved by downsampling the input reads, and [86] stress that very high coverage may result in alternative assemblies which can lead to contig fragmentation. For the present study, preliminary de novo assembly tests (results not shown) with a reduced compared to the full dataset considerably decreased the number of contigs (approximately by half).…”
Section: Optimizing Nanopore De Novo and Reference-based Assemblymentioning
confidence: 99%