2011
DOI: 10.1186/1471-2105-12-95
|View full text |Cite
|
Sign up to set email alerts
|

Assessing the benefits of using mate-pairs to resolve repeats in de novo short-read prokaryotic assemblies

Abstract: BackgroundNext-generation sequencing technologies allow genomes to be sequenced more quickly and less expensively than ever before. However, as sequencing technology has improved, the difficulty of de novo genome assembly has increased, due in large part to the shorter reads generated by the new technologies. The use of mated sequences (referred to as mate-pairs) is a standard means of disambiguating assemblies to obtain a more complete picture of the genome without resorting to manual finishing. Here, we exam… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
64
0
1

Year Published

2012
2012
2023
2023

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 50 publications
(65 citation statements)
references
References 26 publications
0
64
0
1
Order By: Relevance
“…If WGS is to be used for rapid clinical diagnostics and antiobiogram generation, genome finishing requiring multitudes of Sanger sequencing experiments is too laborious and costly to be applicable. Sequencing mate-pair libraries on the Illumina platform and utilizing these with the paired-end sequences in the de novo assembly would result in a "higher quality" assemblies with significant reduction in the contig numbers [33]. This would possibly result in upstream regions encompassing the IS elements in the final assemblies.…”
Section: Discussionmentioning
confidence: 99%
“…If WGS is to be used for rapid clinical diagnostics and antiobiogram generation, genome finishing requiring multitudes of Sanger sequencing experiments is too laborious and costly to be applicable. Sequencing mate-pair libraries on the Illumina platform and utilizing these with the paired-end sequences in the de novo assembly would result in a "higher quality" assemblies with significant reduction in the contig numbers [33]. This would possibly result in upstream regions encompassing the IS elements in the final assemblies.…”
Section: Discussionmentioning
confidence: 99%
“…The computational issues that repetitive genomes pose to NGS assembly has been discussed in other recent papers [2][3][4]13], but there has been remarkably little emphasis on the relative value of the portion of the genome that remains fragmented in these draft assemblies. To this end, we performed an in silico experiment using simulated long and short read data for the fully sequenced genome of Cupriavidus metallidurans CH34 (hereafter simply referred to as CH34).…”
Section: Introductionmentioning
confidence: 98%
“…These methods rely on newer sequencing chemistries [1] and highly parallel operations that result in high yields at low costs per read but so far produce considerably shorter reads (in the range of 35-500 nucleotides) than Sanger sequencing (600 to 1500 nucleotides). Shorter reads increase the required complexity of the assembly algorithms [2], although the ability to sequence to very high coverage can overcome many of the original issues in genome assembly including read errors and coverage gaps [3]. The utility of next generation sequencing has been demonstrated in examining new variants, or very close relatives, of previously sequenced strains (reviewed in [4]).…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations