2018
DOI: 10.2174/1875036201811010231
|View full text |Cite
|
Sign up to set email alerts
|

The Challenge of Genome Sequence Assembly

Abstract: Background: Although whole genome sequencing is enabling numerous advances in many fields achieving complete chromosome-level sequence assemblies for diverse species presents difficulties. The problems in part reflect the limitations of current sequencing technologies. Chromosome assembly from ‘short read’ sequence data is confounded by the presence of repetitive genome regions with numerous similar sequence tracts which cannot be accurately positioned in the assembled sequence. Longer sequ… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
3
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 47 publications
0
3
0
Order By: Relevance
“…Additionally, Dvir_v2.0 has a high representation of complete BUSCOs in arthropoda_odb10 (93.6%; 804 single copy and 198 duplicated), but also the greatest proportion of fragmented BUSCOs (4.6%) compared to other coleopteran assemblies. The latter may be a consequence of the fragmented Dvir_2.0 assembly, likely resulting from a combination of using short reads for assembly [79] and the highly repetitive nature of the D. v. virgifera genome [69,80,81] (Fig S1). Thus, the length and low sequence divergence among repeat classes may have posed difficulties in assigning unique placement of reads into assembled contigs.…”
Section: Discussionmentioning
confidence: 99%
“…Additionally, Dvir_v2.0 has a high representation of complete BUSCOs in arthropoda_odb10 (93.6%; 804 single copy and 198 duplicated), but also the greatest proportion of fragmented BUSCOs (4.6%) compared to other coleopteran assemblies. The latter may be a consequence of the fragmented Dvir_2.0 assembly, likely resulting from a combination of using short reads for assembly [79] and the highly repetitive nature of the D. v. virgifera genome [69,80,81] (Fig S1). Thus, the length and low sequence divergence among repeat classes may have posed difficulties in assigning unique placement of reads into assembled contigs.…”
Section: Discussionmentioning
confidence: 99%
“…Repeats has always been the source of most mis-assemblies, although the vast majority of genes in the bacterial genome are single-copy. Assembly errors from short reads or variations resulting from different assembly methods deserve our attention [ 42 , 43 ]. And a potential mechanism for recombination and deletion events could contribute to the antigenic diversity [ 44 ].…”
Section: Discussionmentioning
confidence: 99%
“…There is evidence that SNP-based genetic linkage maps from multiple populations can, in combination, maximise genome anchoring and orientation 20,21 . We compared the ability of multiple parental maps and integrated cultivar-specific maps to increase marker density and accuracy of marker order when compared with individual linkage maps.…”
mentioning
confidence: 99%