2020
DOI: 10.1111/1755-0998.13252
|View full text |Cite
|
Sign up to set email alerts
|

Identifying the causes and consequences of assembly gaps using a multiplatform genome assembly of a bird‐of‐paradise

Abstract: Genome assemblies are currently being produced at an impressive rate by consortia and individual laboratories. The low costs and increasing efficiency of sequencing technologies now enable assembling genomes at unprecedented quality and contiguity. However, the difficulty in assembling repeat‐rich and GC‐rich regions (genomic “dark matter”) limits insights into the evolution of genome structure and regulatory networks. Here, we compare the efficiency of currently available sequencing technologies (short/linked… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

13
135
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
3
1

Relationship

3
7

Authors

Journals

citations
Cited by 118 publications
(156 citation statements)
references
References 159 publications
(302 reference statements)
13
135
0
Order By: Relevance
“…Moreover, the different coverage in different parts of ATF7IP2 suggests a divergence between the repeat units in their structure and composition. To reconstruct the exact structure and context of the ATF7IP2-derived repeat in the W chromosome of E. velox, its genomic DNA should be sequenced with a long-read technology, such as PacBio or Nanopore [69].…”
Section: Discussionmentioning
confidence: 99%
“…Moreover, the different coverage in different parts of ATF7IP2 suggests a divergence between the repeat units in their structure and composition. To reconstruct the exact structure and context of the ATF7IP2-derived repeat in the W chromosome of E. velox, its genomic DNA should be sequenced with a long-read technology, such as PacBio or Nanopore [69].…”
Section: Discussionmentioning
confidence: 99%
“…Previous studies have highlighted the importance of genome assembly quality in repeat annotation, with higher sequencing depths and long read technologies critical for resolving TEs [35,36]. Our repeat analysis reveals significant variation in total TE content between genome assemblies (Table 1, Figure 1), however some of this variation is likely due to large differences in assembly quality rather than differential TE expansions or contractions in certain lineages.…”
Section: Resultsmentioning
confidence: 65%
“…It thus seems that some parts of each genome are missing from our sequencing reads. Transposons and repeat-rich genomic regions can contribute to gaps in genomic assemblies (Peona et al 2020). Some of these missing regions could also be GC-rich, a feature which is known to cause a sequencing-bias with Illumina technology (see for example , Botero-Castro et al 2017.…”
Section: Results and Discussion First Ostracod Reference Genomes And mentioning
confidence: 99%